Show HN: A tool to benchmark LLM APIs (OpenAI, Claude, local/self-hosted)

I recently built a small open-source tool to benchmark different LLM API endpoints — including OpenAI, Claude, and self-hosted models (like llama.cpp).

It runs a configurable number of test requests and reports two key metrics: • First-token latency (ms): How long it takes for the first token to appear • Output speed (tokens/sec): Overall output fluency

Demo: https://llmapitest.com/ Code: https://github.com/qjr87/llm-api-test

The goal is to provide a simple, visual, and reproducible way to evaluate performance across different LLM providers, including the growing number of third-party “proxy” or “cheap LLM API” services.

It supports: • OpenAI-compatible APIs (official + proxies) • Claude (via Anthropic) • Local endpoints (custom/self-hosted)

You can also self-host it with docker-compose. Config is clean, adding a new provider only requires a simple plugin-style addition.

Would love feedback, PRs, or even test reports from APIs you’re using. Especially interested in how some lesser-known services compare.


Comments URL: https://news.ycombinator.com/item?id=44413921

Points: 3

# Comments: 0

https://llmapitest.com/

Vytvorené 2d | 29. 6. 2025, 18:20:10


Ak chcete pridať komentár, prihláste sa

Ostatné príspevky v tejto skupine

Ask HN: Freelancer? Seeking freelancer? (July 2025)

Please lead with either SEEKING WORK or SEEKING FREELANCER, your location, and whether remote work is a possibility.

Please only post if you are personally looking to hire a freelancer or work a

1. 7. 2025, 16:40:26 | Hacker news
Ask HN: Who is hiring? (July 2025)

Please state the location and include REMOTE for remote work, REMOTE (US) or similar if the country is restricted, and ONSITE when remote work is not an option.

Please only post if you pe

1. 7. 2025, 16:40:24 | Hacker news
Show HN: ToplingDB - A Persistent Key-Value Store for External Storage

As the creator of TerarkDB (acquired by ByteDance in 2019), I have developed ToplingDB in recent years.

ToplingDB is forked from RocksDB, where we have replaced almost all components with mo

1. 7. 2025, 14:30:09 | Hacker news