Show HN: Qwen-2.5-32B is now the best open source OCR model

Last week was big for open source LLMs. We got:

- Qwen 2.5 VL (72b and 32b)

- Gemma-3 (27b)

- DeepSeek-v3-0324

And a couple weeks ago we got the new mistral-ocr model. We updated our OCR benchmark to include the new models.

We evaluated 1,000 documents for JSON extraction accuracy. Major takeaways:

- Qwen 2.5 VL (72b and 32b) are by far the most impressive. Both landed right around 75% accuracy (equivalent to GPT-4o’s performance). Qwen 72b was only 0.4% above 32b. Within the margin of error.

- Both Qwen models passed mistral-ocr (72.2%), which is specifically trained for OCR.

- Gemma-3 (27B) only scored 42.9%. Particularly surprising given that it's architecture is based on Gemini 2.0 which still tops the accuracy chart.

The data set and benchmark runner is fully open source. You can check out the code and reproduction steps here:

- https://getomni.ai/blog/benchmarking-open-source-models-for-...

- https://github.com/getomni-ai/benchmark

- https://huggingface.co/datasets/getomni-ai/ocr-benchmark

Comments URL: https://news.ycombinator.com/item?id=43549072

Points: 61

# Comments: 13

https://github.com/getomni-ai/benchmark/blob/main/README.md

созданный 1mo | 1 апр. 2025 г., 21:40:16

Войдите, чтобы добавить комментарий

Другие сообщения в этой группе

DoomArena: A Framework for Testing AI Agents Against Evolving Security Threats

Article URL: https://arxiv.org/abs/2504.14064

Comments URL: https://news.ycombinator.c

6 мая 2025 г., 18:30:12 | Hacker news

Launch HN: Exa (YC S21) – The web as a database

Hey HN! We’re Will and Jeff from Exa (https://exa.ai). We recently launched Exa Websets, an embeddings-powered search engine designed to return exactly what you’re ask

6 мая 2025 г., 18:30:10 | Hacker news

Curl: We still have not seen a single valid security report done with AI help

Article URL: https://www.linkedin.com/posts/danielstenberg_hackerone-curl-activity-73248208

6 мая 2025 г., 18:30:08 | Hacker news

Is Planet Nine Alone in the Outer System?

Article URL: https://www.centauri-dreams.org/2025/05/06/is-planet-nine-alone-in-the-outer-system/

6 мая 2025 г., 18:30:07 | Hacker news

Matt Godbolt sold me on Rust (by showing me C++)

Article URL: https://www.collabora.com/news-and-blog/blog/2025/05/06/matt-

6 мая 2025 г., 18:30:06 | Hacker news

Show HN: Fast parser and generator for RSS, Atom, OPML and popular namespaces

Hi HN! While working on a project that involves frequently parsing a lot of feeds, I needed a fast JavaScript-based parser to extract specific fields from feed namespaces. Existing Node packages w

6 мая 2025 г., 18:30:06 | Hacker news

Dreariness Index (2015)

Article URL: http://us-climate.blogspot.com/2015/03/dreary-weather.html

Comments URL:

6 мая 2025 г., 16:10:24 | Hacker news

Techie