open-source

# benchmarks → open-source

2 links tagged with all of: benchmarks + open-source

Click any tag below to further narrow down your results

Links

Freedium

Chandra OCR 2, a 4 billion-parameter model from Datalab, outperforms GPT-4o and Gemini on AllenAI’s olmOCR benchmark and a 90-language test while halving the model size. It preserves layout, reads complex tables and math notation, converts diagrams to Mermaid, and runs at two pages per second on an NVIDIA H100. The code is Apache 2.0 but the model weights use an OpenRAIL-M license with commercial restrictions.

Saved by mark · Last saved April 22, 2026 · 5 min read

+ ocr open-source ✓ benchmarks ✓ + document-processing + multilingual

RIP Commercial OCR. An Open-Source Model Just Topped Every Benchmark. | by Sumit Pandey | Apr, 2026 | Towards Deep Learning

A new open-source OCR model outperformed all major commercial tools on standard text and handwriting tests. It accurately transcribed a 1913 handwritten letter by Ramanujan, preserving layout, math notation, and faint ink details.

Saved by mark · Last saved April 22, 2026 · 1 min read

+ ocr open-source ✓ benchmarks ✓ + handwritten-text + machine-learning