Quit Emailing Yourself

# machine-learning → ocr → document-processing

4 links tagged with all of: machine-learning + ocr + document-processing

Links

How Grab Built a Vision LLM to Scan Images

Grab built a specialized Vision LLM to improve the accuracy of information extraction from user documents for eKYC verification. After running into limitations with traditional OCR systems, they fine-tuned existing models and ultimately created a model that can process Southeast Asian languages and diverse document formats. The article details their technical approach and training methods.

Saved by tldr-importer · Last saved February 14, 2026 · 6 min read

+ vision-llm ocr ✓ + southeast-asian document-processing ✓ machine-learning ✓

How we built a custom vision LLM to improve document processing at Grab

Grab developed a specialized Vision LLM to enhance document processing for eKYC in Southeast Asia. The project focused on improving OCR accuracy for diverse languages and document formats, ultimately creating a lightweight model tailored to their needs.

Saved by tldr-importer · Last saved February 14, 2026 · 6 min read

+ vision-llm document-processing ✓ ocr ✓ + southeast-asia machine-learning ✓

zai-org/GLM-OCR · Hugging Face

GLM-OCR is a multimodal optical character recognition (OCR) model designed for complex document understanding. Built on the GLM-V architecture, it features a robust two-stage pipeline for layout analysis and recognition, achieving high accuracy in varied real-world scenarios. The model is open-sourced and comes with an easy-to-use SDK for integration.

Saved by tldr-importer · Last saved February 14, 2026 · 3 min read

ocr ✓ document-processing ✓ machine-learning ✓ + sdk + open-source

Nanonets OCR Small

Nanonets has launched Nanonets-OCR-s, an image-to-markdown OCR model that recognizes document structure and content and produces formatted markdown output suitable for downstream processing. The model handles complex elements such as LaTeX equations, images, signatures, and tables, which makes it useful in academic, legal, healthcare, and corporate settings.

Saved by tldr-importer · Last saved October 29, 2025 · 2 min read

ocr ✓ + markdown document-processing ✓ machine-learning ✓ + automation