1 link tagged with all of: open-source + document-processing + machine-learning + ocr + sdk
Links
GLM-OCR is a multimodal optical character recognition (OCR) model designed for complex document understanding. Built on the GLM-V architecture, it features a robust two-stage pipeline for layout analysis and recognition, achieving high accuracy in varied real-world scenarios. The model is open-sourced and comes with an easy-to-use SDK for integration.
ocr ✓
document-processing ✓
machine-learning ✓
sdk ✓
open-source ✓