GLM-OCR is a multimodal optical character recognition (OCR) model for complex document understanding. Built on the GLM-V architecture, it uses a two-stage pipeline, layout analysis followed by recognition, and achieves high accuracy in varied real-world scenarios. The model is open source and comes with an SDK for integration.
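To make the two-stage idea concrete, here is a minimal Python sketch of how a layout-analysis stage can feed a recognition stage. It does not use the GLM-OCR SDK: `Region`, `Recognized`, `run_two_stage`, `toy_layout`, and `toy_recognize` are hypothetical names introduced only for illustration, and the top-to-bottom reading order is an assumption rather than documented model behavior.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Bounding box as (x0, y0, x1, y1) in pixels; an assumed convention.
Box = Tuple[int, int, int, int]


@dataclass
class Region:
    kind: str  # e.g. "title", "paragraph", "table"
    box: Box


@dataclass
class Recognized:
    region: Region
    text: str


def run_two_stage(
    page: object,
    analyze_layout: Callable[[object], List[Region]],
    recognize: Callable[[object, Region], str],
) -> List[Recognized]:
    """Stage 1 finds regions; stage 2 recognizes each region's content."""
    regions = analyze_layout(page)
    # Assumed reading order: top to bottom, then left to right.
    ordered = sorted(regions, key=lambda r: (r.box[1], r.box[0]))
    return [Recognized(r, recognize(page, r)) for r in ordered]


# Toy stand-ins for the two stages (hypothetical, not real models).
def toy_layout(page: object) -> List[Region]:
    return [
        Region("paragraph", (0, 40, 200, 80)),
        Region("title", (0, 0, 200, 30)),
    ]


def toy_recognize(page: object, region: Region) -> str:
    return f"<{region.kind} text>"


results = run_two_stage(None, toy_layout, toy_recognize)
for item in results:
    print(item.region.kind, item.region.box, item.text)
```

Passing the two stages in as callables keeps the sketch independent of any particular model: a real layout detector and recognizer could be swapped in for the toy functions without changing `run_two_stage`.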