1 link tagged with all of: api + visual-reasoning + multimodal + glm-4.6v + glm-4.5v
Links
The article covers the launch of GLM-4.6V and GLM-4.5V, two vision-language models. GLM-4.6V has a 128K-token context window and accepts multimodal inputs, while GLM-4.5V is described as performing strongly on visual reasoning benchmarks. Each model offers its own capabilities for image and video analysis.