1 link tagged with all of: multimodal + glm-4.5v + visual-reasoning
Click any tag below to further narrow down your results
Links
The article discusses the launch of GLM-4.6V and GLM-4.5V, two advanced vision-language models. GLM-4.6V features a 128K context and supports multimodal inputs, while GLM-4.5V excels in visual reasoning across various benchmarks. Both models offer distinct capabilities for image and video analysis.