1 link tagged with all of: multimodal + language-model + image-processing
Click any tag below to further narrow down your results
Links
The GLM-4.6V series introduces two open-source multimodal models, designed for both high-performance cloud use and local deployment. It features a 128k token context window and native tool calling, enabling seamless integration of visual and textual inputs for tasks like content creation and web search.