Quit Emailing Yourself

# multimodal → api

3 links tagged with all of: multimodal + api

Click any tag below to further narrow down your results

Links

Gemini 3 Developer Guide | Gemini API | Google AI for Developers

Gemini 3 is Google's latest AI model series focused on advanced reasoning and multimodal tasks. It includes different versions like Pro, Flash, and Pro Image, each tailored for specific needs. The article covers key features, API usage, pricing, and new parameters for controlling model behavior.

Saved by tldr-importer · Last saved February 14, 2026 · 6 min read

+ gemini-3 + ai-models + reasoning api ✓ multimodal ✓

Thread by @Zai_org on Thread Reader App

The article discusses the launch of GLM-4.6V and GLM-4.5V, two advanced vision-language models. GLM-4.6V features a 128K context and supports multimodal inputs, while GLM-4.5V excels in visual reasoning across various benchmarks. Both models offer distinct capabilities for image and video analysis.

Saved by tldr-importer · Last saved February 14, 2026 · 2 min read

+ glm-4.6v + glm-4.5v + visual-reasoning multimodal ✓ api ✓

Z.AI launches GLM-4.7, new SOTA open-source model for coding

Zhipu AI has released GLM-4.7, a new version of its General Language Model designed for advanced coding and multimodal tasks. It improves reasoning capabilities and supports both text and vision inputs, making it suitable for developers and enterprises. The model features enhanced APIs for real-time and batch processing, aligning with demands for more sophisticated AI applications.

Saved by tldr-importer · Last saved February 14, 2026 · 1 min read

+ zhipu-ai + glm-4-7 + coding multimodal ✓ api ✓