multilingual

2 links tagged with multilingual


Links

Fluid, natural voice translation with Gemini 3.5 Live Translate

Google’s new Gemini 3.5 Live Translate model converts speech to speech in real time across more than 70 languages, preserving speakers’ intonation, pacing and pitch. It streams audio continuously with minimal delay and is available via the Gemini Live API, Google Meet preview, and Google Translate apps.

Last saved Jun 18, 2026 · 3 min read

Tags: live-translation, speech-translation, multilingual, real-time-ai, gemini-models, tldr-a-byte-sized-daily-tech-newsletter

Freedium

Chandra OCR 2, a 4-billion-parameter model from Datalab, outperforms GPT-4o and Gemini on AllenAI’s olmOCR benchmark and a 90-language test while halving the model size. It preserves layout, reads complex tables and math notation, converts diagrams to Mermaid, and runs at two pages per second on an NVIDIA H100. The code is licensed under Apache 2.0, but the model weights use an OpenRAIL-M license with commercial restrictions.

Last saved Apr 22, 2026 · 5 min read

Tags: ocr, open-source, benchmarks, document-processing, multilingual