Click any tag below to further narrow down your results
Links
Resemble AI has launched DETECT-3B Omni, a deepfake detection model that analyzes audio, images, and video using a unified system. It boasts enhanced capabilities over its predecessor, DETECT-2B, including expanded training data, support for over 40 languages, and protections against modern threats like replay attacks. The model ranks highly on various benchmarks for its detection accuracy across multiple media types.
Google is testing a new “Lecture” format for its NotebookLM audio overviews, allowing for 30-minute AI-generated lectures in various languages. This feature aims to assist students and professionals in efficiently reviewing dense material. A British English voice is expected to be included by 2026.
Jony Ive and Sam Altman are reportedly developing an AI audio gadget called "Sweetpea," aimed at replacing AirPods. This device, resembling earpieces and designed to be worn behind the ear, may feature a voice assistant powered by ChatGPT, but details on its capabilities remain unclear.
Resemble AI has launched DETECT-3B Omni, a deepfake detection model that analyzes audio, images, and video through a single API. It improves upon its predecessor with expanded training data, increased language support, and enhanced protection against modern synthetic media threats. The model achieves top performance benchmarks across all modalities.
Google announced upgrades to its Gemini 2.5 text-to-speech models, focusing on expressivity, pacing, and multi-speaker capabilities. These changes improve control over tone and style, making it easier for developers to create realistic audio content. The updated models are available in Google AI Studio.
Apple has purchased the Israeli AI startup Q.ai for nearly $2 billion to enhance its audio technology, particularly in interpreting whispered speech and improving sound quality in noisy settings. This marks Apple's second-largest acquisition, following its purchase of Beats Electronics in 2014. The Q.ai team, including CEO Aviad Maizels, will join Apple as part of the deal.
Veo 3.1 enhances the Flow AI filmmaking tool by introducing advanced audio capabilities and improved editing features, providing users with greater artistic control over their videos. New functionalities include "Ingredients to Video," "Frames to Video," and "Extend," allowing for more seamless scene transitions and longer shots, while also enabling precise edits like inserting or removing elements in a scene. These updates aim to enrich video storytelling and creativity within Flow.
Meta has acquired Waveforms, an AI audio startup, to enhance its audio technology and offerings. This acquisition is expected to bolster Meta's capabilities in creating advanced audio experiences for its platforms.