3 links
tagged with audio
Click any tag below to further narrow down your results
Links
The article introduces SuperSonic, a web-based implementation of SuperCollider's audio synthesis engine, allowing users to run scsynth directly in their browser without installation. It provides instructions for integrating the SuperSonic module into web pages and accessing SuperCollider's OSC API for audio synthesis. Users can load synth definitions from Sonic Pi and send OSC commands to create and manipulate audio in real-time.
The article introduces Ovi, a video and audio generation model developed by Character AI, which can create synchronized content from text or text-image inputs. Ovi supports various resolutions and aspect ratios, offers a user-friendly experience with example prompts, and is designed for high-quality audio and video outputs. It also provides integration options and a roadmap for future improvements.
The article discusses the challenges and advancements in integrating neural audio codecs with language models (LLMs) to enhance speech understanding and generation. It highlights the limitations of current speech LLMs, which often rely on text transcriptions, and proposes using audio encoders and decoders to improve audio continuity and comprehension. The author explains how neural audio codecs can help streamline audio data processing for better predictive capabilities in speech models.