MAGI-1 is an autoregressive video generation model that creates videos by predicting a sequence of fixed-length video chunks, achieving high temporal consistency and scalability. It pairs a transformer-based variational autoencoder with a chunk-wise denoising scheme in which per-chunk noise increases monotonically over time, enabling causal temporal modeling, natural streaming generation, and controllable synthesis from text or images. The model reports state-of-the-art performance in both instruction following and physical behavior prediction.
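A minimal sketch may help make the chunk-wise idea concrete. The snippet below is not MAGI-1's actual API; the denoiser stub, tensor shapes, and the strictly sequential loop are illustrative assumptions (the real model pipelines several chunks concurrently under its monotonic noise schedule), but it shows the core pattern: each chunk is denoised conditioned on every previously completed chunk.

```python
import torch

class ToyDenoiser:
    """Stand-in for the real diffusion transformer; purely illustrative."""
    def denoise_step(self, x, t, context=None, cond=None):
        # A real model would predict noise from (x, t, context, cond);
        # here we just shrink toward zero so the loop terminates cleanly.
        return x * 0.9

def generate_video(model, text_emb=None, num_chunks=4, chunk_len=24, steps=10):
    """Chunk-wise autoregressive generation: each fixed-length chunk is
    denoised conditioned on the clean chunks generated before it, so the
    video is produced in order and can be streamed as chunks finish."""
    done = []  # clean latent chunks generated so far
    for _ in range(num_chunks):
        x = torch.randn(1, chunk_len, 16, 8, 8)  # fresh noisy latent chunk
        for t in reversed(range(steps)):
            context = torch.cat(done, dim=1) if done else None
            x = model.denoise_step(x, t, context=context, cond=text_emb)
        done.append(x)
    return torch.cat(done, dim=1)  # full latent video; a VAE would decode it

video_latents = generate_video(ToyDenoiser())
print(video_latents.shape)  # torch.Size([1, 96, 16, 8, 8])
```

Because each chunk conditions only on already-finished chunks, frames can be emitted as soon as their chunk completes, which is what makes streaming generation fall out naturally from this design.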
MingTok introduces the first continuous unified tokenizer for vision, integrating image understanding and generation within a single framework. Because both tasks share one continuous latent space, joint training converges roughly 3.5x faster, and multi-turn interactions can stay in latent space rather than repeatedly decoding to pixels and re-encoding, the costly round trip earlier unified models required between turns. Ming-UniVision, built on MingTok, harmonizes these tasks in a single model, pointing toward more natural multimodal systems.
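To illustrate what "one continuous latent space for both tasks" means, here is a hedged toy sketch. The class name, layer choices, and dimensions are hypothetical and far simpler than MingTok's actual architecture, but the interface is the point: the same continuous latents feed both an LLM-facing semantic head and a pixel decoder.

```python
import torch
import torch.nn as nn

class UnifiedVisionTokenizer(nn.Module):
    """Hypothetical sketch of a continuous unified tokenizer: one shared
    latent space serves understanding (semantic features for an LLM) and
    generation (pixel reconstruction). All names/dims are illustrative."""
    def __init__(self, latent_dim=32, sem_dim=768, patch=16):
        super().__init__()
        self.encoder = nn.Conv2d(3, latent_dim, kernel_size=patch, stride=patch)
        self.semantic = nn.Linear(latent_dim, sem_dim)
        self.decoder = nn.ConvTranspose2d(latent_dim, 3, kernel_size=patch, stride=patch)

    def encode(self, image):
        return self.encoder(image)  # continuous latents, no quantization

    def understand(self, z):
        tokens = z.flatten(2).transpose(1, 2)  # (B, N, latent_dim)
        return self.semantic(tokens)           # semantic features for the LLM

    def generate(self, z):
        return self.decoder(z)                 # decode to pixels only for display

tok = UnifiedVisionTokenizer()
img = torch.randn(1, 3, 256, 256)
z = tok.encode(img)        # (1, 32, 16, 16) shared latent
feats = tok.understand(z)  # (1, 256, 768) fed to a language model
recon = tok.generate(z)    # (1, 3, 256, 256)
print(z.shape, feats.shape, recon.shape)
```

In a multi-turn editing loop, each turn can consume and produce `z` directly; `generate()` is needed only when an image must actually be displayed, which is where the savings over decode-and-re-encode pipelines come from.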