3 links tagged with all of: machine-learning + autoregressive
Click any tag below to further narrow down your results
Links
DFlash introduces a lightweight block diffusion model that enhances speculative decoding by enabling faster and more accurate parallel drafting. It combines the speed of diffusion models with the verification strength of autoregressive models, achieving significant performance improvements over existing methods like EAGLE-3. The approach demonstrates how to leverage the benefits of both model types without sacrificing quality.
MAGI-1 is an autoregressive video generation model that creates videos by predicting sequences of fixed-length video chunks, achieving high temporal consistency and scalability. It incorporates innovations such as a transformer-based variational autoencoder and a unique denoising algorithm, enabling efficient and controllable video generation from text or images. The model has shown state-of-the-art performance in both instruction following and physical behavior prediction compared to existing models.
The article presents a framework for continuous visual autoregressive generation via score maximization, which is theoretically grounded in strictly proper scoring rules. It highlights the use of likelihood-free learning with an energy Transformer, showcasing competitive performance in generation quality and inference efficiency while addressing limitations of existing methods. The repository includes instructions for setting up the environment, training models, and evaluating performance using provided scripts.