ConceptAttention is an interpretability method for multi-modal diffusion transformers, implemented in PyTorch for the Flux DiT architecture. The article provides installation instructions and a code example that generates images together with per-concept attention heatmaps, and it links to the associated research paper for further details.
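For orientation, here is a minimal sketch of the core idea: saliency maps come from projecting image-patch outputs of the DiT's attention layers onto the corresponding concept-token outputs. The function name, tensor shapes, and concept list below are illustrative assumptions, not the repository's actual API.

```python
import torch

def concept_heatmaps(image_tokens: torch.Tensor,
                     concept_tokens: torch.Tensor,
                     grid_size: int) -> torch.Tensor:
    """Hypothetical sketch: project image-patch attention outputs onto
    concept-token outputs to obtain per-concept saliency maps.

    image_tokens:   (num_patches, d) attention-layer outputs for image patches
    concept_tokens: (num_concepts, d) attention-layer outputs for concept tokens
    grid_size:      side length of the latent patch grid (num_patches == grid_size**2)

    Returns a (num_concepts, grid_size, grid_size) stack of heatmaps.
    """
    # Dot-product similarity between every patch and every concept,
    # computed in the attention output space rather than pixel space.
    scores = concept_tokens @ image_tokens.T      # (num_concepts, num_patches)
    # Softmax over concepts softly assigns each patch to one concept.
    maps = scores.softmax(dim=0)
    return maps.reshape(-1, grid_size, grid_size)

# Toy usage with random features standing in for real DiT attention outputs.
d = 64
image_tokens = torch.randn(16 * 16, d)    # a 16x16 latent patch grid
concept_tokens = torch.randn(3, d)        # e.g. "sky", "dragon", "rocks"
print(concept_heatmaps(image_tokens, concept_tokens, 16).shape)  # (3, 16, 16)
```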
Representation Autoencoders (RAEs) improve diffusion transformers by pairing a frozen pretrained encoder with a lightweight trained decoder, yielding better image generation than the traditional SD-VAE pipeline. The study finds that RAEs reconstruct images with high fidelity, and that for stable training the transformer's width must match or exceed the encoder's token dimension. The proposed DiTDH variant, which adds a wide diffusion head, is markedly more compute-efficient and sets new state-of-the-art scores on image generation benchmarks.
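As a rough illustration of the RAE recipe, the sketch below pairs a frozen encoder with a small transformer decoder trained to map tokens back to pixels. `DummyEncoder`, the layer sizes, and the patch arithmetic are all assumptions for demonstration, not the paper's implementation.

```python
import torch
import torch.nn as nn

class DummyEncoder(nn.Module):
    """Stand-in for a frozen pretrained ViT: maps (B, 3, 224, 224) images
    to (B, 196, 768) patch tokens. A real RAE would use an actual
    pretrained representation model here."""
    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return torch.randn(x.shape[0], 196, 768)

class RepresentationAutoencoder(nn.Module):
    """RAE sketch: frozen pretrained encoder + lightweight trained decoder."""
    def __init__(self, encoder: nn.Module, token_dim: int,
                 patch_size: int = 16, depth: int = 4):
        super().__init__()
        self.encoder = encoder.eval()
        for p in self.encoder.parameters():        # encoder is never trained
            p.requires_grad_(False)
        layer = nn.TransformerEncoderLayer(d_model=token_dim, nhead=8,
                                           batch_first=True)
        self.decoder = nn.TransformerEncoder(layer, num_layers=depth)
        self.to_pixels = nn.Linear(token_dim, 3 * patch_size ** 2)
        self.patch_size = patch_size

    def forward(self, images: torch.Tensor) -> torch.Tensor:
        with torch.no_grad():                      # representation space is fixed
            tokens = self.encoder(images)          # (B, N, token_dim)
        tokens = self.decoder(tokens)              # lightweight decoding pass
        patches = self.to_pixels(tokens)           # (B, N, 3 * p * p)
        b, n, _ = patches.shape
        side, p = int(n ** 0.5), self.patch_size
        # Fold the per-token patches back into a (B, 3, H, W) image.
        patches = patches.reshape(b, side, side, 3, p, p)
        return patches.permute(0, 3, 1, 4, 2, 5).reshape(b, 3, side * p, side * p)

rae = RepresentationAutoencoder(DummyEncoder(), token_dim=768)
print(rae(torch.randn(2, 3, 224, 224)).shape)      # torch.Size([2, 3, 224, 224])
```

The diffusion transformer then operates directly on the encoder's tokens; the width finding above says its hidden size should be at least `token_dim` (768 in this sketch).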