3 links tagged with all of: language-models + transformers
Links
This article discusses RePo, a module that improves transformer-based language models by assigning semantic positions to tokens rather than relying on raw sequence order, enhancing their ability to manage context. It shows that RePo reduces the model's cognitive load, helping it better handle noisy inputs, structured data, and long contexts, and reports significant performance gains across a range of tasks.
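The core idea in the summary above, decoupling a token's position id from its raw sequence index, can be illustrated with a toy sketch. Everything here is hypothetical (the function name, the noise mask, the collapsing rule are not the paper's API); it only shows what "semantic positions" might look like: runs of filler tokens share their predecessor's position, so content tokens keep compact, contiguous position ids.

```python
# Toy illustration (not RePo's actual method): remap position ids so that
# "noise" tokens do not advance the position counter, keeping the positions
# of content tokens contiguous despite noisy interruptions.

def assign_semantic_positions(tokens, is_noise):
    """Return one position id per token; noise tokens reuse the
    position of the preceding content token."""
    positions = []
    pos = -1
    for tok, noisy in zip(tokens, is_noise):
        if not noisy:
            pos += 1              # content token: advance the counter
        positions.append(max(pos, 0))
    return positions

tokens   = ["The", "uh", "um", "cat", "sat"]
is_noise = [False, True, True, False, False]
print(assign_semantic_positions(tokens, is_noise))  # [0, 0, 0, 1, 2]
```

Under this remapping, "cat" sits at position 1 regardless of how much filler precedes it, which is one concrete way positional structure could be made robust to noisy input.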
This article introduces Mixture-of-Recursions (MoR), a framework that improves the efficiency of language models by combining parameter sharing with adaptive computation. MoR dynamically assigns a recursion depth to each token, reusing a shared block of layers, which reduces compute and memory-access costs while maintaining model performance. It reports improved validation perplexity and few-shot accuracy across a range of model sizes.
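The mechanism described above, one shared block applied a variable number of times per token, can be sketched minimally. This is an assumption-laden toy, not the paper's implementation: the `shared_block` stand-in, the threshold-style router, and the difficulty scores are all illustrative.

```python
# Hedged sketch of the Mixture-of-Recursions idea: a single shared block is
# applied recursively, and a router picks each token's recursion depth, so
# "harder" tokens get more computation with no extra parameters.

def shared_block(h):
    # Stand-in for a shared transformer block: a simple affine map.
    return 0.5 * h + 1.0

def route_depth(score, max_depth=3):
    # Hypothetical router: higher scores map to deeper recursion.
    return 1 + min(int(score * max_depth), max_depth - 1)

def mixture_of_recursions(hidden, difficulty, max_depth=3):
    out = []
    for h, d in zip(hidden, difficulty):
        depth = route_depth(d, max_depth)
        for _ in range(depth):        # adaptive per-token recursion
            h = shared_block(h)       # same parameters reused each step
        out.append(h)
    return out

hidden     = [0.0, 0.0, 0.0]
difficulty = [0.1, 0.5, 0.9]          # routing scores in [0, 1]
print(mixture_of_recursions(hidden, difficulty))  # [1.0, 1.5, 1.75]
```

The design point the sketch captures is that depth, not width or parameter count, becomes the per-token degree of freedom: all tokens share weights, but easy tokens exit after one pass while hard tokens recurse further.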
Researchers discovered that language models fail on long conversations once the initial tokens are evicted from the key-value cache: those tokens act as "attention sinks" that stabilize the attention distribution. Their solution, StreamingLLM, keeps the sink tokens in the cache permanently alongside a sliding window of recent tokens, allowing models to process sequences of over 4 million tokens effectively. This approach has been integrated into major frameworks like HuggingFace Transformers and into OpenAI's latest models.
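The cache policy summarized above reduces to a simple rule: never evict the first few tokens, keep a window of the most recent ones, and drop everything in between. A minimal sketch, with illustrative parameter names (the real systems also handle position re-indexing inside the cache, which is omitted here):

```python
# Toy sketch of the StreamingLLM-style eviction rule: retain num_sinks
# initial "attention sink" entries plus a sliding window of recent entries.

def evict_kv_cache(cache, num_sinks=4, window=8):
    """cache: list of cached per-token entries in sequence order."""
    if len(cache) <= num_sinks + window:
        return cache                          # nothing to evict yet
    return cache[:num_sinks] + cache[-window:]  # sinks + recent window

cache = list(range(20))       # stand-in for 20 cached token entries
print(evict_kv_cache(cache))  # [0, 1, 2, 3, 12, 13, 14, 15, 16, 17, 18, 19]
```

Because the cache size is now bounded by `num_sinks + window` regardless of conversation length, memory stays constant while the sink tokens keep the attention distribution stable.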