2 links tagged with all of: reinforcement-learning + context-management
Click any tag below to further narrow down your results
Links
This article discusses the Recursive Language Model (RLM), which allows language models to manage their own context more effectively. By using Python scripts and sub-LLMs, the RLM prevents context rot and optimizes performance for long-horizon tasks. The authors present their experimental setup and findings on the RLM's capabilities.
+ recursive-language-model
context-management ✓
reinforcement-learning ✓
+ long-horizon-tasks
+ tool-use
This article discusses advancements made by Deepseek in reducing attention complexity and improving reinforcement learning training. Key points include their unique approach to context management and task/environment creation, as well as their critique of the open-source LLM landscape.