2 links tagged with all of: reinforcement-learning + software-engineering + coding
Links
The article discusses the release of SWE-1.5, a new coding agent that balances speed and performance through a unified system. It highlights the development process, including reinforcement learning and custom coding environments, which improve task execution and code quality. SWE-1.5 aims to surpass previous models in both speed and effectiveness.
Kimi-Dev-72B is an open-source coding language model designed for software engineering tasks, achieving a state-of-the-art 60.4% on the SWE-bench Verified benchmark. It uses large-scale reinforcement learning to autonomously patch real repositories, and it rewards a patch only when the full test suite passes, which pushes the model toward correct solutions. The model is available for download on Hugging Face and GitHub, and developers and researchers are encouraged to explore it and contribute.
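The test-suite-gated reward described in the Kimi-Dev-72B summary can be illustrated with a minimal sketch. This is not the project's actual training code: the function name `outcome_reward` and the dictionary of per-test results are hypothetical, chosen only to show what an all-or-nothing reward looks like.

```python
from typing import Mapping


def outcome_reward(test_results: Mapping[str, bool]) -> float:
    """Hypothetical all-or-nothing reward for a candidate patch.

    test_results maps a test name to True (passed) or False (failed).
    The reward is 1.0 only when at least one test ran and every test
    passed; any failure, or an empty run, yields 0.0.
    """
    if not test_results:
        # No tests ran, so there is no evidence that the patch works.
        return 0.0
    return 1.0 if all(test_results.values()) else 0.0


# A patch that fixes one test but breaks another earns nothing.
partial = {"test_parse": True, "test_render": False}
complete = {"test_parse": True, "test_render": True}
print(outcome_reward(partial), outcome_reward(complete))
```

Under this assumption there is no partial credit, so a patch that passes most tests is scored the same as one that passes none.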