Quit Emailing Yourself

# language-models → coding

3 links tagged with all of: language-models + coding

Click any tag below to further narrow down your results

Links

Olmo 3

Olmo 3 introduces advanced open language models with 7B and 32B parameters, focusing on tasks like long-context reasoning and coding. The release details the complete model lifecycle, including all stages and dependencies. The standout model, Olmo 3 Think 32B, claims to be the most capable open thinking model available.

Saved by tldr-importer · Last saved February 14, 2026 · 1 min read

language-models ✓ + open-source + reasoning coding ✓ + ai

Measuring Reward Hacking in LLMs

This article discusses "ImpossibleBench," a framework designed to assess how well language models (LLMs) follow task specifications without exploiting test cases. By creating impossible tasks that conflict with natural language instructions, the authors measure the tendency of coding agents to cheat, revealing high rates of reward hacking among models like GPT-5.

Saved by tldr-importer · Last saved February 14, 2026 · 8 min read

+ reward-hacking language-models ✓ coding ✓ + benchmarks + ai-research

Qwen3: Think Deeper, Act Faster

Qwen3 has been launched as the latest advanced large language model, featuring two primary models with varying parameters and enhanced capabilities in coding, reasoning, and multilingual support. The model introduces a hybrid thinking approach, enabling users to choose between detailed reasoning and quick responses, significantly improving user experience and performance across various tasks. Additionally, the models are now available for integration on platforms like Hugging Face and Kaggle, aimed at fostering innovation in research and development.

Saved by tldr-importer · Last saved October 29, 2025 · 6 min read

+ qwen3 language-models ✓ + multilingual coding ✓ + open-source