More on the topic...
Generating detailed summary...
Failed to generate summary. Please try again.
Anthropic has launched Opus 4.5, its latest model, which shows significant improvements in coding tasks and user experience. A key feature is the enhanced conversation memory across all Claude models in consumer apps. Previously, conversations could abruptly end when they hit a hard limit of 200,000 tokens. Instead of trimming older messages, Claude would just stop. Now, it summarizes key points from earlier messages to maintain a coherent flow, which should improve usability for those engaged in longer chats.
In terms of performance, Opus 4.5 has achieved an accuracy score of 80.9 percent on the SWE-Bench Verified benchmark, surpassing OpenAI’s GPT-5.1-Codex-Max at 77.9 percent and Google's Gemini 3 Pro at 76.2 percent. It excels particularly in agentic coding and tool use but still trails behind GPT-5.1 in visual reasoning tasks. This marks a significant step for Anthropic in competing with leading models from OpenAI and Google, emphasizing both performance and user engagement in a rapidly evolving AI landscape.
Questions about this article
No questions yet.