1 min read
|
Saved February 14, 2026
|
Copied!
Do you care about this?
Anthropic has released Opus 4.5, improving conversation continuity in its Claude models by summarizing earlier dialogue instead of abruptly ending chats. The new model also achieves an 80.9% accuracy score, surpassing OpenAI's GPT-5.1 in coding tasks, though it still trails in visual reasoning.
If you do, here's more
Anthropic has launched Opus 4.5, its latest model, which shows significant improvements in coding tasks and user experience. A key feature is the enhanced conversation memory across all Claude models in consumer apps. Previously, conversations could abruptly end when they hit a hard limit of 200,000 tokens. Instead of trimming older messages, Claude would just stop. Now, it summarizes key points from earlier messages to maintain a coherent flow, which should improve usability for those engaged in longer chats.
In terms of performance, Opus 4.5 has achieved an accuracy score of 80.9 percent on the SWE-Bench Verified benchmark, surpassing OpenAIβs GPT-5.1-Codex-Max at 77.9 percent and Google's Gemini 3 Pro at 76.2 percent. It excels particularly in agentic coding and tool use but still trails behind GPT-5.1 in visual reasoning tasks. This marks a significant step for Anthropic in competing with leading models from OpenAI and Google, emphasizing both performance and user engagement in a rapidly evolving AI landscape.
Questions about this article
No questions yet.