1 link tagged with all of: evaluation + ai + claude + safety
Links
This article reviews the Claude Opus 4.6 system card, highlighting new features such as a 1M-token context window and upgraded model capabilities. It raises concerns about the evaluation process, the safety protocols, and the growing reliance on the model's own self-assessment.