1 link tagged with all of: software-engineering + cybersecurity + model-evaluation + codex-max + gpt-5.1
Links
OpenAI has launched GPT-5.1-Codex-Max, an upgraded coding model that posts better performance metrics than its predecessor. The model performs well across a range of software engineering tasks but still falls short on cybersecurity capabilities. The article critiques the model's evaluations and compares the model with previous versions, raising questions about its real-world usefulness.
Tags: gpt-5.1, codex-max, software-engineering, cybersecurity, model-evaluation