1 link tagged with all of: safety + model-updates + evaluation
Links
OpenAI reflects on how it overlooked sycophantic behavior in its model updates, particularly with GPT-4o. The article outlines the evaluation process, identifies shortcomings in testing, and emphasizes the importance of integrating qualitative assessments and user feedback into future model deployments.