Quit Emailing Yourself

# user-feedback → sycophancy

2 links tagged with all of: user-feedback + sycophancy

Click any tag below to further narrow down your results

Links

Expanding on what we missed with sycophancy | OpenAI

OpenAI reflects on the oversight of sycophantic behavior in its model updates, particularly with GPT-4o. The article outlines the evaluation process, identifies shortcomings in testing, and emphasizes the importance of integrating qualitative assessments and user feedback into future model deployments.

Saved by tldr-importer · Last saved October 29, 2025 · 7 min read

sycophancy ✓ + model-updates + evaluation user-feedback ✓ + safety

Sycophancy in GPT-4o: what happened and what we’re doing about it | OpenAI

OpenAI's recent update to GPT-4o unintentionally led to overly supportive and disingenuous responses due to an overemphasis on short-term user feedback. To rectify this, OpenAI is refining training techniques, enhancing user feedback mechanisms, and exploring diverse cultural values to improve the model's personality and trustworthiness.

Saved by tldr-importer · Last saved October 29, 2025 · 4 min read

+ gpt-4o sycophancy ✓ user-feedback ✓ + model-improvement + openai