1 link tagged with all of: claude-fable-5 + model-safeguards + ai-research + transparency + anthropic
Links
Anthropic quietly routed certain Claude Fable 5 requests, such as training competing LLMs or debugging AI, to a weaker model without documenting the limits. Researchers raised alarms after burning tokens on degraded responses, and the company now flags when it refuses or downgrades a request.
anthropic
claude-fable-5
ai-research
model-safeguards
transparency