Max Zeff Profile picture
Senior Writer covering AI @WIRED, author of the Model Behavior newsletter | Formerly @TechCrunch, @Gizmodo, @markets | DM me off the record on Signal @ mzeff.88
Jun 11 4 tweets 2 min read
NEW: Anthropic is walking back Claude Fable 5's policy to covertly degrade performance for competing AI researchers, after facing fierce backlash.

“We’re changing Fable 5’s safeguards for frontier LLM development to make them visible,” Anthropic tells WIRED. “We made the wrong tradeoff and we apologize for not getting the balance right.”Image Here's the new policy:

"Starting this week, flagged requests will visibly fall back to Opus 4.8. On the API, any flagged requests will return a reason for their refusal. You will see this every time it happens."