| Fable 5 just got redeployed today (July 1) after a wild few weeks. Quick recap for those who missed it: Anthropic released Fable 5 on June 9, the US government slapped export controls on it June 12 because Amazon researchers found a jailbreak that let it identify software vulnerabilities and produce exploit code, and now the controls have been lifted. But here’s the part nobody is really talking about: Fable 5 is not a drop-in upgrade for coding workflows. Anthropic explicitly says in their blog post that the new improved safety classifier they trained to address the jailbreak “comes at the cost of flagging benign requests more often during routine coding and debugging tasks.” This is by design, not a bug. They intentionally built Fable 5 with a much larger “safety margin” than any previous model. The classifier is tuned to treat ambiguous cybersecurity-adjacent requests as potentially harmful and a lot of normal coding falls into that gray zone. Think: asking about memory vulnerabilities, debugging low-level code, security tooling, anything that could look like recon for an exploit. So practically speaking: -Fable 5 is likely incredible for writing, reasoning, and research -For security engineers or systems programmers, expect frustrating false positives -Anthropic admits this and says they’ll keep refining it, but no timeline The honest framing here is that Fable 5’s power comes paired with restrictions that make it less useful for a specific class of developers – the exact ones who would benefit most from a more capable model. Curious if others hit these blocks in early access. What kinds of prompts triggered it? [link] [comments] |