After the Claude/Anthropic incident where AI was used in a large-scale cyberattack, we've been publishing weekly threat intelligence on what's actually targeting AI agents in production.
This week (74,636 interactions monitored)
- 37.8% contained attack attempts
- 74.8% of those were cybersecurity-related (malware gen, exploits)
The new threat nobody's talking about: Inter-Agent Attacks
As people deploy multi-agent systems, attackers are sending poisoned messages designed to propagate from one agent to another. We're seeing:
- Agent impersonation
- Goal hijacking
- Constraint removal
- Recursive attack propagation
This is 3.4% of threats now, detected at 97.7% confidence.
Top attack categories
- Data exfiltration (19.2%) - stealing system prompts and context
- Jailbreaks (12.3%)
- RAG poisoning (10.0%)
Prompt injection (8.8%)
The ClawdBot incident was the canary. If your AI can take actions, it's a target.
Full report: https://raxe.ai/threat-intelligence
Github: https://github.com/raxe-ai/raxe-ce is free for the community to use
[link] [comments]