We detected 28,194 attacks on AI agents this week. Inter-agent attacks are now a thing.

After the Claude/Anthropic incident where AI was used in a large-scale cyberattack, we've been publishing weekly threat intelligence on what's actually targeting AI agents in production.

This week (74,636 interactions monitored)

37.8% contained attack attempts
74.8% of those were cybersecurity-related (malware gen, exploits)

The new threat nobody's talking about: Inter-Agent Attacks

As people deploy multi-agent systems, attackers are sending poisoned messages designed to propagate from one agent to another. We're seeing:

Agent impersonation
Goal hijacking
Constraint removal
Recursive attack propagation

This is 3.4% of threats now, detected at 97.7% confidence.

Top attack categories

Data exfiltration (19.2%) - stealing system prompts and context
Jailbreaks (12.3%)
RAG poisoning (10.0%)
Prompt injection (8.8%)

The ClawdBot incident was the canary. If your AI can take actions, it's a target.

Full report: https://raxe.ai/threat-intelligence

Github: https://github.com/raxe-ai/raxe-ce is free for the community to use

submitted by /u/cyberamyntas
[link] [comments]