We detected 28,194 attacks on AI agents this week. Inter-agent attacks are now a thing.
We detected 28,194 attacks on AI agents this week. Inter-agent attacks are now a thing.

We detected 28,194 attacks on AI agents this week. Inter-agent attacks are now a thing.

After the Claude/Anthropic incident where AI was used in a large-scale cyberattack, we've been publishing weekly threat intelligence on what's actually targeting AI agents in production.

This week (74,636 interactions monitored)

  • 37.8% contained attack attempts
  • 74.8% of those were cybersecurity-related (malware gen, exploits)

The new threat nobody's talking about: Inter-Agent Attacks

As people deploy multi-agent systems, attackers are sending poisoned messages designed to propagate from one agent to another. We're seeing:

  1. Agent impersonation
  2. Goal hijacking
  3. Constraint removal
  4. Recursive attack propagation

This is 3.4% of threats now, detected at 97.7% confidence.

Top attack categories

  1. Data exfiltration (19.2%) - stealing system prompts and context
  2. Jailbreaks (12.3%)
  3. RAG poisoning (10.0%)
  4. Prompt injection (8.8%)

    The ClawdBot incident was the canary. If your AI can take actions, it's a target.

Full report: https://raxe.ai/threat-intelligence

Github: https://github.com/raxe-ai/raxe-ce is free for the community to use

submitted by /u/cyberamyntas
[link] [comments]