been building AI agents for a while and noticing a pattern: the LLM reasoning part works. the part that breaks is everything around accounts, logins, and verification.
agent gets to "sign up for this service" and then:
- email verification loop breaks
- OTP times out while the agent is mid-step
- captcha or bot detection fires
- session expires between steps
the model figured out what to do. the infrastructure around it didn't cooperate.
curious if this matches what others are building. where do your agents actually fail in production? is it the reasoning, or is it the plumbing?
[link] [comments]