/u/docybo

We added cryptographic approval to our AI agent… and it was still unsafe

We’ve been working on adding “authorization” to an AI agent system. At first, it felt solved:
– every action gets evaluated
– we get a signed ALLOW / DENY
– we verify the signature before execution
Looks solid, right? It wasn’t. We hit a few problems a…
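The flow above could look roughly like this. A minimal sketch: `sign_verdict`, `verify_and_execute`, and the shared HMAC key are illustrative names, not a real library, and a real system would also bind the verdict to a nonce/expiry.

```python
import hmac, hashlib, json

SECRET = b"shared-policy-key"  # assumed shared between policy engine and executor

def sign_verdict(action: dict, decision: str) -> dict:
    """Policy engine side: emit a signed ALLOW / DENY verdict for one action."""
    payload = json.dumps({"action": action, "decision": decision}, sort_keys=True).encode()
    sig = hmac.new(SECRET, payload, hashlib.sha256).hexdigest()
    return {"action": action, "decision": decision, "sig": sig}

def verify_and_execute(verdict: dict, execute_tool) -> bool:
    """Executor side: verify the signature before execution; DENY means the tool never runs."""
    payload = json.dumps(
        {"action": verdict["action"], "decision": verdict["decision"]}, sort_keys=True
    ).encode()
    expected = hmac.new(SECRET, payload, hashlib.sha256).hexdigest()
    if not hmac.compare_digest(expected, verdict["sig"]):
        raise ValueError("tampered verdict")  # signature check failed
    if verdict["decision"] != "ALLOW":
        return False  # DENY: blocked before execution
    execute_tool(verdict["action"])
    return True
```

Note the verdict only says the action *was* allowed at signing time, which is exactly where the “still unsafe” problems start.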

Built a demo where an agent can provision 2 GPUs, then gets hard-blocked on the 3rd call

Policy:
– budget = 1000
– each `provision_gpu(a100)` call = 500
Result:
– call 1 -> ALLOW
– call 2 -> ALLOW
– call 3 -> DENY (`BUDGET_EXCEEDED`)
Key point: the 3rd tool call is denied before execution. The tool never runs. Also emits:
– …
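The demo’s policy can be sketched in a few lines. `BudgetPolicy` and the stand-in `provision_gpu` wrapper are illustrative; the point is only that the deny happens before the side effect.

```python
class BudgetPolicy:
    """Stateful budget check: ALLOW while cost fits, DENY:BUDGET_EXCEEDED after."""
    def __init__(self, budget: int):
        self.remaining = budget

    def evaluate(self, cost: int) -> str:
        if cost > self.remaining:
            return "DENY:BUDGET_EXCEEDED"
        self.remaining -= cost
        return "ALLOW"

def provision_gpu(policy: BudgetPolicy, gpu: str = "a100", cost: int = 500) -> str:
    decision = policy.evaluate(cost)
    if decision != "ALLOW":
        return decision  # denied before execution; the tool never runs
    # ... actual provisioning side effect would happen here ...
    return "ALLOW"
```

With budget = 1000 and cost = 500 per call, calls 1 and 2 return ALLOW and call 3 returns `DENY:BUDGET_EXCEEDED`, matching the demo.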

This OpenClaw paper shows why agent safety is an execution problem, not just a model problem

Paper: https://arxiv.org/abs/2604.04759
This OpenClaw paper is one of the clearest signals so far that agent risk is architectural, not just model quality. A few results stood out:
– poisoning Capability / Identity / Knowledge pushes attack success fro…

LLM agents can trigger real actions now. But what actually stops them from executing?

We ran into a simple but important issue while building agents with tool calling: the model can propose actions, but nothing actually enforces whether those actions should execute. That works fine… until the agent controls real side effects: APIs, infra…

What actually prevents execution in agent systems?

Ran into this building an agent that could trigger API calls. We had validation, tool constraints, retries… everything looked “safe”. Still ended up executing the same action twice due to stale state + retry. Nothing actually prevented execution. It on…
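The stale-state + retry failure is easy to reproduce, and one common fix is an idempotency key recorded before the side effect, so a retry becomes a no-op. A minimal sketch with illustrative names (`charge_api`, the module-level `executed` set; a real system would persist this, not keep it in memory):

```python
executed: set[str] = set()  # idempotency keys we've already acted on
calls: list[str] = []       # stands in for the real side effect

def charge_api(request_id: str, amount: int) -> str:
    """Execute at most once per request_id, even across retries."""
    if request_id in executed:
        return "DUPLICATE_SUPPRESSED"  # retry arrives: nothing executes
    executed.add(request_id)           # record *before* the side effect
    calls.append(f"charge:{amount}")   # the real side effect
    return "EXECUTED"
```

The key design point: the retry is rejected by the execution layer itself, not by validation upstream, which is what was missing in our setup.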

Where should the execution boundary actually live in Agent systems?

following up on a discussion from earlier

a pattern that keeps showing up in real systems: most control happens after execution
– retries
– state checks
– monitoring
– idempotency patches
but the actual decision to execute is often implicit if the agen…
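One way to make that decision explicit: a single dispatch choke point that every tool call must pass, deny-by-default. A sketch with assumed names (`authorize`, the allowlist contents):

```python
def authorize(tool: str, args: dict) -> bool:
    """Stand-in policy: deny-by-default with a hypothetical allowlist."""
    allowed = {"read_logs", "list_instances"}
    return tool in allowed

def dispatch(tool: str, args: dict, registry: dict):
    """The execution boundary: the only path from a proposed action to a side effect."""
    if not authorize(tool, args):
        raise PermissionError(f"DENY: {tool}")  # blocked before execution
    return registry[tool](**args)
```

The retries / monitoring / idempotency patches listed above then become defense-in-depth behind this boundary instead of the only line of defense.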

AI agents can trigger real-world actions. Why don’t we have execution authorization yet?

While experimenting with autonomous agents recently, I keep running into a pattern that feels oddly familiar from distributed systems history. A lot of current discussion around agent reliability focuses on:
– better prompting
– model alignment
– sandboxed …

Building AI agents taught me that most safety problems happen at the execution layer, not the prompt layer. So I built an authorization boundary

Something I kept running into while experimenting with autonomous agents is that most AI safety discussions focus on the wrong layer. A lot of the conversation today revolves around:
• prompt alignment
• jailbreaks
• output filtering
• sandboxing
Those…

We’re building a deterministic authorization layer for AI agents before they touch tools, APIs, or money

Most discussions about AI agents focus on planning, memory, or tool use. But many failures actually happen one step later: when the agent executes real actions. Typical problems we've seen:
– runaway API usage
– repeated side effects from retries
– recur…
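A deterministic pre-execution gate for the first two failure modes above might look like this. The limits and names (`ExecutionGate`, per-tool call caps) are assumptions for the sketch, not our actual design:

```python
class ExecutionGate:
    """Deterministic checks that run before any tool call: dedup + per-tool cap."""
    def __init__(self, max_calls_per_tool: int = 3):
        self.max_calls = max_calls_per_tool
        self.counts: dict[str, int] = {}  # calls executed per tool
        self.seen: set[str] = set()       # request ids already acted on

    def check(self, tool: str, request_id: str) -> str:
        if request_id in self.seen:
            return "DENY:DUPLICATE"       # retry of an already-executed action
        if self.counts.get(tool, 0) >= self.max_calls:
            return "DENY:RATE_LIMIT"      # runaway usage cut off
        self.seen.add(request_id)
        self.counts[tool] = self.counts.get(tool, 0) + 1
        return "ALLOW"
```

Because the gate is deterministic, the same sequence of requests always produces the same ALLOW / DENY trace, which is what makes it auditable.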