artificial
artificial

I ran a logging layer on my agent for 72 hours. 37% of tool calls had parameter mismatches — and none raised an error.

I've been running an AI agent that makes tool calls to various APIs, and I added a logging layer to capture exactly what was being sent vs. what the tools expected. Over 84 tool calls in 72 hours, 31 of them (37%) had parameter mismatches — and not…

What Generative AI Reveals About the State of Software?

I’ve spent more than two years building an agentic AI platform, working daily with GPT, Claude, and lately Gemini LLM models in real-world production code. They’re powerful; but if you watch closely, you’ll see something unsettling. They don’t just wri…

AI still helpful?

I am doing my PhD research and used AI over the last 2Y but really thinking to stop. Most of the tools are becoming useless exact Anthropic Claude Opus that cost a fortune. If AI helped me before , it’s now , not only not helping at all but misleading …

Common GPT 5.5 pricing misconception.

Many people have pointed out that ChatGPT 5.5 appears to be twice as expensive as 5.4 based on API pricing, which makes it look pricier than Opus 4.7. But the comparison is not that simple. GPT 5.5 is significantly more token-efficient in practic…

I finally cracked the code on a headquarters-style app using Claude code in terminals for my companies. Unlimited agents all running through terminals on my Claude subscription without needing to use api calls. Should I make a waitlist for this? Thoughts from any real devs?

The Canvas of the HQ Finally had enough and decided to make an interface that will let me use Claude code in as many terminals as I want. The canvas has a node for every project and folder you have, and the center "HQ Node" opens to my comman…

Update on the offscreen lives system — shipped some of what this thread helped me figure out

Posted a few days ago about how I built offscreen events for AI companions. The discussion here was genuinely useful — wanted to close the loop. A few things I shipped based on what came up: The surfaced_at flag suggestion from u/ultrathink-art was the…

I gave an AI a CT Scan While It Listened to an Emotional Conversation [R]

I created an [Activation Lab](https://github.com/cstefanache/llmct) tool that can be seen as an MRI machine for AI. It captures snapshots of every single layer inside a language model while it processes a conversation. It allows you to fully understand…

Can Claude’s “Skills” (custom SKILL.md instruction files) be exported and used in ChatGPT?

Hey everyone, I’ve been using Claude.ai with a custom skill setup inside a Project. Basically I have a folder of Markdown files (SKILL.md files) that act as persistent instructions for Claude. Each skill has a name, a description, a trigger condition a…