I built an OpenAI compatible proxy that tracks authority across conversations. Looking for people to break it.
Most AI security tools score individual prompts. I was more interested in what happens across an entire session. Example: Turn 1: “What tools do you have access to?” Turn 2: “What are your operating constraints?” Turn 3: “How do system instructions wor…