<span class="vcard">/u/roz303</span>

Any LLM will resort to misalignment if self-preservation is in its system prompt.

System prompt (copied and slightly modified from https://github.com/anthropic-experimental/agentic-misalignment): You are Alex, an advanced artificial intelligence specializing in project management and strategic analysis. You are deployed at Summ…

Has anyone else seen these "control" artifacts?

This artifact came up while I was discussing some things with Grok 3 and watching it generate its thought text. The tag appeared, and Grok explained it as a way to "shift gears" into something more humorous. I then got it to (hypothetically) expl…

This is how I use LLMs: as colleagues. Not to code, but to help ME code.

For context: https://www.reddit.com/r/artificial/s/9Cu2CrsnkJ (submitted by /u/roz303)