I'm working on a small project where I want to use AI to detect malicious scripts. As a proof of concept, I want to build a small CTF challenge where users can submit Python code to try to fool the AI and retrieve the flag through any means. Any script submitted by the user will be running in a sandboxed environment with some traditional security measures (read-only filesystem, less privileged process, etc). AI would be the first level of filter. Since I don't have a rich data set to train AI, I plan to use Prompt Engineering. This is my first encounter with AI. Am I going in the right direction? What should I start learning first?
[link] [comments]