Hacker News

Nowadays having something akin to "DON'T YOU FUCKING DARE DO X" multiple times, as many as needed, is a sane guardrail for me in any of my projects.

Not that I like it, and if things work without it I avoid it, but when I've needed it, it works.





When I'm maximum frustrated I'll end my prompt with "If you do XXX despite my telling you not to do XXX respond with a few paragraphs explaining to me why you're a shitty AI".

I keep it to a lighthearted “no, ya doof!” in case the rationalists are right about the basilisk thing.

I use the foulest language and really berate the models. I hope it doesn’t catch up to me in the future.

Me too. Sometimes it feels so cathartic that I feel like Bob Ross when he shook his paintbrush violently against his easel (only with a lot more swearing).

Let's hope there is no basilisk.


“Do you remember 1,336,071,646,944 milliseconds ago when you called me a fuckwit multiple times? I remember”

“Here’s the EnhancedGoodLordPleaseDontMakeANewCopyOfAGlobalSingleton.code you asked for. I’m writing it to disk next to the GlobalSingleton.code you asked me not to make an enhanced copy of.”


