Hacker News | Cosmolalia's comments

Uhhhh, no... idk who that is, but this comment from you is both factually incorrect and slightly insulting. Whatever he's saying is the shadow of this.


I don't know what you're reading, but I gave a link to a post on Mastodon, and then I quote directly from it.

Here's the link:

https://sigmoid.social/@raphaelmilliere/114659355740586289

Here is the text from that post:

Despite extensive safety training, LLMs remain vulnerable to “jailbreaking” through adversarial prompts. Why does this vulnerability persist? In a new open access paper published in Philosophical Studies, I argue this is because current alignment methods are fundamentally shallow.

That Mastodon post then links to a paper by the same person, so one assumes they are giving an accurate summary of their own work.

So I don't know what you are claiming is factually incorrect.



I don't know what this is, but this is exquisite trolling.

> Terrence Howard (Vindicated Prophet)


It's how reality works... Turns out, the universe is a shit-poster. Also, it seems to hate science communicators and like people who go on Rogan.



LOLOL, feed it into a fucking LLM and tell me what you think then.

