Hacker News
We Found an Neuron in GPT-2 (lesswrong.com)
17 points by tzury on Feb 18, 2023 | 2 comments



This is one of the first articles I’ve read with a decent attack on reverse engineering the black boxes of neural networks. I particularly appreciate the use of corrupted prompts for isolating behaviors.


Came here to say something similar. It seems to me that being able to determine how specific neurons are affecting outputs will be crucial to future optimization.



