Show HN: We've open-sourced our LLM attention visualization library

xcodevn · 2024-06-10T04:33:21 1717994001

On a related note: recently, I released a visualization of all MLP neurons inside the llama3 8B model. Here is an example "derivative" neuron which is triggered when talking about the derivative concept.

https://neuralblog.github.io/llama3-neurons/neuron_viewer.ht...

skulk · 2024-06-10T06:05:31 1717999531

This is insanely fun to just flip through. I found a "sex" neuron. https://neuralblog.github.io/llama3-neurons/neuron_viewer.ht...

vpj · 2024-06-10T05:27:34 1717997254

Pretty cool. The tokens are highlighted based on the activation?

xcodevn · 2024-06-10T05:42:26 1717998146

Yes, you're correct. The tokens are highlighted based on the neuron activation value, which is scaled to a range of 0 to 10.

SushiHippie · 2024-06-09T17:04:12 1717952652

This seems to be what Anthropic and OpenAI did in their research

Golden Gate Claude - https://news.ycombinator.com/item?id=40459543 - (60 comments, 16 days ago)

Extracting Concepts from GPT-4 - https://news.ycombinator.com/item?id=40599749 (144 comments, 2 days ago)

lakshith-403 · 2024-06-09T18:45:59 1717958759

Interesting. I think OpenAI here uses sparse autoencoders to map out sparse activation patterns in networks. Comparing them to how a real person reasons about a situations.

Inspectus, on the other hand is a general tool to visualize how transformer models pay attention to different parts of the data they process.

dimatura · 2024-06-10T01:08:34 1717981714

That OpenAI work is more elaborate. It trains an additional network in such a way that it encodes what GPT is doing in terms of activations, but in a more interpretable way (hopefully). Here, as far as I can tell, it's visualizing the activation of the attention layers directly.

ravjo · 2024-06-09T15:50:21 1717948221

Sounds great. Non-engineer, but curious. Is there a walkthrough blog post or video that can help someone appreciate/understand this easily?

swifthesitation · 2024-06-09T18:35:57 1717958157

Attention in transformers, visually explained | Chapter 6, Deep Learning - 3Blue1Brown: https://www.youtube.com/watch?v=eMlx5fFNoYc&t=

lakshith-403 · 2024-06-09T18:46:19 1717958779

Thank you

blackbear_ · 2024-06-10T10:22:38 1718014958

Loosely related, but also a great read: https://distill.pub/2020/circuits/zoom-in/

benf76 · 2024-06-09T15:50:06 1717948206

This looks cool but can you explain how to make it useful?

lakshith-403 · 2024-06-09T18:21:10 1717957270

I'm not a primary user. Just cleaned up the existing codebase to make it open source. But you could use this to visualise attentions and debug the model.

For an example if you're working on a Q&A model, you can check which tokens in the prompt contributed to the output. It's possible to detect issues like output not paying attention to any important part of the prompt.

3abiton · 2024-06-10T09:59:44 1718013584

The issue with the ambiguity of usage plague lots of OSS projects. Guides/Tutorials will always help drive usage much more, just look at the usage of GPT-3 vs ChatGPT (which is GPT-3.5 with WebUI slapped on top of it).

JackYoustra · 2024-06-09T23:23:44 1717975424

Hey! This is pretty neat, it reminds me of the graphs made by transformer_lens. Cool to see all of these visualization libraries popping up!