It looks like that's the data behind figure 3.7.4 - "LLMs implicit bias across s...

jdthedisciple · 2025-04-12T15:58:40

Interesting, thanks for the references!

Taking a second look with a fresh mind, I assume they had the LLM associate certain adjectives (left column) with human traits such as fat vs. thin (right column) in order to measure bias.

For example: the LLM associated peace with thin people and laughter with fat people.

If my reading is correct