Right, but that ignores the key question: whether such systems can infer the equivalence relation denoted by "is". A is A. A is B implies B is A. A is B and B is C implies A is C. When a system sees that A is B yet cannot infer that B is A, it exhibits an asymmetry, which is interesting. Whether this asymmetry exists in the underlying language is unclear.
There's an asymmetry here, too. "The sky is ${color}", with no extra context, has one obvious answer for us living here, today. Whereas for "Blue is the color of ${thing}", with no extra context, there's an insane number of equally sensible substitutions for ${thing}. Without extra context, the model has no reason to privilege "sky" over any other equally valid answer.
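To make that concrete, here's a rough sketch (not from the thread; the model name and prompts are just placeholders) of how you could compare the two directions by scoring completions with an off-the-shelf causal LM:

```python
# Sketch of the asymmetry: score how strongly a causal LM prefers a given
# continuation in each direction. Assumes a HuggingFace causal LM.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL = "gpt2"  # placeholder; any causal LM works the same way
tok = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForCausalLM.from_pretrained(MODEL)
model.eval()

def continuation_logprob(prompt: str, continuation: str) -> float:
    """Sum of log-probabilities the model assigns to `continuation` after `prompt`."""
    prompt_ids = tok(prompt, return_tensors="pt").input_ids
    cont_ids = tok(continuation, return_tensors="pt").input_ids
    input_ids = torch.cat([prompt_ids, cont_ids], dim=1)
    with torch.no_grad():
        logits = model(input_ids).logits
    # Log-probs for each continuation token, predicted from the previous position.
    log_probs = torch.log_softmax(logits[0, prompt_ids.shape[1] - 1 : -1], dim=-1)
    return log_probs.gather(1, cont_ids[0].unsqueeze(1)).sum().item()

# Forward direction: one continuation clearly dominates.
print(continuation_logprob("The sky is", " blue"))
# Reverse direction: " sky" competes with many equally valid completions.
print(continuation_logprob("Blue is the color of the", " sky"))
print(continuation_logprob("Blue is the color of the", " ocean"))
```

The point isn't the absolute numbers, just that the forward prompt concentrates probability mass on one answer while the reversed prompt spreads it across many.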
> When a system sees that A is B, yet cannot infer that B is A
So the issue is that they cannot infer that general rule due to a fundamental limitation of the transformer LLM architecture, not just a training data issue? I skimmed the paper, and that seems to be the case.