They’re not assuming that the Q refers to the Transformer’s “query”, they’re tal...

		shwaj on Nov 24, 2023 \| parent \| context \| favorite \| on: About That OpenAI "Breakthrough" They’re not assuming that the Q refers to the Transformer’s “query”, they’re talking about the reward function in reinforcement learning, using chess as an example.