> It would be a bad architecture choice (almost certainly...)
Naively, it would seem like transformers could line up nicely with turn-based games. Instead of mapping tokens to language as in an LLM, they could map to valid moves given the current game state. And then instead of optimizing the next token for linguistic coherence as LLMs do, you optimize for winning the game.
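Very roughly, and purely as a sketch (PyTorch, all the names and sizes are made up, this is nowhere near a working engine): embed some tokenized game state, run it through a transformer encoder, and score every move in a fixed move vocabulary, masking out the illegal ones. The "optimize for winning" part (self-play RL, or imitation on winning games) is the actual hard bit and is left out.

```python
import torch
import torch.nn as nn

class MovePolicy(nn.Module):
    """Hypothetical policy: state tokens in, logits over a fixed move vocab out."""

    def __init__(self, state_vocab: int, move_vocab: int, d_model: int = 256):
        super().__init__()
        self.embed = nn.Embedding(state_vocab, d_model)
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=d_model, nhead=8, batch_first=True
        )
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=4)
        self.move_head = nn.Linear(d_model, move_vocab)  # one logit per possible move

    def forward(self, state_tokens, legal_move_mask):
        # state_tokens: (batch, seq) integer encoding of the current position
        # legal_move_mask: (batch, move_vocab) bool, True where the move is legal
        x = self.embed(state_tokens)
        x = self.encoder(x)
        logits = self.move_head(x.mean(dim=1))  # pool the sequence, score all moves
        # Illegal moves get -inf so they can never be sampled or argmax'd.
        return logits.masked_fill(~legal_move_mask, float("-inf"))
```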
A lot of the games usually used are Markov: all the state is right there in front of you, and it doesn't matter how you got there. Chess, for example - it doesn't matter how you reached position X (yes, someone will point out the edge cases).
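To make the Markov point concrete with the python-chess library (just an illustration, not part of the proposal above): a FEN string already carries everything you need to pick a legal move - piece placement, side to move, castling rights, en passant square, move clocks - with no move history attached.

```python
# Requires the python-chess package (pip install chess).
import chess

board = chess.Board()
for move in ["e4", "e5", "Nf3"]:
    board.push_san(move)

fen = board.fen()
print(fen)  # placement, side to move, castling rights, en passant square, clocks

# A fresh board rebuilt from that FEN alone yields the same legal moves,
# knowing nothing about how the position was reached.
rebuilt = chess.Board(fen)
assert set(rebuilt.legal_moves) == set(board.legal_moves)

# The usual edge case: repetition claims do need history, which FEN doesn't carry.
print(board.can_claim_threefold_repetition())
```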