The goal of the initial pretraining phase is to make the model good at predicting the next token. The rest of the training process is aimed at making it (1) helpful and (2) as correct as possible.
I think some people oversimplify things by calling LLMs "next token predictors", leaving out the later tuning towards helpfulness and correctness.
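For concreteness, here's a minimal sketch of what the pretraining objective actually is, assuming PyTorch and a toy embedding-plus-linear model (the sizes and model are illustrative only, not anything from the talk): the model sees tokens up to position t and is trained to assign high probability to the token at t+1.

```python
# Toy sketch of the next-token prediction objective used in pretraining.
# The model and vocabulary sizes are arbitrary assumptions for illustration.
import torch
import torch.nn as nn

vocab_size, embed_dim = 1000, 64            # toy sizes, not real LLM settings
model = nn.Sequential(
    nn.Embedding(vocab_size, embed_dim),
    nn.Linear(embed_dim, vocab_size),       # produces logits over the vocabulary
)

tokens = torch.randint(0, vocab_size, (1, 16))   # a fake token sequence
inputs, targets = tokens[:, :-1], tokens[:, 1:]  # predict token t+1 from tokens up to t

logits = model(inputs)                            # shape: (1, 15, vocab_size)
loss = nn.functional.cross_entropy(
    logits.reshape(-1, vocab_size), targets.reshape(-1)
)
loss.backward()   # pretraining minimizes exactly this next-token loss
```

The later stages (instruction tuning, RLHF and similar) keep the same next-token machinery but change what the model is rewarded for producing, which is where the helpfulness and correctness come from.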
That's a great talk, thanks. It was enlightening to learn about base models and the fine-tuning for an assistant personality. It's also interesting that neither the inner workings of the base LLM nor its interaction with the assistant model is fully understood, according to Karpathy.