
I don't think that's accurate; it generates novel outputs that were not observed in the training data.



It doesn't generate new tokens.

Train an LLM on text that only uses lowercase, and it will never output an uppercase letter.
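A minimal sketch (PyTorch, with a made-up vocabulary) of why that holds: the model's final layer is a softmax over a fixed token set, so anything outside that set has no index and can never be sampled.

    import torch

    # Hypothetical vocabulary, fixed at training time -- lowercase only.
    vocab = ["a", "b", "c", "d", " "]
    logits = torch.randn(len(vocab))      # output of the final layer
    probs = torch.softmax(logits, dim=0)  # distribution over vocab only
    next_token = vocab[torch.multinomial(probs, 1).item()]
    # "A" can never be emitted: it has no row in the output matrix.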


So the model is limited to using words and characters that already exist. I agree with you, but I don't see why that's a limitation worth pointing out.


You literally have to put every number into the training data for it to do mathematics correctly...

It's as stupid as that. Some try to get around it by having only the ten different digits and gluing them together, but it's a hallucination that that works.
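A quick way to see how a real GPT tokenizer actually chunks digits (a sketch assuming tiktoken is installed; the exact splits depend on the encoding):

    import tiktoken

    # cl100k_base is the encoding used by GPT-4-era models.
    enc = tiktoken.get_encoding("cl100k_base")
    ids = enc.encode("123456789")
    # Prints the text of each token: multi-digit chunks, not single digits.
    print([enc.decode([i]) for i in ids])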

An important part of generalization is, for example, that you can teach it something new. This literally matters in practice.

'ycombinator is a website' is a prompt that is almost impossible to complete if ycombinator is not in your training set.


But can it put two tokens together: 10 01 = 1001?
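At the text level, at least, token sequences do concatenate; a tiny check using the same assumed tiktoken encoding as above:

    import tiktoken

    enc = tiktoken.get_encoding("cl100k_base")
    # Decoding the concatenated ID lists yields the concatenated string.
    print(enc.decode(enc.encode("10") + enc.encode("01")))  # prints "1001"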



