Is this an accurate representation of the GPT driver loop?
def generate(prompt: str) -> str:
    # Transform a string into a list of tokens.
    tokens = tokenize(prompt)  # tokenize(prompt: str) -> list[int]
    while True:
        # Run the model.
        # Returns the next token's probability distribution:
        # a list of 50257 floats that sum to 1.
        candidates = gpt2(tokens)  # gpt2(tokens: list[int]) -> list[float]
        # Select the next token from the candidate distribution.
        next_token = select_next_token(candidates)
        # select_next_token(candidates: list[float]) -> int
        # Append it to the list of tokens.
        tokens.append(next_token)
        # Decide whether to stop generating. The criterion could be a
        # token count, a timeout, a stop word, or something else.
        if should_stop_generating():
            break
    # Transform the list of tokens back into a string.
    completion = detokenize(tokens)  # detokenize(tokens: list[int]) -> str
    return completion
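
For context, here is one plausible implementation of select_next_token. This is a sketch of my own, not code from any particular library: simple decoders either take the argmax (greedy decoding) or sample from the distribution, as below.

import random

# Illustrative sketch (my assumption, not part of the pseudocode above):
# sample the next token in proportion to its probability.
def select_next_token(candidates: list[float]) -> int:
    # candidates[i] is the probability of token i; random.choices picks
    # one index weighted by those probabilities.
    return random.choices(range(len(candidates)), weights=candidates, k=1)[0]

# Greedy decoding would instead be:
#   next_token = max(range(len(candidates)), key=candidates.__getitem__)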
I ask because that loop looks a lot like a state machine implementing Shlemiel the painter's algorithm: every call to gpt2 re-processes the entire, ever-growing token list from scratch, which throws doubt on the intrinsic compute cost of the generative exercise.
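
To make that concern concrete, here is a back-of-the-envelope sketch (my own illustration, with made-up names) of how many tokens the model would process in total if nothing were cached between iterations:

def total_tokens_processed(prompt_len: int, steps: int) -> int:
    # Step i re-feeds the prompt plus the i tokens generated so far,
    # so the total is quadratic in the number of steps: classic Shlemiel.
    return sum(prompt_len + i for i in range(steps))

print(total_tokens_processed(100, 10))    # 1045
print(total_tokens_processed(100, 100))   # 14950
print(total_tokens_processed(100, 1000))  # 599500

If the loop really works this way, generating n tokens costs O(n^2) token evaluations.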
I think the "context window" that people refer to with large language models means there's a maximum number of tokens that are retained, with the oldest being discarded. The window is a sliding window.