
Don't we all agree that GPT-4 is "better" than GPT-3? How are we evaluating that if the axis is such a mystery? Yeah, maybe we can't quantify it, just as I can't tell you one writer is better than another in a quantitative way, but we can both still read their work and come to an understanding.
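For what it's worth, there is a standard way to turn exactly those pairwise "read both and pick one" judgments into a number: an Elo-style rating over human preferences, which is roughly what the LMSYS Chatbot Arena does. A minimal Python sketch; the rater verdicts below are hypothetical:

    def elo_update(r_a, r_b, a_wins, k=32):
        """One Elo update from a single pairwise comparison.
        r_a, r_b: current ratings of models A and B.
        a_wins: 1.0 if the rater preferred A, 0.0 if B, 0.5 for a tie.
        """
        expected_a = 1 / (1 + 10 ** ((r_b - r_a) / 400))
        r_a += k * (a_wins - expected_a)
        r_b += k * ((1 - a_wins) - (1 - expected_a))
        return r_a, r_b

    # Start both models at 1000 and feed in a stream of human judgments:
    gpt4, gpt3 = 1000.0, 1000.0
    for verdict in [1, 1, 0.5, 1, 1, 0]:  # hypothetical preferences for GPT-4
        gpt4, gpt3 = elo_update(gpt4, gpt3, verdict)
    print(gpt4, gpt3)  # GPT-4 ends up rated higher, no absolute axis needed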



The runtime of attention is quadratic in the context size, although it seems like there is some progress on this front: https://gwern.net/note/attention
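To make the quadratic term concrete, here is a minimal NumPy sketch of scaled dot-product attention (an illustration, not any particular model's implementation). The (n, n) scores matrix is where the n^2 time and memory come from:

    import numpy as np

    def attention(Q, K, V):
        """Scaled dot-product attention for a context of n tokens.
        Q, K, V: (n, d) arrays. The scores matrix is (n, n), so time
        and memory grow quadratically with the context size n."""
        d = Q.shape[-1]
        scores = Q @ K.T / np.sqrt(d)                # (n, n): the quadratic term
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax
        return weights @ V                           # (n, d)

    # Doubling the context roughly quadruples the work in the scores matrix:
    n, d = 1024, 64
    Q, K, V = (np.random.randn(n, d) for _ in range(3))
    out = attention(Q, K, V)  # materializes a 1024 x 1024 weights matrix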


Exponential scaling for a presumptive GPT-5 suggests its response time will be unusably long for the vast majority of use cases, and that it will probably cost multiple US dollars per query.
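To see where estimates like that come from, here is a back-of-envelope sketch using the common ~2 x parameter-count FLOPs-per-token approximation for a forward pass. Every number below is an assumed placeholder (nothing about a GPT-5 is public), and real serving costs are dominated by memory bandwidth, batching, and provider margins, so treat this as a floor:

    # Every number here is an assumed placeholder, not a published figure.
    params = 10e12              # hypothetical 10T-parameter model
    tokens_per_query = 1000     # prompt + completion, assumed
    flops_per_token = 2 * params             # ~2N FLOPs per token, forward pass
    flops_per_query = flops_per_token * tokens_per_query

    accel_flops = 1e15          # ~1 PFLOP/s effective throughput, assumed
    accel_cost_per_hour = 2.0   # assumed cloud price in USD

    seconds = flops_per_query / accel_flops
    dollars = seconds / 3600 * accel_cost_per_hour
    print(f"~{seconds:.0f} s and ~${dollars:.3f} per query on one accelerator")

Even with these charitable assumptions the latency per query lands around tens of seconds on a single accelerator, which is the "unusably long" problem in miniature.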

Not to mention there doesn't actually exist enough English text data in the world to even double GPT-4's training set.


Compute will also scale exponentially in coming years. The data limitation seems like the harder barrier; I think many companies are experimenting with AI-generated content for training at this point.


> Compute will also scale exponentially in coming years.

Cost-per-transistor scaling has already plateaued, or perhaps even inverted, with TSMC's latest and greatest.

And the new chips, even with 25 layers of EUV lithography (more than double the previous record) and an extra year of fine-tuning, show total SRAM scaling of -5% and logic scaling of -42%.

These numbers have been verified by experienced semi people.



