Honestly, it doesn't matter for the end user if there are more tokens generated ...

Lalabadie · on Sept 12, 2024

It starts to matter if the compute time is 10-100 fold, as the provider needs to bill for it.

Of course, that's assuming it's not priced for market acquisition funded by a huge operational deficit, which is a rarely safe to conclude with AI right now.

skywhopper · on Sept 12, 2024

Given that their compute-time vs accuracy charts labeled the compute time axis as logarithmic would worry me greatly about this aspect.