Hacker News new | past | comments | ask | show | jobs | submit login

> On existing hardware, the gains in compute and memory efficiency are significant, without performance degradation (as tested by the authors).

Did they actually show absence of performance degradation?

I think it's conspicuous that Table 1 and Table 2 in the paper, which show perplexity and accuracy results respectively, are only for small model sizes, whereas Figure 2, Figure 3 (latency, memory, energy consumption) and Table 3 (throughput) all show larger model sizes. So it seems like they had every opportunity to show the perplexity/accuracy comparisons at the larger model sizes, but did not include them.




Others have already made the same point in this thread. See my response here: https://news.ycombinator.com/item?id=39539508




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: