Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Some[1] think that things are trending in the opposite direction: away from clever manipulations and hard coded domain knowledge, and towards large scale general models.

[1]: http://www.incompleteideas.net/IncIdeas/BitterLesson.html




This made me think of thr differences FPGAs and microprocessors - with "more laters" being equivalent to "more gates"


Yeah, I was surprised to see the architecture diagram is so complex. It's been a while since I saw a design that wasn't just "stack more transformer layers".




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: