Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

On the surface this is an interesting concept...

The paper however, meh...

No mention of MoE. One would think this is a logical evolution of that but not a mention (that I saw). Its own rubric for the task, Towers of Hanoi, was admittedly weak.

LLM papers are starting to look like the last decade of JS frameworks and Tools. Only with less code and more academics, and thats disappointing, because I think a lack of pragmatism and grounding is now holding the field back...





Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: