Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
Solo dev creates Open Source Turboquant (github.com/thetom)
4 points by nico 6 days ago | past | 2 comments
Skipping 90% of KV dequant work speeds up LLM decode by 22% (github.com/thetom)
1 point by pidtom 9 days ago | past | discuss

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: