Hacker Newsnew | past | comments | ask | show | jobs | submit | tejasvaidhya's commentslogin

There’s more to it. https://x.com/NolanoOrg/status/1813969329308021167

I will be archiving the full report with more results soon.


I should note that our linear layers are not the same as Microsoft's, in fact, we think Microsoft made a mistake in the code they uploaded. When I have time later today, I'll link to where I think they made a mistake.

I've been following TriLLM. They've achieved great results, and I'm really impressed with the llama.cpp contributors already getting the models integrated.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: