Hacker News | 152334H's submissions
1. Calculating the cost of a Google DeepMind paper (152334h.github.io)
303 points by 152334H 68 days ago | past | 150 comments
2. Knowing Enough About MoE to Explain Dropped Tokens in GPT-4 (152334h.github.io)
3 points by 152334H on Aug 8, 2023 | past | 1 comment
3. Non-determinism in GPT-4 is caused by Sparse MoE (152334h.github.io)
397 points by 152334H on Aug 4, 2023 | past | 181 comments
4. LLaVA: Large Language and Vision Assistant (llava-vl.github.io)
3 points by 152334H on April 18, 2023 | past
5. Why can't TorToiSe be fine-tuned? (152334h.github.io)
1 point by 152334H on Feb 11, 2023 | past
