Hacker News
152334H's submissions
1. Calculating the cost of a Google DeepMind paper (152334h.github.io)
   303 points by 152334H 68 days ago | 150 comments
2. Knowing Enough About MoE to Explain Dropped Tokens in GPT-4 (152334h.github.io)
   3 points by 152334H on Aug 8, 2023 | 1 comment
3. Non-determinism in GPT-4 is caused by Sparse MoE (152334h.github.io)
   397 points by 152334H on Aug 4, 2023 | 181 comments
4. LLaVA: Large Language and Vision Assistant (llava-vl.github.io)
   3 points by 152334H on April 18, 2023
5. Why can't TorToiSe be fine-tuned? (152334h.github.io)
   1 point by 152334H on Feb 11, 2023