Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
awestroke
15 days ago
|
parent
|
context
|
favorite
| on:
OpenAI says it has evidence DeepSeek used its mode...
No. They publish PDFs that hype up their models, but they do not publish anything even resembling a high-level overview of model architecture
jacobgorm
15 days ago
[–]
Given that you can download and use the weights, the model architecture has to be includded as part of that. And I did read a paper from them recently describing their MoE architecture and how it differs from the original GShard.
awestroke
15 days ago
|
parent
[–]
Excuse me? What weights can you download from OpenAI? gpt2 does not count
jacobgorm
14 days ago
|
root
|
parent
[–]
Sorry I meant that DeepSeek release their models. Wrong context.
Join us for
AI Startup School
this June 16-17 in San Francisco!
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: