Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
quadrature
28 days ago
|
parent
|
context
|
favorite
| on:
Microsoft CTO says he wants to swap most AMD and N...
I'm not very well versed, but i believe that training requires more memory to store intermediate computations so that you can calculate gradients for each layer.
Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: