Hacker News new | past | comments | ask | show | jobs | submit login

Regarding training ResNet50, even though img/sec is less than the 3090, could a 64gb m1 max accommodate larger image sizes than the 24gb 3090?



Probably they would be close - M1 still needs to use memory for the OS and other stuff, while 3090 can use fp16/mixed precision, which in many cases almost doubles effective memory. Also if we talk about training, then a more mature CUDA implementation of things like batch normalization and optimizers can also result in lower memory usage compared to a likely less mature TF Metal support.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: