Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I’m trying different algorithms to optimize DRAM access patterns to make parallel processing by a GPU of nodes in a graph faster.

My biggest takeaway so far is that you first need to check whether or not your dataset fits into the L2 cache…



Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: