Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Show HN: Built a 9MB GPU kernel achieving 43M ops/SEC with deterministic replay
2 points by TacosInMyPocket 3 months ago | hide | past | favorite
I've developed a custom GPU kernel that handles 40+ million parallel agent operations per second while maintaining apparently deterministic results across runs - something typically considered impossible with GPU parallel processing.

Performance demo: https://youtu.be/Y3Jg8RCZ65c Determinism proof: https://youtu.be/fk7NMNGcfSY

The entire runtime is under 10MB. Open to discussing potential applications!

autoscriptlabs@gmail.com



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: