| 1. | | Cord: Coordinating Trees of AI Agents (june.kim) |
| 154 points by gfortaine 14 days ago | 81 comments |
|
| 2. | | Model Flop Utilization Beyond 6ND (jott.live) |
| 17 points by brrrrrm 5 months ago |
|
| 3. | | What every programmer should know about how CPUs work [video] (youtube.com) |
| 227 points by bschne 8 months ago | 14 comments |
|
| 4. | | Dummy's Guide to Modern LLM Sampling (rentry.co) |
| 228 points by nkko 10 months ago | 37 comments |
|
| 5. | | FFmpeg School of Assembly Language (github.com/ffmpeg) |
| 869 points by davikr on Feb 22, 2025 | 220 comments |
|
| 6. | | Mini-R1: Reproduce DeepSeek R1 "Aha Moment" (philschmid.de) |
| 191 points by jonbaer on Jan 31, 2025 | 15 comments |
|
| 7. | | DeepSeek's multi-head latent attention and other KV cache tricks (pyspur.dev) |
| 292 points by t55 on Jan 28, 2025 | 72 comments |
|
| 8. | | Emerging reasoning with reinforcement learning (hkust-nlp.notion.site) |
| 248 points by pella on Jan 26, 2025 | 211 comments |
|
| 9. | | Show HN: Free e-book about WebGPU Programming (shi-yan.github.io) |
| 471 points by billconan on Aug 4, 2024 | 73 comments |
|
| 10. | | Optimizing a bignum library for fun (austinhenley.com) |
| 145 points by azhenley on July 16, 2024 | 63 comments |
|
| 11. | | GPU compute in the browser at the speed of native: WebGPU marching cubes (willusher.io) |
| 168 points by Twinklebear on April 23, 2024 | 53 comments |
|
| 12. | | Show HN: Exploring Indra's Pearls with WebGPU (medium.com/philogb) |
| 108 points by philogb on April 20, 2024 | 6 comments |
|
| 13. | | The One Billion Row Challenge in CUDA (tspeterkim.github.io) |
| 241 points by tspeterkim on April 12, 2024 | 74 comments |
|
| 14. | | An Introduction to Flow Matching (cam.ac.uk) |
| 72 points by sebg on April 12, 2024 | 10 comments |
|
| 15. | | Llm.c – LLM training in simple, pure C/CUDA (github.com/karpathy) |
| 1050 points by tosh on April 8, 2024 | 168 comments |
|
| 16. | | Best engineering interview question I've gotten (quuxplusone.github.io) |
| 186 points by xelxebar on March 25, 2024 | 180 comments |
|
| 17. | | Show HN: Flash Attention in ~100 lines of CUDA (github.com/tspeterkim) |
| 230 points by tspeterkim on March 16, 2024 | 39 comments |
|
| 18. | | Compressing chess moves for fun and profit (mbuffett.com) |
| 179 points by thunderbong on March 15, 2024 | 135 comments |
|
| 19. | | Diffusion models from scratch, from a new theoretical perspective (chenyang.co) |
| 379 points by jxmorris12 on March 11, 2024 | 40 comments |
|
| 20. | | How video games use lookup tables (frost.kiwi) |
| 673 points by todsacerdoti on Feb 28, 2024 | 106 comments |
|
| 21. | | GGUF, the Long Way Around (vickiboykis.com) |
| 249 points by Tomte on Feb 29, 2024 | 30 comments |
|
| 22. | | Building a deep learning rig (samsja.github.io) |
| 164 points by dvcoolarun on Feb 23, 2024 | 106 comments |
|
| 23. | | Boring Python: dependency management (2022) (b-list.org) |
| 114 points by bruh2 on Jan 30, 2024 | 82 comments |
|
| 24. | | A tiny hand crafted CPU emulator, C compiler, and Operating System (github.com/rswier) |
| 186 points by seansh on Jan 6, 2024 | 10 comments |
|
| 25. | | Fastest autograd in the West (arogozhnikov.github.io) |
| 90 points by tplrbv on Jan 3, 2024 | 46 comments |
|
| 26. | | High Performance Voxel Engine (2021) (nickmcd.me) |
| 66 points by bibanez on Dec 30, 2023 | 20 comments |
|
| 27. | | Game Boy / Color Architecture (copetti.org) |
| 280 points by ronama on Dec 26, 2023 | 40 comments |
|
| 28. | | Understanding every byte in a WASM module (danielmangum.com) |
| 206 points by hasheddan on Dec 23, 2023 | 37 comments |
|
| 29. | | Simulating fluids, fire, and smoke in real-time (andrewkchan.dev) |
| 784 points by ibobev on Dec 19, 2023 | 169 comments |
|
| 30. | | SMERF: Streamable Memory Efficient Radiance Fields (smerf-3d.github.io) |
| 630 points by duckworthd on Dec 13, 2023 | 143 comments |
|