(I guess DarkShikari's comment is nested too deeply for me to reply directly.) I... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

_ihaque on April 24, 2013 | parent | context | favorite | on: Introduction to AVX2 optimizations in x264

(I guess DarkShikari's comment is nested too deeply for me to reply directly.)

In my (admittedly limited) experience [1], the compiler has actually done pretty decently at optimizing register allocation in intrinsic-heavy loops. I wrote out the assembly loop in [2] with manual allocation into all 16 XMMs and then noticed the compiler managed to optimize 1 of them out.

[1] https://github.com/simtk/IRMSD

[2] https://github.com/SimTk/IRMSD/blob/master/python/IRMSD/theo...

jamesaguilar on April 24, 2013 [–]

You can always reply directly by clicking on the link above a person's comment. Thanks for the interesting discussion on this one guys.

_ihaque on April 24, 2013 | [–]

Thanks for the pointer! That's what I get for lurking for too long.

Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact