Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

If anybody was confused by the title:

Vectorizing usually refers to compilers (or people) re-writing code taking advantage of vector instructions (like AVX, SSE).

What the author did is more commonly reffered to as embedding by the ML community.

So, a better title would be: Generating embeddings for the Code of Federal Regulations.



Actually it does both :)

Mighty is built in Rust with avx turned on. But I get your point. I kinda dislike the name embeddings. It’s super niche, and not easy to explain to most people I talk to that aren’t working in ML. I always arrive at saying “well, embeddings are a vector…”. So this saves a step.

Many more people know what a vector is, if they have some college math experience.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: