Mighty is built in Rust with AVX turned on. But I get your point. I kinda dislike the name “embeddings”. It’s super niche, and not easy to explain to most people I talk to who aren’t working in ML. I always arrive at saying “well, embeddings are a vector…”. So this saves a step.
Many more people know what a vector is, if they have some college math experience.
Vectorizing usually refers to compilers (or people) rewriting code to take advantage of vector instructions (like AVX, SSE).
What the author did is more commonly referred to as embedding by the ML community.
So, a better title would be: Generating embeddings for the Code of Federal Regulations.
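To make the two meanings concrete, here is a rough Python sketch. Everything in it is made up for illustration: `scaled_sum` is just the kind of elementwise loop a compiler might rewrite with AVX/SSE in a compiled language (plain Python won't do that), and `toy_embed` is a hashed bag-of-words stand-in, not a learned model and not what Mighty or the author actually uses.

```python
import hashlib


def scaled_sum(a, b, k):
    """Elementwise k*x + y. In a compiled language, a loop like this is what
    'vectorizing' usually means: the compiler may emit SIMD (AVX/SSE) code."""
    return [k * x + y for x, y in zip(a, b)]


def toy_embed(text, dim=8):
    """Map text to a fixed-length, unit-norm vector.

    Hypothetical stand-in for an embedding model: each token is hashed into
    one of `dim` buckets. A real model would place similar texts near each
    other; this toy only shows the 'text in, vector out' shape of the task.
    """
    vec = [0.0] * dim
    for token in text.lower().split():
        digest = hashlib.md5(token.encode("utf-8")).digest()
        vec[digest[0] % dim] += 1.0
    norm = sum(x * x for x in vec) ** 0.5
    # Leave the all-zero vector alone for empty input.
    return [x / norm for x in vec] if norm else vec


if __name__ == "__main__":
    print(scaled_sum([1, 2], [3, 4], 2))
    print(toy_embed("Code of Federal Regulations"))
```

The first function is about how fast arithmetic runs; the second is about turning text into numbers. Same word, "vector", in both, which is where the confusion comes from.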