Nice, thanks for the feedback. I have a prototype of also using the embeddings for categorizing the steps, with "tags/labels". Almost take it as a challenge to be able to reason better with a smaller modell than those >70B that you can not run on your own laptop.
I actually built something similar to this a couple days ago for finding duplicate bugs in our gh repo. Some differences:
* I used json to store the blobs in sqlite instead of converting it to byte form (I think they're roughly equivalent in the end?)
* For the distances calculations I use `numpy.linalg.norm(a-b)` to subtract the two vectors and then take the normal
* `ollama.embed()` and `ollama.generate()` will cut down on the requests code
[1] https://github.com/ollama/ollama-python