- Still on ada-002, planning on migrating later this week actually - Current pla...

ftkftk · on Feb 21, 2024

One strategy is to store the model version used for each vector, and then using the appropriate model to embed your queries. Of course that means that for a certain time frame you may need to blend two result sets, so that may not be ideal for your use case. You could also choose to just re-embed the last 90 days or so, or in your case on a project by project basis. Of course all of this raises your search complexity because you need to track your model versions.

From a vector store side we have worked with Weaviate, Qdrant, Pinecone, Milvus, and Elasticsearch. All of them had their pros and cons, and none of them were as stable as we liked once you really went to scale. Cloud deployments were rather pricey as well at that volume. We ended up with a mix of Qdrant Weaviate and Elastic for different workloads.