I use PgVector myself but here's the advantages to a true vector db. - Vectors a...

whakim · 2024-01-26T08:18:42.000000Z

I'd also start with pgvector (it's easy to switch later), but the limitations around hybrid search and around filtering combined with ANN are real, and if you're doing any kind of RAG-like thing it's worth being aware of them up front. pgvector is also an open-source project with way less manpower behind it than a bunch of venture-backed companies, so while you can expect it to pick up important features, that takes much longer (support for HNSW indices was a good example).
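To make the filtering + ANN point concrete, here is a toy sketch, assuming the limitation meant is the usual post-filtering one: the index returns its k nearest candidates first and the metadata filter is applied afterwards. Exact search stands in for the ANN index, and the function names, scores, and `allowed` flags are all made up for illustration; this is not pgvector's code.

```python
def post_filter_top_k(scores, allowed, k):
    """Take the k best candidates first, then drop those that fail the filter."""
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    return [i for i in top if allowed[i]]


def pre_filter_top_k(scores, allowed, k):
    """Apply the filter first, then take the k best of what remains."""
    candidates = [i for i in range(len(scores)) if allowed[i]]
    return sorted(candidates, key=lambda i: scores[i], reverse=True)[:k]


# Hypothetical similarity scores and a metadata filter (e.g. "tenant matches").
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4]
allowed = [False, False, True, False, True, True]

post = post_filter_top_k(scores, allowed, 3)  # [2]: only one of the top 3 passes
pre = pre_filter_top_k(scores, allowed, 3)    # [2, 4, 5]: a full k results
```

With a selective filter, post-filtering can hand back far fewer than k rows even though enough matching rows exist, which is the kind of thing that hurts RAG-style retrieval.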

nikita · 2024-01-26T04:24:18.000000Z

What is taking the most time at scale? Is it ingest, index build, or lookups?

visarga · 2024-01-26T05:12:13.000000Z

Ingest and index build can both take time.

nikita · 2024-01-26T05:23:58.000000Z

What volumes are we talking about?

There are ways to speed things up dramatically. Index build just became multithreaded (see above).

We have ideas on what to do with ingest.

Also, do you ingest from S3?

visarga · 2024-01-26T05:25:33.000000Z

np.dot is also multi-threaded, based on BLAS
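
For reference, a minimal sketch of what brute-force lookup with a dot product looks like in NumPy. The corpus size, dimension, and the `top_k_dot` helper are assumptions for illustration, and whether the product actually runs on multiple threads depends on the BLAS build NumPy is linked against (OpenBLAS, MKL, etc.).

```python
import numpy as np


def top_k_dot(query, corpus, k):
    """Return indices of the k corpus rows with the largest dot product, best first."""
    # Matrix-vector product; for float arrays NumPy hands this to BLAS.
    scores = corpus @ query
    k = min(k, len(scores))  # assumes k >= 1
    # argpartition picks the k best in linear time; only those k are then sorted.
    top = np.argpartition(-scores, k - 1)[:k]
    return top[np.argsort(-scores[top])]


rng = np.random.default_rng(0)
corpus = rng.standard_normal((10_000, 128)).astype(np.float32)
# Normalize rows so the dot product equals cosine similarity.
corpus /= np.linalg.norm(corpus, axis=1, keepdims=True)

query = corpus[42]
result = top_k_dot(query, corpus, k=5)  # row 42 itself should rank first
```

This is an exact scan with no index build at all, so the trade-off is the opposite of HNSW: nothing to build at ingest, but every lookup touches every vector.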