As someone who has been using pgvector for a while and is vaguely curious about ...

serjester · 2024-01-26T03:54:45.000000Z

I use PgVector myself, but here are the advantages of a true vector DB.

- Vectors are massive data-wise. In our current production database they take up 95% of the memory. Should they be stored separately?

- Better support for easily re-embedding, hybrid search, certain RAG workflows

- Stronger performance once you're dealing with millions of vectors.

I would still stick with PgVector until you're dealing with non-trivial scale.
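
To put a rough number on "massive", here is a back-of-the-envelope sketch. It assumes 4-byte floats per dimension and ignores per-row and index overhead; the 5 million rows and 1536 dimensions are made-up figures for illustration, not numbers from our database.

```python
def vector_storage_bytes(num_vectors: int, dims: int, bytes_per_value: int = 4) -> int:
    """Raw size of dense vectors, ignoring per-row and index overhead.

    bytes_per_value=4 assumes single-precision floats (an assumption).
    """
    return num_vectors * dims * bytes_per_value


# Hypothetical workload: 5 million embeddings with 1536 dimensions each.
raw = vector_storage_bytes(5_000_000, 1536)
print(f"{raw / 1e9:.1f} GB")  # prints "30.7 GB"
```

An ANN index adds to this, so the real footprint will be larger than the raw estimate.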

whakim · 2024-01-26T08:18:42.000000Z

I'd also start with pgvector (it's easy to switch), but the limitations around hybrid search and filtering + ANN are real, and if you're doing any kind of RAG-like thing it's worth being aware of them upfront. pgvector is also an open-source project with way less manpower behind it than a bunch of venture-backed companies, so while you can expect it to pick up important features, it takes much longer (support for HNSW indices was a good example).
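
A minimal sketch of the filtering + ANN problem, using NumPy with exact search standing in for an approximate index (an assumption to keep it self-contained; the function names and data are hypothetical). If the filter is applied after the nearest-neighbor step, you can get back fewer than k rows even though enough matching rows exist.

```python
import numpy as np


def top_k(query, vectors, k):
    """Indices of the k highest inner-product scores (exact, stand-in for ANN)."""
    scores = vectors @ query
    return np.argsort(-scores)[:k]


def post_filter_search(query, vectors, keep_mask, k):
    """Fetch k neighbors first, then drop rows that fail the filter."""
    candidates = top_k(query, vectors, k)
    return [int(i) for i in candidates if keep_mask[i]]


def pre_filter_search(query, vectors, keep_mask, k):
    """Restrict to rows that pass the filter, then take the k best."""
    allowed = np.flatnonzero(keep_mask)
    scores = vectors[allowed] @ query
    return [int(i) for i in allowed[np.argsort(-scores)[:k]]]


rng = np.random.default_rng(0)
vectors = rng.normal(size=(1000, 32)).astype(np.float32)
query = rng.normal(size=32).astype(np.float32)
keep_mask = np.arange(1000) % 20 == 0  # selective filter: 50 of 1000 rows pass

post = post_filter_search(query, vectors, keep_mask, k=10)
pre = pre_filter_search(query, vectors, keep_mask, k=10)
print(len(post), "rows after post-filtering;", len(pre), "rows after pre-filtering")
```

Pre-filtering always returns 10 rows here, while post-filtering only returns whichever of the global top 10 happen to pass the filter. The usual workarounds are over-fetching or filtering inside the index traversal, which is where dedicated engines tend to have put more work.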

nikita · 2024-01-26T04:24:18.000000Z

What is taking the most time at scale? Is it ingest, index build, or lookups?

visarga · 2024-01-26T05:12:13.000000Z

ingest and index build can take time

nikita · 2024-01-26T05:23:58.000000Z

What volumes are we talking about?

There are ways to speed things up dramatically. Index build just became multithreaded (see above).

We have ideas on what to do with ingest.

Also, do you ingest from S3?

visarga · 2024-01-26T05:25:33.000000Z

np.dot is also multi-threaded, based on BLAS
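
For reference, a sketch of what brute-force exact search with `np.dot` can look like. The helper name and the random data are hypothetical, and whether the matrix-vector product actually runs on multiple threads depends on the BLAS library NumPy was built against.

```python
import numpy as np


def exact_top_k(query, matrix, k):
    """Exact cosine-similarity search: one matrix-vector product, no index."""
    q = query / np.linalg.norm(query)
    m = matrix / np.linalg.norm(matrix, axis=1, keepdims=True)
    scores = np.dot(m, q)  # delegated to BLAS for float arrays
    # Pick the k best in O(n), then sort only those k by descending score.
    top = np.argpartition(-scores, k - 1)[:k]
    return top[np.argsort(-scores[top])]


rng = np.random.default_rng(1)
matrix = rng.normal(size=(10_000, 128)).astype(np.float32)
query = matrix[42].copy()  # query with a stored vector, so row 42 should rank first

result = exact_top_k(query, matrix, k=5)
print(result[0])
```

There is no index to build here, so ingest is just appending rows; the cost moves to every lookup scanning the full matrix.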

fxtentacle · 2024-01-26T04:33:19.000000Z

If you're still in the "millions of documents" scale range, then PostgreSQL on a beefy EPYC can probably handle everything fast enough so that it doesn't make sense to spend engineering time on using a vector db which would only shave off a few ms in latency.

MarkMarine · 2024-01-26T03:47:27.000000Z