This tutorial is very complex. Here's how to get free semantic search with much less complexity:
1. Install sentence-transformers [1]
2. Initialize the MiniLM model - `model = SentenceTransformer('all-MiniLM-L6-v2')`
3. Embed your corpus [2]
4. Embed your queries, then search the corpus
This runs at roughly 750 sentences/second on CPU and about 18k sentences/second on GPU. You can embed paragraphs instead of sentences if you need longer passages. The embeddings are accurate [3] and only 384 dimensions, so they're space-efficient [4].
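For concreteness, here's a minimal sketch of steps 1-4 using sentence-transformers' built-in `util.semantic_search` helper; the corpus and query are placeholders:

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer('all-MiniLM-L6-v2')

# Placeholder corpus; swap in your own documents.
corpus = [
    "A man is eating food.",
    "A monkey is playing drums.",
    "Someone is riding a horse.",
]
# Each embedding is a 384-dimensional tensor.
corpus_embeddings = model.encode(corpus, convert_to_tensor=True)

# Embed the query, then rank the corpus by cosine similarity.
query_embedding = model.encode("What is the man doing?", convert_to_tensor=True)
hits = util.semantic_search(query_embedding, corpus_embeddings, top_k=2)[0]
for hit in hits:
    print(corpus[hit['corpus_id']], hit['score'])
```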
Here's how to handle persistence. I recommend starting with the simplest strategy, and only getting more complex if you need higher performance (sketches of the first two options follow the list):
- Just save the embedding tensors to disk, and load them when you need them later.
- Use Faiss [5] to store the embeddings; it builds an index so retrieval is faster.
- Use pgvector, a Postgres extension that stores embeddings.
- If you really need it, use a dedicated vector database like Qdrant, Weaviate, or Pinecone.
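The first option is a couple of lines with torch (assuming the `corpus_embeddings` tensor from the snippet above; the filename is just an example):

```python
import torch

# Persist the tensors once, reload them on later runs.
torch.save(corpus_embeddings, "corpus_embeddings.pt")
corpus_embeddings = torch.load("corpus_embeddings.pt")
```

And a sketch of the Faiss option, using an exact inner-product index; normalizing the vectors first makes inner product equal cosine similarity:

```python
import faiss

# encode() returns float32 numpy arrays by default.
emb = model.encode(corpus)
faiss.normalize_L2(emb)                  # inner product == cosine similarity
index = faiss.IndexFlatIP(emb.shape[1])  # exact search; use an ANN index at scale
index.add(emb)
faiss.write_index(index, "corpus.faiss")

# Later: reload the index and search with an embedded query.
index = faiss.read_index("corpus.faiss")
query = model.encode(["What is the man doing?"])
faiss.normalize_L2(query)
scores, ids = index.search(query, 2)
```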
This setup is much simpler and cheaper than wiring up a ton of cloud services to do embeddings. I've used it for https://www.endless.academy and https://www.dataquest.io, and it's worked well in production. I don't know why people make semantic search so complex.
TBH, it does not look "less complex" to me, not at all. :) Install this, install that... but where do you install and run all of it? The topic is "serverless": that means you don't run anything yourself, you just need two cloud APIs and a Lambda script.
How would you host the sentence-transformers model for free? You need it to vectorize each query, so it has to be hosted somewhere. Is there any way to do that for free?
Just run it on CPU, on your own machine; that's the cheapest way. You could also use a free or cheap VPS, and even parallelize across multiple machines or cores if you need to (see the sketch below).
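If you do need more throughput, sentence-transformers ships a multi-process encoder; here's a sketch, with an arbitrary worker count and a placeholder corpus:

```python
from sentence_transformers import SentenceTransformer

if __name__ == "__main__":  # guard required for multiprocessing
    model = SentenceTransformer('all-MiniLM-L6-v2')
    sentences = [f"sentence {i}" for i in range(100_000)]  # placeholder corpus

    # One worker per device; here, four CPU processes.
    pool = model.start_multi_process_pool(target_devices=["cpu"] * 4)
    embeddings = model.encode_multi_process(sentences, pool)
    model.stop_multi_process_pool(pool)
    print(embeddings.shape)  # (100000, 384)
```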
Maybe I'm grumpy today but I am shocked at how many responses you are getting where people think this is a novel idea. Has the engineering mindset really shifted into a default of "buy" even when build could take less than a week?
I was surprised, too, but then I realized they all work at Qdrant.
But the general dialogue around AI-related tooling is surprising to me. The production parts of LangChain, embedding services, and similar tools can usually be built in a few hours, with better observability, performance, and maintainability.
[1] https://www.sbert.net/
[2] https://www.sbert.net/examples/applications/semantic-search/...
[3] https://huggingface.co/blog/mteb
[4] https://medium.com/@nils_reimers/openai-gpt-3-text-embedding...
[5] https://github.com/facebookresearch/faiss