
May as well ask here: what is the best way to use something like an LLM as a personal knowledge base?

I have a few thousand books, papers, and articles collected over the last decade. And while I have meticulously categorised them for fast lookup, it's getting harder and harder to find the desired info, especially in categories I might not have explored recently.

I do have a 4070 (12 GB VRAM), so I thought that LLMs might be a solution. But trying to figure out the whats and hows has proven to be extremely complicated, what with the deluge of techniques (fine-tuning, RAG, quantisation) that may or may not be obsolete, too many grifters hawking their own startups with thin wrappers, and a general sense that the "new shiny object" is prioritised over actual stable solutions to real problems.




IMHO (and I'm no expert), this has been working well for me:

Segment the texts into chunks that make sense (i.e. into the lengths of text you'll want to find, whether that means chapters, sub-chapters, paragraphs, etc.), create an embedding of each chunk, and store the resulting vectors in a vector database. Your search workflow is then to create an embedding of your query and perform a distance comparison (e.g. cosine similarity) against the stored vectors, which returns ranked results. This way you can search your texts semantically (see the sketch below).
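
A minimal sketch of that in Python, using sentence-transformers for the embeddings and plain numpy for the similarity search. The model name, the paragraph-based chunking, and the file names are just example choices, and at library scale you'd want a real vector database rather than an in-memory matrix:

    # pip install sentence-transformers numpy
    import numpy as np
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer("all-MiniLM-L6-v2")  # small embedding model, fits easily in 12 GB VRAM

    # Naive chunking: split each document on blank lines (paragraphs).
    documents = {"paper.txt": open("paper.txt").read()}  # hypothetical corpus: file name -> full text
    chunks, sources = [], []
    for name, text in documents.items():
        for para in text.split("\n\n"):
            if para.strip():
                chunks.append(para.strip())
                sources.append(name)

    # Embed every chunk once; normalised vectors make dot product equal cosine similarity.
    chunk_vecs = model.encode(chunks, normalize_embeddings=True)

    def search(query, k=5):
        q = model.encode([query], normalize_embeddings=True)[0]
        scores = chunk_vecs @ q           # cosine similarity per chunk
        top = np.argsort(-scores)[:k]     # indices of the k best matches
        return [(sources[i], scores[i], chunks[i]) for i in top]

    for src, score, chunk in search("how do transformers handle long context?"):
        print(f"{score:.3f}  {src}  {chunk[:80]}")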

Everything I've mentioned above is fairly easily doable with existing LLM libraries like langchain or llamaindex. For reference, this is a RAG (retrieval-augmented generation) workflow.
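
For comparison, roughly the same pipeline in langchain comes out to a few lines. Import paths have moved around between langchain versions, so treat this as a sketch; FAISS and the MiniLM model are example choices, not the only options:

    # pip install langchain langchain-community faiss-cpu sentence-transformers
    from langchain.text_splitter import RecursiveCharacterTextSplitter
    from langchain_community.embeddings import HuggingFaceEmbeddings
    from langchain_community.vectorstores import FAISS

    # Chunk by character count with overlap, so matches aren't cut off mid-thought.
    splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
    docs = splitter.create_documents([open("paper.txt").read()])  # hypothetical document

    db = FAISS.from_documents(docs, HuggingFaceEmbeddings(model_name="all-MiniLM-L6-v2"))
    db.save_local("my_library_index")  # persist the index so you only embed once

    for doc in db.similarity_search("long context handling", k=5):
        print(doc.page_content[:80])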


Where are LLMs used in this workflow? For creating embeddings?


Yes.


How is this done? I’d like to try it out



https://khoj.dev promises this.


Look into AnythingLLM.



