For standard datasets we use OpenAI ada embeddings, for hybrid its instructor + ...

andrewlu0 on March 31, 2023 | parent | context | favorite | on: Launch HN: Baseplate (YC W23) – Back end-as-a-serv...

For standard datasets we use OpenAI ada embeddings, for hybrid its instructor + SPLADE. The HyDE toggle in the context variable feeds the query to a prompt first ("Generate a document that answers..."), before embedding. I think in the paper they use the contriever embedding model but we just use the ones supported on our platform

ttul on March 31, 2023 [–]

I think we will be hearing a lot more about HyDE. It’s a neat trick.