The title says you replaced RAG, but ChromaFs is still querying Chroma on every command — you replaced RAG's interface, not RAG itself. Which is actually the more interesting finding: the retrieval was never the bottleneck, the abstraction was.
Agents don't need better search. They need `grep`.
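To make the "agents need grep" point concrete, here's a minimal sketch of the kind of retrieval agents actually use: plain regex search over files, no embeddings involved. `grepFiles` and `FileEntry` are illustrative names I made up, not part of ChromaFs.

```typescript
// A grep-style search over in-memory files: exact pattern matching,
// line numbers included, no vector index required.
type FileEntry = { path: string; text: string };
type Hit = { path: string; line: number; text: string };

function grepFiles(files: FileEntry[], pattern: RegExp): Hit[] {
  const hits: Hit[] = [];
  for (const f of files) {
    f.text.split("\n").forEach((lineText, i) => {
      // Note: pattern should not use the /g flag, since .test()
      // with /g/ advances lastIndex between calls.
      if (pattern.test(lineText)) {
        hits.push({ path: f.path, line: i + 1, text: lineText });
      }
    });
  }
  return hits;
}
```

The agent asks for a literal symbol and gets back exact file/line locations it can open next, which is usually what a coding agent wants from "retrieval" anyway.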
Congrats, this looks neat, and surely great to have more TS products in the ecosystem.
One plugin or feature I'd like to see in an AI gateway:
*Cache* per unique request.
So if I send the same request (system, messages, temperature, etc.), I'd have the option to pull it from a cache (if it was already populated) and skip the LLM generation. This is much faster and cheaper - especially during development and testing.
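A minimal sketch of what that per-request cache could look like: hash the canonicalized request fields and memoize the response. All the names here (`LlmRequest`, `cachedGenerate`) are illustrative, not any gateway's actual API.

```typescript
import { createHash } from "node:crypto";

// Cache key = SHA-256 of the request fields that determine the output.
type LlmRequest = {
  system: string;
  messages: { role: string; content: string }[];
  temperature: number;
};

const cache = new Map<string, string>();

function cacheKey(req: LlmRequest): string {
  // Rebuild the object with a fixed key order so JSON.stringify
  // is deterministic for identical requests.
  const canonical = JSON.stringify({
    system: req.system,
    messages: req.messages,
    temperature: req.temperature,
  });
  return createHash("sha256").update(canonical).digest("hex");
}

async function cachedGenerate(
  req: LlmRequest,
  generate: (r: LlmRequest) => Promise<string>,
): Promise<string> {
  const key = cacheKey(req);
  const hit = cache.get(key);
  if (hit !== undefined) return hit; // cache hit: skip the LLM call
  const out = await generate(req);   // cache miss: call the model once
  cache.set(key, out);
  return out;
}
```

Sending the same request twice only invokes the model once; the second call returns instantly from the map, which is exactly the dev/test speedup described above.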
Thank you! We have built out the cache system -- we do both simple caching (matching the request strings 100%) and also do semantic caching (returning a cache hit for semantically similar requests). More here - https://portkey.ai/docs/product/ai-gateway-streamline-llm-in...
The caching part isn't open source yet, but part of our internal workers. Would be very cool to open source it!
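Since the semantic side isn't open source, here's a rough sketch of the general technique (not Portkey's implementation): embed each cached prompt, and on lookup return the stored response whose embedding is closest to the new prompt, above a similarity threshold. The embedding step is elided; the 3-dim vectors are purely illustrative.

```typescript
// One cached entry: the prompt's embedding plus the stored response.
type Entry = { vec: number[]; response: string };

function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] ** 2;
    nb += b[i] ** 2;
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb) || 1);
}

// Return the best cached response with similarity >= threshold, else null.
function semanticLookup(
  entries: Entry[],
  queryVec: number[],
  threshold = 0.9,
): string | null {
  let best: Entry | null = null;
  let bestSim = threshold;
  for (const e of entries) {
    const sim = cosine(e.vec, queryVec);
    if (sim >= bestSim) {
      bestSim = sim;
      best = e;
    }
  }
  return best ? best.response : null;
}
```

The threshold is the whole trade-off here: too low and you serve stale answers to genuinely different questions, too high and it degenerates into exact matching.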
I pushed ChatGPT to its coding limits in a JavaScript (Node.js) interview flow, and learned how it reasons about the various challenges it's presented with.
I was blown away by what this magnificent AI can do - it seems ChatGPT can make the cut for a junior Node.js developer role :)
I get your point. There is certainly a spectrum here.
I can tell you (as the author) that for mission-critical assets with fewer 3rd-party dependencies, most prefer to use a hard-coded policy, or pull it via API per build in the CI/CD.
However, for more dynamic websites (like blogs) that tend to have many 3rd-party dependencies, it's very useful and effective to be able to update the policy with one click (or even automatically).