Ask HN: Have you hit a context ceiling with an LLM? What did you do about it?

korkybuchek · 2024-12-16T17:56:41 1734371801

Summarize along the way, instead of attempting to use the context like a database. Or look into RAG. Or use Gemini 1.5's 1-million-token window.
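
For anyone unfamiliar with RAG, the retrieval step can be sketched roughly as below. This is a toy illustration only: it scores chunks with bag-of-words cosine similarity instead of real embeddings, and the names `retrieve` and `build_prompt` are made up for the example, not any provider's API.

```python
import math
import re
from collections import Counter
from typing import List


def tokenize(text: str) -> List[str]:
    # Lowercase and split into alphanumeric word tokens.
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two word-count vectors.
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, chunks: List[str], k: int = 2) -> List[str]:
    # Rank stored chunks by similarity to the query and keep the top k.
    q = Counter(tokenize(query))
    ranked = sorted(
        chunks,
        key=lambda c: cosine(q, Counter(tokenize(c))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query: str, chunks: List[str], k: int = 2) -> str:
    # Only the retrieved chunks go into the prompt, not the whole history.
    context = "\n---\n".join(retrieve(query, chunks, k))
    return f"Context:\n{context}\n\nQuestion: {query}"
```

A real setup would likely swap the word counts for embedding vectors and a vector store, but the shape is the same: store chunks outside the context, then pull in only the few that match the current question.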

jMyles · 2024-12-16T20:55:03 1734382503

Those are great tips. The RAG thing looks doable. Is there a pre-fab solution, or do I need to use one of the APIs from the big-box providers?

My concern is: although it looks relatively easy to implement, I'm not sure I want to build a whole interface for it. I wish Cursor exposed, e.g., this:

https://github.com/anthropics/anthropic-cookbook/blob/main/s...

korkybuchek · 2024-12-16T22:02:54 1734386574

It's a bear to implement well. I'd start with Gemini and see how far that context window can take you. Then I'd work on summarizing fragments of earlier conversations down to key points to include in the broader conversation. Then I probably still wouldn't try to implement RAG unless time was not an issue.
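
The summarizing step could look something like the sketch below. Everything here is an assumption for illustration: the 4-characters-per-token estimate is a crude heuristic, `compact_history` is a made-up name, and `naive_summarize` is a stand-in for what would really be a call to an LLM asking for key points.

```python
from typing import Callable, List


def estimate_tokens(text: str) -> int:
    # Crude heuristic (assumption): roughly 4 characters per token.
    return max(1, len(text) // 4)


def naive_summarize(fragments: List[str]) -> str:
    # Placeholder summarizer: keeps the first sentence of each fragment.
    # In practice this would be a model call that extracts key points.
    return " ".join(
        f.split(".")[0].strip() + "." for f in fragments if f.strip()
    )


def compact_history(
    messages: List[str],
    budget: int,
    summarize: Callable[[List[str]], str] = naive_summarize,
    keep_recent: int = 4,
) -> List[str]:
    # If the history fits the token budget, leave it alone.
    total = sum(estimate_tokens(m) for m in messages)
    if total <= budget or len(messages) <= keep_recent:
        return list(messages)
    # Otherwise fold older messages into one summary and keep
    # the most recent messages verbatim.
    old, recent = messages[:-keep_recent], messages[-keep_recent:]
    return ["[Summary of earlier conversation] " + summarize(old)] + recent
```

Calling `compact_history(history, budget=...)` before each request keeps the recent turns intact while older ones shrink to a single summary message. The budget and `keep_recent` values are knobs to tune, not recommendations.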

jMyles · 2024-12-16T22:19:47 1734387587

Gotcha; thanks for the tip.

Gemini seems to be much, much worse at the interpersonal dynamics that are important to us. It can't seem to keep track of who has what life experiences and how they might be relevant to our problem-solving, where Claude does this rather well.

I'm working with Claude now to create a summary capsule.