I had a prompt I used for this just using Claude Code: Let's review <filepath or...

daheza · 2026-06-15T21:24:47 1781558687

I'm new to using more than one agent in a flow so forgive my ignorance here but I have a few questions.

Do you review all the files that are generated to ensure there's no hallucinations? Do you just review the last file of concise findings instead?

Is the intent here that the hallucinations will be countered by running through multiple agents that you end up with only the truth? Have you seen anything in the last version that was egregiously wrong?

I was worried about the cost but if you are using local hosted models, then I suppose you don't need to deal with that as much. Locally hosted models still have issues running commands locally and reaching out to the internet right? So this is all just them running with the context of the file, without reference tot he rest of the project?

Thanks for any responses to this.

all2 · 2026-06-15T21:50:21 1781560221

> Do you review all the files that are generated to ensure there's no hallucinations? Do you just review the last file of concise findings instead?

Sometimes, yes. Typically I'll read the final fusion doc, and then trace backwards if there is something that looks relevant. Sometimes I'll read all the abstracts as they come through.

> Is the intent here that the hallucinations will be countered by running through multiple agents that you end up with only the truth? Have you seen anything in the last version that was egregiously wrong?

The intent isn't so much avoiding hallucinations, rather I was attempting to acquire unique, domain specific insight. My read-through of the final doc is a 'pick and choose', where I weed out what isn't relevant, and keep what is. I haven't seen anything way out of whack, as agents will typically check each other and call each other out in the review/rebuttal stage.

> So this is all just them running with the context of the file, without reference tot he rest of the project?

This is run in an agent harness locally, so each reviewer has access to the whole project. The review rebuttal stage rarely sees agents re-read files, though, unless one of them is particularly aggressive and is going deep on something (GPT 5 series will do this to make a point, sometimes).

> Thanks for any responses to this.

No worries. This is pretty easy to try, even with something you don't own. You can run it in any harness that allows spawning sub-agents.

If there's a lot of interest, I could spin up a simple web app that does this just so folks can see it churn on a target git repo or project files or whatever.

kristjansson · 2026-06-15T21:33:27 1781559207

Seems like a lot of machinery over-and-above running the same review n times + aggregating the result. What led you to the design?

all2 · 2026-06-15T21:45:09 1781559909

The idea is pretty simple, identity drives focus and outcomes of each of the 'agents'. The review/rebuttal stage filters stuff that likely doesn't matter in the grand scheme (you'll find reviewers get pedantic and take hard lines on things that ultimately don't matter very much).

This is based on earlier work from late '25 where people were doing similar things. My added bit is the 'unique identities'. What I found was that the root agent typically picks identities relevant to the project, and so you get relevant 'data' or 'views' from a variety of angles.