I'm uninformed about this, it may just be superstition, but my feeling while usi...

hellcow · 2024-12-06T15:43:31 1733499811

I noticed this too! It's dramatic in the same chat. I'll come back the next day, and even though I still have the full convo history, and it's as if it completely forgot all my earlier instructions.

adr1an · 2024-12-08T09:09:28 1733648968

Makes sense. Keeping the conversation implieas that each new message carries the whole history, again. You need to create new chats from time to time, or throttle to a different model...

ToDougie · 2024-12-06T18:11:26 1733508686

This is my biggest gripe with these LLMs. I primarily use Claude, and it exhibits the same described behavior. I'll find myself in a flow state and then somewhere around hour 3 it starts to pretend like it isn't capable of completing specific tasks that it had been performing for hours, days, weeks. For instance, I'm working on creating a few LLCs with their requisite social media handles and domain registrations. I _used_ to be able to ask Claude to check all US State LLC registrations, all major TLD domain registrations, and USPTO against particular terms and similar derivations. Then one day it just decided to stop doing this. And it tells me it can't search the web or whatever. Which is bullshit because I was verifying all of this data and ensuring it wasn't hallucinating - which it never was.

wkat4242 · 2024-12-08T18:23:14 1733682194

Could it be that you're running out of available context in the thread you're in?

ToDougie · 2024-12-09T19:35:22 1733772922

Doubtful. I started new threads using carbon-copy prompts. I'll research some more to make sure I'm not missing anything, though.

__MatrixMan__ · 2024-12-06T21:36:00 1733520960

Did you ever read Accelerando? I think it involved a large number of machine generated LLCs...

ToDougie · 2024-12-09T19:34:18 1733772858

No, but I'll give the wikipedia summary a gander :)

handfuloflight · 2024-12-06T02:31:22 1733452282

Is that within the same chat?

__MatrixMan__ · 2024-12-06T21:27:19 1733520439

The flow lately has been transforming test cases to accommodate interface changes, so I'm not asking it to remember something from several hours ago, I'm just asking it to make the "same" transformation from the previous prompt, except now to a different input.

It struggles with cases that exceed 1000 lines or so. Not that it loses track entirely at that size, it just starts making dumb mistakes.

Then after about 2 or 3 hours, the size at which it starts to struggle drops to maybe 500. A new chat doesn't seem to help, but who can say, it's a difficult thing to quantify. After 12 hours, both me and the AI are feeling fresh again. Or maybe it's just me, idk.

And if you're about to suggest that the real problem here is that there's so much tedious filler in these test cases that even an AI gets bored with them... Yes, yes it is.