More

fourthark · 2026-05-14T01:55:46 1778723746

Interesting use of evals.

Might help interpretation to say on the front page that it's a five point scale with 0 (or 1?) being the safest score. This can be picked up from colors and the bars in the individual reports, but it takes a minute to figure it out.

timf34 · 2026-05-14T12:42:14 1778762534

Good suggestion thank you! It's between 1-5 but I'll convert that to 1-100

fourthark · 2026-05-06T01:07:22 1778029642

Capitalism doesn't work for big infrastructure projects? Who knew?

lesam · 2026-05-06T01:18:51 1778030331

This is capitalism working as intended. Only the best run airlines can survive, and investors are collectively subsidizing air travel for non-investors.

DangitBobby · 2026-05-06T04:15:14 1778040914

Nothing in the article (or in the real world) even remotely suggests that "the best" airlines survive. Simply, the airlines that survive are the ones that survive.

fourthark · 2026-04-27T22:25:30 1777328730

Interesting, I got a completely different result on green/blue on this one, way more green whereas I got average on the individual test. Going between very different colors makes it hard to reset - they might consider breaks between spectra.

fourthark · 2026-04-25T18:42:00 1777142520

> You gasp. You hyperventilate. Your heart rate jumps. Your blood pressure climbs. All of this in a few seconds.

There's something especially creepy about AIs talking in the second person about biological processes they don't experience.

fourthark · 2026-04-23T02:03:51 1776909831

Yes, using nine specialized cameras. Still very impressive but the human is overmatched on equipment alone.

fourthark · 2026-04-16T04:12:02 1776312722

> reset the context

Yes. Do this. These problems likely mean you have muddled the context.

The article too long and I didn't read the whole thing, but I'm glad the author came to understand that arguing won't help.

fourthark · 2026-03-14T14:00:05 1773496805

> But if you are the kind of person who cries out against this abomination we must warn you that people who go through life expecting informal variant idioms in English to behave logically are setting themselves up for a lifetime of hurt.

fourthark · 2026-03-06T01:48:19 1772761699

That's easy to ignore.

fourthark · 2026-03-04T03:07:44 1772593664

I think the point is that you have a better idea of what you want it to remember and even a small hint can have big impact.

Just saying "write up what you know", with no other clues, should not perform any better than generic compaction.

fourthark · 2026-03-01T16:06:40 1772381200

Wish we could downvote articles. Is it legitimate to flag AI slop?