hi, I made this. thank you for posting. I love clocks and I love finding the edg...

jdietrich · 2025-11-15T02:20:47 1763173247

Clock drawing is widely used as a test for assessing dementia. Sometimes the LLMs fail in ways that are fairly predictable if you're familiar with CSS and typical shortcomings of LLMs, but sometimes they fail in ways that are less obvious from a technical perspective but are exactly the same failure modes as cognitively-impaired humans.

I think you might have stumbled upon something surprisingly profound.

https://www.psychdb.com/cognitive-testing/clock-drawing-test

overfeed · 2025-11-15T05:49:34 1763185774

> Clock drawing is widely used as a test for assessing dementia

Interestingly, clocks are also an easy tell for when you're dreaming, if you're a lucid dreamer; they never work normally in dreams.

ghurtado · 2025-11-15T07:06:28 1763190388

In lucid dreams there's a whole category of things like this: reading a paragraph of text, looking at a clock (digital or analog), or working any kind of technology more complex than a calculator.

For me personally, even light switches have been a huge tell in the past, so basically almost anything electrical.

I've always held the utterly unscientific position that this is because the brain only has enough GPU cycles to show you an approximation of what the dream world looks like, but to actually run a whole simulation behind the scenes would require more FLOPs than it has available. After all, the brain also needs to run the "player" threads: It's already super busy.

Stretching the analogy past the point of absurdity, this is a bit like modern video game optimizations: the mountains in the distance are just a painting on a surface, and the remote on that couch is just a messy blur of pixels when you look at it up close.

So the dreaming brain is like a very clever video game developer, I guess.

tablatom · 2025-11-15T07:20:39 1763191239

Wait, lucid dreamers need tells to know where they are?!?

Kiro · 2025-11-15T07:24:42 1763191482

Yes, that's how you enter the lucid state. You find ways to tell that you're dreaming and condition yourself to check for those while awake. Eventually you will do it inside a dream and realize that you're dreaming.

Kiboneu · 2025-11-15T08:05:22 1763193922

Yeah. It’s very common to notice anomalies inside of a dream. But the anomalies weave into the dream and feel normal. You don’t have much agency to enter a lucid state from a pre-lucid dream.

So the idea is to develop habits called “reality checks” when you are awake. You look for the broken clock kind of anomalies that the grandparent comment mentioned. You have to be open to the possibility of dreaming, which is hard to do.

Consider this difficulty. Are you dreaming?

…

How much time did it take to think “no”? Or did you even take this question seriously? Maybe because you are reading a hn comment about lucid dreams, that question is interpreted as an example instead of a genuine question worth investigating, right? That’s the difficulty. Try it again.

The key is that the habit you’re developing isn’t just the check itself — it’s the thinking that you have during the check, which should lead you to investigate.

You do these checks frequently enough you end up doing it in a dream. Boom.

There’s also an aspect of identifying recurring patterns during prelucidity. That’s why it helps to keep a dream journal for your non-lucid dreams.

There are other methods too.

lordnacho · 2025-11-15T14:00:26 1763215226

Didn't you ever watch Inception? You have to carry around a little spinning top to test which level of VM you're inside of.

conradev · 2025-11-15T17:14:39 1763226879

The first time it happened to me, it was accidental. I dreamed that I was in a college classroom but I realized that I never went to college. I was not trying to and had never lucid dreamed before, and so it was very surprising.

david-gpu · 2025-11-15T10:16:47 1763201807

Plenty of folks out there know when they are dreaming just like they know when they are awake. It varies from person to person.

DuperPower · 2025-11-15T12:04:51 1763208291

be careful as adding consciousness to a dream means CPU cycles so you wake Up more tired, its cool for kids and teens but grown adults shouldnt explore this to avoid bad rest

david-gpu · 2025-11-15T21:17:25 1763241445

Over time, with accumulated experience, all dreams are lucid from the start. Because of that they are very calm and pleasant; the dreamer is no longer reactive to what happens in the dream because they know nothing is at stake.

travisjungroth · 2025-11-15T13:38:52 1763213932

That’s a caution to getting addicted to it, but not never doing it. I’ve had powerful experiences in lucid dreaming that I wouldn’t trade for a little more rest. I was already in a retreat where I was basically resting all the time.

conradev · 2025-11-15T17:19:32 1763227172

I met someone once who claimed that he lucid dreams almost every night by default and it is exhausting. He smokes weed at night to avoid dreaming entirely. I didn’t dig in super deep, but it sounded pretty intense!

david-gpu · 2025-11-15T21:19:42 1763241582

IMO they would benefit from skipping the weed and instead continue to practice lucid dreaming. Over time they will develop their skill and will learn to simply contemplate the dream without reacting to it. It is a calming experience.

BoredomIsFun · 2025-11-15T18:32:26 1763231546

My brain learned how to maintain legible text in dreams, I cannot use it in lucid dreaming anymore...

danw1979 · 2025-11-15T08:02:20 1763193740

For me it’s phones… specifically dialling a number manually. No matter how carefully I dial, the number on the screen is rarely correct.

allarm · 2025-11-15T10:32:50 1763202770

It seems that I’ve been stuck in a lucid dream for a couple of decades, no matter how carefully write text on a phone keyboard it never comes out as intended.

luckman212 · 2025-11-15T15:50:50 1763221850

Tank ypu foe wriiting this

amelius · 2025-11-15T10:55:01 1763204101

Whenever I dial a number while in a dream, the person I'm trying to call always turns out to be right next to me.

biztos · 2025-11-15T17:42:42 1763228562

Do they look normal but just not work normally?

Maybe reality is a world of broken clocks, and they only “work” in the simulation.

teaearlgraycold · 2025-11-16T09:09:30 1763284170

I feel like the heuristic could just be - do I feel like I’m in a dream? Then I am. I’ve never felt that way when awake.

xrisk · 2025-11-15T03:49:22 1763178562

Maybe explainable via the fact that these tests are part of the LLM training set?

jorgesborges · 2025-11-15T04:24:58 1763180698

Conceptual deficit is a great failure mode description. The inability to retrieve "meaning" about the clock -- having some understanding about its shape and function but not its intent to convey time to us -- is familiar with a lot of bad LLM output.

BHSPitMonkey · 2025-11-15T17:48:15 1763228895

I would think the way humans draw clocks has more in common with image generation models (which probably do a bit better with this task overall) than a language model producing SVG markup, though.

ACCount37 · 2025-11-15T08:48:31 1763196511

LLMs don't do this because they have "people with dementia draw clocks that way" in their data. They do it because they're similar enough to human minds in function that they often fail in similar ways.

An amusing pattern that dates back to "1kg of steel is heavier of course" in GPT-3.5.

kaffekaka · 2025-11-15T09:46:11 1763199971

How do you know this?

Obviously, humans failing in these ways ARE in the training set. So it should definitely affect LLM output.

ACCount37 · 2025-11-15T09:59:08 1763200748

First: generalization. The failure modes extend to unseen tasks. That specific way to fail at "1kg of steel" sure was in the training data, but novel closed set logic puzzles couldn't have been. They display similar failures. The same "vibe-based reasoning" process of "steel has heavy vibes, feather has light vibes, thus, steel is heavier" produces other similar failures.

Second: the failures go away with capability (raw scale, reasoning training, test-time compute), on seen and unseen tasks both. Which is a strong hint that the model was truly failing, rather than being capable of doing a task but choosing to faithfully imitate a human failure instead.

I don't think the influence of human failures in the training data on the LLMs is nil, but it's not just a surface-level failure repetition behavior.

TheJoeMan · 2025-11-15T02:36:45 1763174205

Figure 6 with the square clock would be a cool modern art piece.

yencabulator · 2025-11-19T16:16:23 1763568983

I have had this thought of a slow-moving mechanical simulation of a chaotic triple pendulum as a clock hand for a very long time..

Or maybe something like https://www.youtube.com/watch?v=dhZxdV2naw8

bspammer · 2025-11-14T23:18:07 1763162287

If you're keeping all the generated clocks in a database, I'd love to see a Facemash style spin-off website where users pick the best clock between two options, with a leaderboard. I want to know what the best clock Qwen ever made was!

abixb · 2025-11-15T01:17:59 1763169479

We might be on to creating a new crowd-ranked LLM benchmark here.

addandsubtract · 2025-11-15T02:15:09 1763172909

A pelican wearing a working watch

danw1979 · 2025-11-15T08:03:38 1763193818

Using it to time bicycle race ?

nightpool · 2025-11-15T01:03:45 1763168625

Yes! Please do this

layer8 · 2025-11-15T16:56:47 1763225807

Not the best, but the most amusing.

smusamashah · 2025-11-15T01:28:03 1763170083

Please make it show last 5 (or some other number) of clocks for each model. It will be nice to see the deviation and variety for each model at a glance.

charliewallace · 2025-11-15T02:55:13 1763175313

Very cool! I also love clocks, especially weird ones, and recently put up this 3D Moebius Strip clock, hope you like it: https://www.mobiusclock.com

chemotaxis · 2025-11-14T23:54:24 1763164464

This is honestly the best thing I've seen on HN this month. It's stupid, enlightening... funny and profound and the same time. I have a strong temptation to pick some of these designs and build them in real life.

I applaud you for spending money to get it done.

AnonHP · 2025-11-15T03:47:50 1763178470

Could you please change and adjust the positions of the titles (like GPT 5)? On Firefox Focus on iOS, the spacing is inconsistent (seems like it moves due to the space taken by the clock). After one or two of them, I had to scroll all the way down to the bottom and come back up to understand which title is linked to which clock.

anigbrowl · 2025-11-14T20:10:30 1763151030

I really like this. The broken ones are sometimes just failures, but sometimes provide intriguing new design ideas.

jdiff · 2025-11-14T22:54:58 1763160898

This same principle is why my favorite image generation model is the earlier models from 2019-2020 where they could only reliably generate soup. It's like Rorschach tests, it's not about what's there, it's about what you see in them. I don't want a bot to make art for me, sometimes I just want some shroom-induced inspirational smears.

nemomarx · 2025-11-15T00:40:50 1763167250

I really miss that deepdream aesthetic with the dogs eyes popping up everywhere.

ks2048 · 2025-11-14T23:48:30 1763164110

Nice job! Maybe let users click an example to see the raw source (LLM output)

brianjking · 2025-11-15T03:48:20 1763178500

This is an awesome benchmark. Officially one of my favorites now. Thank you for making this.

csours · 2025-11-14T21:41:21 1763156481

LOVE IT!

It would be really cool if I could zoom out and have everything scale properly!

Fabricio20 · 2025-11-14T22:09:11 1763158151

Why is this different per user? I sent this to a few friends and they all see different things from what i'm seeing, for the same time..?

samtheprogram · 2025-11-14T22:31:41 1763159501

It regenerates on page load. I find that pretty useful.

Grok 4 and Kimi nailed it the first time for me, then only Kimi on the second pass.

malfist · 2025-11-15T02:55:03 1763175303

Not on page load, it regenerates every minute. There's a little hovering question mark in the top right that explains things, including the prompt to the models.

layer8 · 2025-11-15T16:58:44 1763225924

It’s different per minute, not per user.

hakcermani · 2025-11-15T00:56:24 1763168184

.. would you mind sharing the prompt .. in a gist perhaps .

ceroxylon · 2025-11-15T01:09:44 1763168984

They have it available on the site under the (?) button:

"Create HTML/CSS of an analog clock showing ${time}. Include numbers (or numerals) if you wish, and have a CSS animated second hand. Make it responsive and use a white background. Return ONLY the HTML/CSS code with no markdown formatting."