jwilliams · 2026-02-17T05:46:38 1771307198

This post confuses me a little. With my tests I try not to "reach inside" systems unless it's quite a specific integration test. Especially databases. In this case I feel like we're just... testing known PostgreSQL behavior?

Or to put it another way: as others have observed, this could be solved with atomic updates and in some cases SERIALIZABLE. These are the right tools for balance operations - and if they're used I'm not sure they need testing in this manner?
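For anyone unsure what "atomic update" means here, a minimal sketch. It uses Python's built-in sqlite3 as a stand-in for PostgreSQL so it runs anywhere; the `accounts` table and the `withdraw` helper are made up for illustration and are not from the article. The check and the debit happen in one statement, so there is no gap between reading the balance and writing it.

```python
import sqlite3


def withdraw(conn, account_id, amount):
    """Debit in a single UPDATE; returns True only if the balance covered it."""
    cur = conn.execute(
        "UPDATE accounts SET balance = balance - ? "
        "WHERE id = ? AND balance >= ?",
        (amount, account_id, amount),
    )
    conn.commit()
    # rowcount is 1 when the guarded UPDATE matched the row, 0 otherwise.
    return cur.rowcount == 1


# Hypothetical schema and data, in memory only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts (id INTEGER PRIMARY KEY, balance INTEGER NOT NULL)"
)
conn.execute("INSERT INTO accounts (id, balance) VALUES (1, 100)")
conn.commit()

first = withdraw(conn, 1, 80)   # succeeds, balance becomes 20
second = withdraw(conn, 1, 80)  # refused, balance stays 20
balance = conn.execute(
    "SELECT balance FROM accounts WHERE id = 1"
).fetchone()[0]
```

The same guarded `UPDATE ... WHERE balance >= amount` shape works in PostgreSQL; SERIALIZABLE is only needed when the invariant spans more than one row or statement.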

lirbank · 2026-02-17T06:18:53 1771309133

Fair concern about reaching inside systems - it's not something to do lightly. The hooks are designed to be minimal: production code never calls them, they only activate in tests. But the core point is narrower than the thread might suggest - the article isn't about whether to use atomic updates vs locks vs SERIALIZABLE. It's about this: when your code has operations that could race, how do you prove your handling actually works?
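To make the idea concrete, here is a rough sketch of a test-only hook that forces a race deterministically. This is an in-memory toy, not the article's actual hooks or a real database; `Account`, `naive_credit`, and the `after_read` parameter are all hypothetical names.

```python
class Account:
    def __init__(self, balance):
        self.balance = balance


def naive_credit(account, amount, after_read=None):
    """Read-modify-write with a test-only hook between the read and the write."""
    current = account.balance
    if after_read is not None:  # production callers never pass this
        after_read()
    account.balance = current + amount


def racing_credit():
    # Simulates a second writer landing inside the first writer's race window.
    naive_credit(account, 50)


account = Account(100)
naive_credit(account, 50, after_read=racing_credit)

# Two credits of 50 should give 200; the naive version loses one of them.
lost_update_balance = account.balance
```

A test built this way fails against the naive code every time instead of once in a thousand runs, and the same hook can then be pointed at the fixed code to show the lost update no longer occurs.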

jwilliams · 2026-02-18T04:05:32 1771387532

Fair enough, and no disagreement there. Perhaps (for me!) the example pulled against your core point.

jwilliams · 2026-02-16T05:11:55 1771218715

> Here's a thought. Lets all arbitrarily agree AGI is here.

A slightly different angle on this - perhaps AGI doesn't matter (or perhaps not in the ways that we think).

LLMs have changed a lot in software in the last 1-2 years (indeed, the last 1-2 months); I don't think it's a wild extrapolation to see that'll come to many domains very soon.

nradov · 2026-02-16T15:39:04 1771256344

Which domains? Will we see a lot of changes in plumbing?

joquarky · 2026-02-16T17:33:17 1771263197

If most of your work involves working with a monitor and keyboard, you're in one of the domains.

Even if it doesn't, you will be indirectly affected. People will flock to trades if knowledge work is no longer a source of viable income.

jwilliams · 2026-02-08T21:26:05 1770585965

Although there is the consistent trap of tools that assign threads/workers based on the number of cores (e.g. unit testing or bundling tools). This means the efficiency cores get dragged in and can absolutely tank the process.

This was particularly pronounced on the M1 due to the 50/50 split. We reduced the number of workers on our test suite based on the CPU type and it sped up considerably.
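A rough sketch of that kind of worker cap, not our actual config. `choose_workers` and `detect_performance_cores` are hypothetical helpers, and the `hw.perflevel0.physicalcpu` sysctl key is an assumption about what Apple Silicon macOS exposes; anywhere it is missing, the code falls back to the plain core count.

```python
import os
import platform
import subprocess


def choose_workers(logical_cores, performance_cores=None):
    """Prefer the performance-core count when known, else use all cores."""
    if performance_cores and 0 < performance_cores <= logical_cores:
        return performance_cores
    return max(1, logical_cores)


def detect_performance_cores():
    """Best-effort P-core count on macOS; None elsewhere or on any failure."""
    if platform.system() != "Darwin":
        return None
    try:
        out = subprocess.run(
            ["sysctl", "-n", "hw.perflevel0.physicalcpu"],  # assumed key
            capture_output=True,
            text=True,
            check=True,
        ).stdout.strip()
        return int(out)
    except (OSError, subprocess.CalledProcessError, ValueError):
        return None


workers = choose_workers(os.cpu_count() or 1, detect_performance_cores())
```

On a 4+4 M1, `choose_workers(8, 4)` gives 4 workers instead of 8, which is the sort of reduction described above.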

jwilliams · 2026-02-07T05:21:27 1770441687

It used to be Factorio for me (I live in Australia, so long flights happen a lot). The problem with Factorio is that the flight isn't long enough, and the game bleeds into 100+ hours post-flight.

mnw21cam · 2026-02-07T09:11:32 1770455492

Dwarf Fortress. That's really how to suddenly say "Oh, how did it get to 4am already?"

lossyalgo · 2026-02-07T14:02:30 1770472950

DF gets all the news (rightfully so, it's an epic game that I've dumped a ton of hours into) but if you haven't already, consider checking out Songs of Syx. It's like DF but multiplied by 100. You can have tens of thousands of citizens, doing most of the things they do in Dwarf Fortress, and a lot more, including waging huge wars against the neighbors. The limits of DF kinda made me sad, actually, that you are limited to so few Dwarves (and don't say it's because you want to know the story of all of them, because after 30 or so you lose track of who is who anyways, so might as well up the limit from 100 to 50K, or more? ;) Songs of Syx has also routinely been getting massive updates since 2020 and I have a feeling the code is a bit cleaner so the solo dev can add features faster (unlike DF's code base which is, according to one of the new devs, a nightmare to work with). It's a game that is rarely talked about but deserves a whole lot more love from gamers.

I don't mean to cast shade on DF, I really do love it, and am happy for its existence, I just think that DF fans should also look into Songs of Syx.

The defining difference for me is the generated stories in DF, which are often a lot of random trash but still give a feeling of a deeper meaning.

AngryData · 2026-02-11T00:36:58 1770770218

As a long time DF veteran who has installed but never played Songs of Syx, you convinced me to boot it up.

bpye · 2026-02-07T11:48:03 1770464883

I lost the best part of a week of my Christmas break to it when the Steam version was released a couple years back...

jwilliams · 2026-02-05T20:33:48 1770323628

I’ve definitely experienced a subjective regression with Opus 4.5 the last few days. Feels like I was back to the frustrations from a year ago. Keen to see if 4.6 has reversed this.

jwilliams · 2026-01-28T02:56:05 1769568965

> It's so interesting to watch an agent relentlessly work at something. They never get tired, they never get demoralized, they just keep going and trying things where a person would have given up long ago to fight another day. It's a "feel the AGI" moment to watch it struggle with something for a long time just to come out victorious 30 minutes later.

This is true... Equally I've seen it dive into a rabbit hole, make some changes that probably aren't the right direction... and then keep digging.

This is way more likely with Sonnet; Opus seems to be better at avoiding it. Sonnet would happily modify every file in the codebase trying to get a type error to go away. If I prompt "wait, are you off track?" it can usually course correct. Again, Opus seems way better at that part too.

Admittedly this has improved a lot lately overall.

gregjor · 2026-01-28T10:29:31 1769596171

I don't understand why anyone finds it interesting that a machine, or chatbot, never tires or gets demoralized. You have to anthropomorphize the LLM before you can even think of those possibilities. A tractor never tires or gets demoralized either, because it can't. Chatbots don't "dive into a rabbit hole ... and then keep digging" because they have superhuman tenacity, they do it because that's what software does. If I ask my laptop to compute the millionth Fibonacci number it doesn't sigh and complain, and I don't think it shows any special qualities unless I compare it to a person given the same job.

akoboldfrying · 2026-01-28T11:37:31 1769600251

You're a machine. You're literally a wet, analog device converting some forms of energy into other forms just like any other machine as you work, rest, type out HN comments, etc. There is nothing special about the carbon atoms in your body -- there's no metadata attached to them marking them out as belonging to a Living Person. Other living-person-machines treat "you" differently than other clusters of atoms only because evolution has taught us that doing so is a mutually beneficial social convention.

So, since you're just a machine, any text you generate should be uninteresting to me -- correct?

Alternatively, could it be that a sufficiently complex and intricate machine can be interesting to observe in its own right?

suddenlybananas · 2026-01-28T11:56:35 1769601395

If humans are machines, they are still a subset of machines and they (among other animals) are the only ones who can be demotivated and so it is still a mistake to assume an entirely different kind of machine would have those properties.

>Other living-person-machines treat "you" differently than other clusters of atoms only because evolution has taught us that doing so is a mutually beneficial social convention

Evolution doesn't "teach" anything. It's just an emergent property of the fact that life reproduces (and sometimes doesn't). If you're going to have this radically reductionist view of humanity, you can't also treat evolution as having any kind of agency.

sponaugle · 2026-01-28T16:03:41 1769616221

"If humans are machines, they are still a subset of machines and they (among other animals) are the only ones who can be demotivated and so it is still a mistake to assume an entirely different kind of machine would have those properties."

Yet.

suddenlybananas · 2026-01-28T16:06:25 1769616385

Sure, but the entire context of the discussion is surprise that they don't.

sponaugle · 2026-01-28T16:23:02 1769617382

Agreed - There is no guarantee of what will happen in the future. I'm not for or against the outcome, but certainly curious to see what it is.

spopejoy · 2026-01-28T16:43:46 1769618626

Humans and all other organisms are "literally" not machines or devices by the simple fact that those terms refer to works made for a purpose.

Even as an analogy "wet machine" fails again and again to adequately describe anything interesting or useful in life sciences.

gregjor · 2026-01-28T19:12:13 1769627533

Wrong level of abstraction. And not the definition of machine.

I might feel awe or amazement at what human-made machines can do -- the reason I got into programming. But I don't attribute human qualities to computers or software, a category error. No computer ever looked at me as interesting or tenacious.

jwilliams · 2026-01-26T09:38:09 1769420289

It is, they've just aligned them under the same umbrella. It's literally "Workers & Pages" under the CF navigation.

h33t-l4x0r · 2026-01-26T11:11:49 1769425909

It is mostly just workers though. There's a teeny tiny link there that says "are you looking for pages?".

tombert · 2026-01-26T18:50:54 1769453454

Pages still seem to work regardless. It was free and it meant that I didn’t have to host it on my own hardware anymore.

jwilliams · 2025-08-07T00:16:12 1754525772

Yes + Appears it's a rigid structure w/ the engine pushing from the back? At 0.1g I suspect even with advanced composites only a few km would be possible.

jwilliams · 2025-06-10T01:33:30 1749519210

This is Windows Aero all over again - why is this a persistent design?

You can't see or process the information behind the glass - at best it's major cognitive load to do so, at worst it's just very noisy with zero added information.

bombcar · 2025-06-10T03:07:10 1749524830

Because it looks really good in a five minute demonstration to the C-level execs.

jwilliams · 2025-04-10T03:58:26 1744257506

That, and because the UK uses ring circuits, which were seen as a solution to a copper shortage at the time.

Ring circuits are generally rated way higher in power - a lot more than a single appliance needs or could handle. Hence the need for a fused plug.

The upside is each socket can take a 13A plug (circa 3kW), whereas a standard US socket maxes out closer to 1.8kW.
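The arithmetic behind those two figures, as a quick sketch. The nominal voltages (230 V for the UK, 120 V for the US) and the 15 A US circuit rating are my assumptions here, not figures quoted above.

```python
def max_power_watts(volts, amps):
    """Power available at a socket: P = V * I."""
    return volts * amps


# Assumed nominal values: UK 230 V with a 13 A fused plug,
# US 120 V on a 15 A circuit.
uk_watts = max_power_watts(230, 13)
us_watts = max_power_watts(120, 15)
```

That works out to 2990 W (circa 3kW) for the UK socket and 1800 W (1.8kW) for the US one.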