More

hmaxdml · 2026-06-06T11:53:53 1780746833

The database is exactly the hardcore piece of engineering that's been designed to scale and be fault tolerant for decades

hmaxdml · 2026-06-06T11:52:01 1780746721

Because you likely already have a database and likely don't need to bring on an entire new distributed system to orchestrate your workflows.

hmaxdml · 2026-06-06T11:43:59 1780746239

A PG-backed queue is in code right after being in PG, and the beauty of a neat durable queue framework is in exposing it conveniently and efficiently.

hmaxdml · 2026-06-02T16:23:20 1780417400

Postgres does scale pretty well: https://www.dbos.dev/blog/benchmarking-workflow-execution-sc...

Tens of thousands of workflows per second

hmaxdml · 2026-05-30T20:54:42 1780174482

That's why their entire business model -- like Astronomer's -- is geared toward cloud hosting. The architecture is so complex it takes a full time team to run it.

hmaxdml · 2026-05-30T20:51:42 1780174302

Have you looked into DBOS? Same thesis: durable and reliable workflows are hard to manage -- it just doesn't have to be as hard as Temporal makes it be :)

bitexploder · 2026-06-02T13:25:10 1780406710

Have not. For my workflows this was fine. Good to keep in mind thoUgh. I don’t plan to manage a truly distributed system with it. Plus my only reason to do so is professional and we rolled our own system here due to our size solutions like DBOS or Temporal would not work well.

hmaxdml · 2026-05-29T22:18:07 1780093087

DBOS python supports SQLite. Go is supporting it next release

hmaxdml · 2026-05-28T23:07:06 1780009626

I've talked to dozens of engineers who built their home grown "durable" stack. Most of them eventually moved on to buying vs building, when their system actually scaled. It's just not a side-hustle to build a foundational reliability layer.

OutOfHere · 2026-05-29T12:01:34 1780056094

That argument comes down to the scalability of RabbitMQ or one's database, both of which can scale fairly well, but require tuning. In the absolute worst case, one would have to use a distributed cloud database, e.g. AWS Aurora or AWS DynamoDB, otherwise a self-hosted one, e.g. TiDb or YugabyteDb, but far less than 1% of users would even need anything like it.

In the pre-AI era, the argument of using a third party tool or service even had some weight, but today, AI can even do much of the heavy work when pointed in the right direction wrt using the aforementioned. For the majority of users, a SQLite database will do the job.

hmaxdml · 2026-05-28T23:03:40 1780009420

Yeah, we've observed that too: people start implementing their own retry logic, idempotency, etc. But then they grow a hard to maintain, complex stack that's not their core business logic. There's a reason why there is a dedicated team building DBOS, every day. Because it's not that easy to build a solid durable workflows engine on Postgres.

hmaxdml · 2026-05-28T19:46:39 1779997599

Listen/notify is poised to become much better in PG 18 and 19

stuartaxelowen · 2026-05-28T19:55:05 1779998105

Why’s that?

TkTech · 2026-05-28T20:17:04 1779999424

In pg19 https://git.postgresql.org/gitweb/?p=postgresql.git;a=commit... will land, which significantly improves NOTIFY performance. Right now LISTEN/NOTIFY doesn't scale to very busy instances because a `NOTIFY` within a transaction takes a global lock.

ivanr · 2026-05-28T20:19:05 1779999545

More context: https://www.recall.ai/blog/postgres-listen-notify-does-not-s...

doctorpangloss · 2026-05-28T21:47:49 1780004869

Well another POV is, AWS sells RDS instances capable of global lock NOTIFY. Clearly people have been using it despite it being really slow.

It's a terrible architecture but does it matter? This article should really say "AWS is a useful but expensive way to run your apps," which isn't say much of anything at all.