Sure, but there are some fundamentals about latency that any programmer should know [0] (absolute values outdated, but still useful as relative comparisons), like “network calls are multiple orders of magnitude slower than IPC.”
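To make the relative gap concrete, here's a minimal sketch (my own, not from the article) that times an in-process call against a loopback HTTP round trip; any real network hop only widens the difference further, and a cross-region call adds tens of milliseconds on top.

```go
// Rough illustration of the orders-of-magnitude gap those latency tables
// describe: an in-process call vs. an HTTP round trip on loopback.
package main

import (
	"fmt"
	"io"
	"net/http"
	"net/http/httptest"
	"time"
)

func work(n int) int { return n * 2 } // stand-in for an in-process call

func main() {
	// In-process call: typically nanoseconds.
	start := time.Now()
	for i := 0; i < 1_000_000; i++ {
		_ = work(i)
	}
	fmt.Println("in-process call:    ", time.Since(start)/1_000_000)

	// Loopback HTTP call: typically tens to hundreds of microseconds,
	// before any real network distance is involved at all.
	srv := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		io.WriteString(w, "ok")
	}))
	defer srv.Close()

	start = time.Now()
	const reqs = 100
	for i := 0; i < reqs; i++ {
		resp, err := http.Get(srv.URL)
		if err != nil {
			panic(err)
		}
		io.Copy(io.Discard, resp.Body)
		resp.Body.Close()
	}
	fmt.Println("loopback HTTP call: ", time.Since(start)/reqs)
}
```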
I’m assuming you’re an employee of the company based on your comments, so please don’t take this poorly - I applaud any and all public efforts to bring back sanity to modern architecture, especially with objective metrics.
And yeah, you're right, in hindsight it was a terrible idea to begin with.
I thought it could work but didn’t benchmark it enough and didn’t plan enough. It all looked great in early POCs and all of these issues cropped up as we built it
You don't need experience, and there isn't really a lot to know about "distributed systems" in this case; it's basic CS knowledge about networks, latency, and what "serverless" actually is. You can read about it.
To be honest, it reads to me like people who don't understand the problem they're solving and haven't acquired the necessary knowledge to solve it (either by learning it themselves or by asking/hiring people who have it), and seeing such an amateurish mistake doesn't inspire confidence for the future.
You should either hire people who know what they are doing or upgrade your knowledge about the systems you are using before making decisions to use them.
Sometimes I see a post about sorting algorithms online. Some people seem to benefit from reading about these things, but often, I find there isn't much new information for me. That's OK, because I know somebody somewhere benefits from knowing this.
It is your decision to make this a circlejerk of musings about how the company must be run by amateurs. Whatever crusade you're fighting in vividly criticising them is not valuable at all. People need to learn and share so we can all improve; stop distracting from that point.
What did your internal discussion conclude for the question "Why did we not take a step back earlier and think, why are we doing it this way?"
I'm genuinely curious, because this is not singling out your team or org; this is a very common occurrence among modern engineering teams, and I've often found myself on the losing end of such arguments. So I am all ears to hear at least one such team tell us what goes on in their minds when they make terrible architecture decisions, and whether they learned anything philosophical that would prevent a repeat.
Oh we had it coming for quite some time and knew we would need to rebuild it, we just didn’t have the capacity to do it unfortunately.
I was working on it on and off, moving one endpoint at a time, but it was very slow until we hired someone who was able to focus on it.
It didn’t feel good at all. We knew the product had massive flaws due to the latency but couldn’t address it quickly. Especially cause we had to build more workarounds as time went on. Workarounds we knew would be made redundant by the reimplementation.
I think we had that “wtf are we doing here” discussion pretty early, but we didn’t act on it in the beginning; instead we tried different approaches to make it work within the serverless constraints, cause that’s what we knew well.
I have had CTOs (two in my career) tell me we had to use our AWS credits since they were going to expire worthless. Both experiences were at vc-backed startups.
I doubt they literally said “perfect for low latency APIs,” but their messaging is definitely trying to convince you that they’re fast globally; just look at the workers.cloudflare.com page.
Have you done new benchmarks since Cloudflare announced their latest round of performance improvements for Workers?
Just curious if this workload also saw some of the same improvements (on a quick read it seems like you could have been hitting the routing problem CF mentions)
Really great writeup. The charts tell the story beautifully, and the latency gains are surely a win for your company and customers. I always wonder about the tradeoffs. Is there a measurable latency difference for your non-colocated customers? What does maintenance look like for your Go servers? I assume that your Cloudflare costs dropped?
It’s faster for non-colocated customers too, weirdly.
I think it’s because connections can be reused more often. Cloudflare Workers are really prone to doing a lot of TLS handshakes cause they spin up new ones constantly.
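For what it's worth, here's a minimal sketch (assumed names, not their actual code) of why a long-lived Go server avoids those handshakes: a single shared http.Client keeps a pool of idle TLS connections across requests, which a short-lived serverless instance loses every time it spins up.

```go
// Shared, long-lived HTTP client: its Transport keeps idle connections
// around so repeat calls to the same upstream skip DNS + TCP + TLS setup.
package main

import (
	"io"
	"net/http"
	"time"
)

var upstream = &http.Client{
	Transport: &http.Transport{
		MaxIdleConns:        100,
		MaxIdleConnsPerHost: 10,
		IdleConnTimeout:     90 * time.Second,
	},
	Timeout: 5 * time.Second,
}

func callUpstream(url string) error {
	resp, err := upstream.Get(url)
	if err != nil {
		return err
	}
	// Draining and closing the body is what lets the connection go back
	// into the idle pool instead of being torn down.
	defer resp.Body.Close()
	_, err = io.Copy(io.Discard, resp.Body)
	return err
}
```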
Right now we’re just using AWS Fargate for the Go servers, so there really isn’t much maintenance at all. We’ll be moving that into EKS soon though, cause we are starting to add more stuff and need k8s anyways.