Honestly you can go very far with unmetered data plans. We consume 2GB/s, maxed out, which ends up being 5184TB a month. We have millions of users a day, all on a streaming video platform.
It costs less than $2K/month.
The cloud is crazy expensive. Private servers are beasts, and they are cheap.
Of course, for this price, you don't have redundancy and horizontal scaling.
You also don't have to maintain and debug a system with redundancy and horizontal scaling.
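For anyone checking the numbers, the arithmetic holds up; a quick sketch in Python (figures taken from above, assuming a 30-day month):

    # Sustained throughput -> monthly transfer volume (illustrative figures from the comment above).
    GB_PER_SECOND = 2
    SECONDS_PER_DAY = 24 * 60 * 60        # 86,400
    DAYS_PER_MONTH = 30                   # assumption: 30-day month

    monthly_gb = GB_PER_SECOND * SECONDS_PER_DAY * DAYS_PER_MONTH
    print(f"{monthly_gb / 1000:,.0f} TB/month")     # -> 5,184 TB/month
    print(f"{GB_PER_SECOND * 8} Gbps sustained")    # -> 16 Gbps of bandwidth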
> We consume 2GB/s, maxed out, which ends up being 5184TB a month. We have millions of users a day, all on a streaming video platform.
> It costs less than $2K/month.
The solution in this article is serving on the order of 100TB/month for $400/month including a high speed global CDN, their API and database servers being hosted reliably, and redundancy and backup being handled by someone else.
Your solution is hosting on the order of 1000s of TBs per month (ignoring the database and other aspects of this website), but the price is an order of magnitude higher. You’ve also given up all of the automatic redundancy and hands-off management, and you don’t have the benefit of a high-speed global CDN.
But more importantly, you have significantly higher engineering and on-call overhead, which you’re valuing at $0.
If anything, that only makes Polyhaven’s solution sound more impressive.
> Of course, for this price, you don't have redundancy and horizontal scaling.
Which is a huge caveat. The global CDN makes a big difference in how the site loads in global locations. Maybe not a big concern if you’re serving static files with a lot of buffering, but they have a dynamic website and a global audience, and they said fast load times are important.
> You also don't have to maintain and debug a system with redundancy and horizontal scaling.
But you have to do literally everything else manually and maintain it yourself, which is far from free.
All of these alternative proposals that value engineering time at $0/hour and assume your engineers are happy to be on-call 24/7 to maintain these servers are missing the point. You pay for turnkey solutions so you don’t have to deal with as much. Engineers don’t actually love to respond to on call events. If you can offload many of your problems to a cloud provider and free up your engineers for a nominal cost, do it.
A dev who knows Python and Linux is enough to support the uptime needs and evolution of this particular product. Actually, a part-time one.
The entire team is composed of half of one dev.
I'm 100% sure it's way cheaper than anybody who has AWS on their resume.
Of course, some things have to give, like the global CDN and some data guarantees.
Everything is a compromise. It all depends on what is important for your project.
EDIT: also, my comment was not meant to oppose the article, but rather to confirm the view that you should calibrate your setup to your project. Doing so will lead to great savings in hosting costs and project complexity. A lot of projects don't need the cloud.
Who is on-call 24/7/365, never takes vacation ever, and is always available to fix the website?
It’s weird how much HN hates jobs with on-call requirements, but every time cloud services come up the solutions always involve forcing someone to be permanently on call to save a few hundred dollars per month in hosting costs.
This one dev is literally on call 365 days a year and can never be away from a computer on vacation. If he leaves, the project has no one.
How is that not a problem? Surely you can see that this isn’t reasonable for anyone who wants to run a business, or any employee who doesn’t want the website to be their life.
> If the system has a problem during the night, the users will wait until the morning.
> It just loses a bit of money and users are grumpy for a day
If you’re running a website where extended outages are no big deal and you don’t care about lost revenue, then it’s not really a valid comparison to the typical business website.
Your situation is unique, not a model that other companies should follow.
> If you’re running a website where extended outages are no big deal and you don’t care about lost revenue, then it’s not really a valid comparison to the typical business website.
I think you're severely underestimating how many businesses make a significant amount of money from their website but don't actually have a full-time developer available. An extended outage would cause significant revenue loss, but it's typically not a problem because outages are surprisingly rare when you (1) have a very stable traffic pattern and (2) don't spend a lot of time adding features and refactoring. Pretty much every cloud outage we've seen was caused by a human configuration error, not faulty machines.
> I think you're severely underestimating how many businesses make a significant amount of money from their website but don't actually have a full-time developer available.
No, I’m well aware. But there’s a simple solution to this problem: Don’t try to run and maintain your own servers. Pay a little extra to use cloud hosting and let it be someone else’s problem.
I take issue with these calls to set up and maintain your own custom solutions and servers while also suggesting that the cost of engineering and maintaining such a custom setup should be ignored.
Running your own servers and not having developers is a recipe for an endless stream of contracting invoices that are going to cost far, far more than just using a hosted cloud solution.
I'd be on board with "pay a little extra to rent dedicated servers", but "move to one of the big-three cloud providers" doesn't sound like a sound financial decision for the case presented.
> But there’s a simple solution to this problem: Don’t try to run and maintain your own servers. Pay a little extra to use cloud hosting and let it be someone else’s problem.
But that's only if it IS a "problem" in the first place. You have defined it as such, although Bitecode themselves said that for them, it simply isn't. (To paraphrase: "If the site is down, then it's down; so what? We'll fix it when we're in the office again.")
Just plain ignoring whether something is "a problem" or not is hardly being "well aware".
> An extended outage would cause significant revenue loss, but it’s typically not a problem
This just seems like a bad decision from a business perspective. You are willing to endure a significant outage that will cost a lot of money but not pay to prevent it? Machines can and will fail.
It seems it's criminal to run a service on the cheap; there must be a terrible human being behind it.
Well no, the dev is not chained to their computer 365 days a year.
A freelancer is hired part-time for the duration of the vacations. It costs a full dev salary for one month, taking training time into consideration, that's all.
> If you’re running a website where extended outages are no big deal and you don’t care about lost revenue, then it’s not really a valid comparison to the typical business website.
Most services can actually go down once a month and be a viable business. You are not Google or Facebook.
In fact, most human services go down for days: bakeries, lawyers, teachers, plumbers.
The idea that internet services should be up all the time is only in your head. Humans adapt perfectly.
It's not that big a deal. Most of our software is not as important as we want to think.
If you really want 99.99999% uptime, you're going to increase your service quality by 10%, and your service cost by 1,000,000%.
The funny thing is, the downtime of our service has not been more than GitHub's downtime in the last few years. So honestly the freelancer is mostly hired to have drinks on the house, because monoliths are very reliable in the first place.
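To make the nines concrete, here's the rough arithmetic (each extra nine buys you roughly 10x less allowed downtime):

    # How much downtime each availability target allows per year (illustrative).
    MINUTES_PER_YEAR = 365 * 24 * 60

    for nines in range(2, 8):
        availability = 1 - 10 ** -nines
        downtime_min = MINUTES_PER_YEAR * (1 - availability)
        print(f"{availability:.5%} uptime allows {downtime_min:,.2f} min of downtime/year")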
> Your situation is unique, not a model that other companies should follow.
Every situation is unique. I never, ever stated it was "a model that other companies should follow". You did.
There is no such thing as "a model that other companies should follow". You must adapt to the situation and goals. Engineering is about compromises.
My post is simply stating the reality that you can get very far with good old tech.
And a lot of projects don't need the cloud, or high availability. Yet they pay premium for it.
> A freelancer is hired part-time for the duration of the vacations. It costs a full dev salary for one month, taking training time into consideration, that's all.
You pay a full dev salary for one month every time someone wants to take a break?
It’s baffling that anyone can read an article about someone spending $400/month on cloud services and then start proposing things like this as an alternative.
Engineering labor is expensive. Cloud is surprisingly cheap once you factor in engineering costs.
One month part time once a year as an additional cost is way, way cheaper than anything else.
> Cloud is surprisingly cheap once you factor in engineering costs.
No, it's not, since you need somebody qualified to operate it. And such qualified employees are very expensive. And you will need them on call anyway, since it will break, just in a different way than a bunch of private servers.
I'd argue the cloud would be more expensive, even if hosting were not, because you need a more expensive team to run it.
He never said they don't care about lost revenue. I am willing to guess they considered the lost revenue, weighed it against the cost of having someone on-call 24/7, and considered that the lost revenue was cheaper.
If you look at how frequently sites like Reddit used to have downtime, it doesn't seem to matter too much for consumer products. Having half a day downtime once a year might be completely acceptable.
You can host on AWS and still get hours-long downtime, as has happened recently.
Whether the cost of trying to add another 9 to your uptime is worth the marginal benefit is for each company to decide. Each 9 gets exponentially more expensive. A lot of companies who think otherwise actually can afford (and will, sooner or later, be forced) to be down once in a while.
The company I work for has a lot of stuff in the cloud. We seem to have quite a few positions' worth of people permanently on call.
The percentage of "data center"-type on-call situations has perhaps gone down somewhat since moving into the cloud, but it has not gone to zero, and it was never the majority of problems anyhow.
It seems like you're sneaking the idea in that if they would just pay a lot more money, the person on call wouldn't have to be on call. I'd like to know what cloud you're using, because it doesn't seem to be any of the ones I know of. If your service is critical to your business or project, you've got someone on some sort of call (maybe not overnight call, which is the really rough bit), period, or you've got a business that can disappear at any second.
I think it's bizarre you're giving cloud marketing material to a person who's in a position to know their own situation, as if you were describing their situation.
Like... what are you expecting to happen? You'll just gaslight them into thinking they don't know how their business works?
Whether or not a site is hosted in the cloud, it will break from time to time. S** invariably happens, no matter what. So, even if you host in the cloud, you're going to have problems, but they will be different problems. A developer to backup and support the site will be required, one way or the other. Case in point: polyhaven.com (the subject of this article) is not reachable as I write this.
The entire $400/month bill for this linked website will only get you 2-3 hours of consulting time. They’re getting an enormous value by just offloading the work to someone else and not having to worry about it.
You have to do an apple-to-apples comparison though. If you're comparing a single colo'd machine vs a 3000-EC2 instance fleet with a load balancer, api nodes, database nodes (and requisite db admin team), and Kafka and DynamoDBs somewhere in there, then the cloud is going to be more expensive to manage.
Barring in-depth research (which I'd love to read if someone has any links), it's not clear on a 1:1 basis what's cheaper. Paying for someone's time to research hardware and talk to vendors, run POs for them, figure out where/how to install them (Equinix is expensive), and RMA hard drives as that comes up; versus not paying for that and instead paying a cloud vendor for that privilege. Throw on top a changing hiring landscape (how much 'sysadmins' cost vs 'devops') and it really depends on the size of this hypothetical fleet that we're trying to manage, and how complicated the backend of the site is. If there's no real backend to speak of, Cloudflare's CDN for static assets is going to be way cheaper, and available now, vs anything you could possibly build from scratch, that would maybe be ready in a couple months.
Thanks! Exactly that! I always tell customers this and their response is:
-But google/facebook/amazon...
-But uptime needs to be 99.999
-But everyone uses cloud
Most businesses are not a trading market, have fewer than 100 people (aka you are probably not another Amazon), and get no bonus from using cloud/Kubernetes etc.
But it's the same old story; in the 00's I used the ~same arguments against buying OracleDB ;)
And you can tell them all of those are possible, but they need to have a massive budget, not just for the monthly bills but also to hire experts to set things up with those constraints.
I worked at a company once that, from higher up, said that they had to have five nines of uptime. We had some really good cloud engineers there (one guy set up a server / internet container for the military in Afghanistan; in hindsight he said they should've just sent a container of porn dvd's), and they really went to town. For that five nines uptime, you're already pretty much required to set up your infrastructure to use multiple availability zones, everything redundant multiple times, etc.
Of course, the actual software we wrote was just a bunch of CRUD services written in nodejs (later scala because IDK), on top of a pile of shit java that abstracted away decades of legacy mainframes.
Either your compute demand is highly elastic, or your revenue and profit scale with usage; otherwise the cloud is probably not for you. At least not in the long term or for running websites.
Isn't AWS down like every two months for a few hours? That's far off the 99.999% mark. No one can guarantee 100% uptime and sometimes it's even better to have that under your control (eg. have a dedicated server and a backup one from different providers).
My point is that, if you want the highest possible uptime, you shouldn't rely on a single (cloud) provider.
Well, most of them are non-IT businesses, so they just know what peers/family etc. told them to do (or what they do themselves), then read some one-sided sponsored media and now they ~know what's best: it's Azure with Windows Server Datacenter edition, SQL Server Enterprise and McAfee antivirus ;) But who can be mad about it? It's just not their field of expertise or even interest.
If they didn't, but cared about their business (which they know, I hope) and made hard requirements for IT, that would help. IT should then come up with solutions to those real-world problems. We're not talking just about hobbyists, are we?
That's what Eric Evans talks about in DDD, as I understand it.
Oh absolutely true, you don't want to look completely clueless in front of people who take your money to set up an infrastructure ;)
>and made hard requirements for IT, that would help
That would make things so much easier. Often there isn't even an inventory of the applications/hardware in use... or the network.
Just one example:
I made a plan for new hardware and network (new cabling etc.), then I walked around the workshop and there was that dusty machine running... I asked what it was... a DOS machine with... wait... TOKEN RING. That machine was an integral part of the whole workshop ;) However, we made a virtual FreeDOS machine and bought a software converter for the machine protocol; the 25-year-old CNC machine needed a new card (Ethernet instead of Token Ring, very lucky we found that thing). So there was that one little DOS machine no one thought about that could have stopped the whole "modernization".
I say server, presumably it’s the entire solution of however many servers, storage, network etc.
If it’s 2k/month for a non-resilient solution, it’s on the order of 4k for a resilient solution (you need to replicate assets both ways, but it’s in that order of magnitude).
You can have redundancy and horizontal scaling with private servers and still end up paying less than what you would on AWS and the like.
I have some clients who use AWS and others who prefer colo and/or dedicated servers from traditional datacenters. The latter group can afford to over-provision everything by 3-4x, even across different DC's if necessary. DC's aren't yesterday's dinosaurs anymore. The large ones have a bunch of hardware on standby that you can order at 3 a.m. and start running deployment scripts in minutes.
Not including extra human cost in the analysis is just disingenuous. I think to manage private servers of that size you would need at least two extra experts totalling at least $20k/month.
> I think to manage private servers of that size you would need at least two extra experts totalling at least $20k/month.
What? You set up the deployment once, and then you only need to touch it when things go horribly wrong, which is every couple of months, or to make minor quick tweaks and run some updates. Let's be generous, and say you need 10 h/month, which is about 1/16 of a person-month. And if things go horribly wrong, everybody drops what they are doing to fix things, anyway, no matter if you're on AWS, dedicated/colo or run your own data center.
When you significantly change your architecture/deployment, then you need to put in more time again, but if you build your code with need to scale and such things in mind from the get-go, then that won't come up much or at all.
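"The deployment" here can be as small as a script along these lines; a rough sketch with placeholder hosts, paths and service name:

    # Build locally, push artifacts to each box, restart the service.
    # All hostnames, paths and the build step are placeholders, not anyone's real setup.
    import subprocess

    HOSTS = ["web1.example.com", "web2.example.com"]

    def run(cmd):
        print("+", " ".join(cmd))
        subprocess.run(cmd, check=True)

    run(["./build.sh"])                                    # hypothetical build step
    for host in HOSTS:
        run(["rsync", "-az", "--delete", "dist/", f"deploy@{host}:/srv/app/"])
        run(["ssh", f"deploy@{host}", "sudo systemctl restart app.service"])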
> What? You set up the deployment once, and then you only need to touch it when things go horribly wrong, which is every couple of months
Right, which is exactly why people pay extra for cloud managed services.
If things are going “horribly wrong” every couple of months then you must necessarily be on call 24/7 and never take vacation or time away. In practice, you need at least two people to manage on-call coverage so you’re not completely uncovered if someone gets sick, decides to take vacation, wants to travel away from a computer and so on.
Things go horribly wrong with AWS hosted stuff as well. And a lot of companies have a single-point-of-failure AWS person. While you're not wrong in general, nothing you just wrote is specific to running on dedicated servers vs AWS.
We're talking volunteer-run projects, though. Who cares if it's not available 24/7? Best-effort is good enough. Those managed cloud services also fail often, you just have no information and no recourse about it.
What?! Maybe if you hire SV-skilled engineers on location in Silicon Valley, but you can easily serve 2GB/second (on infra worth $2K/month) with one sysadmin dealing with it, and for way less than a whopping $10k/month.
Why would it be necessary to have an engineer on call 24/7? If you do your risk calculations and an outage of 12 hours is acceptable at the expected frequency, you just let the engineer have a nice evening and night and deal with the outage in the morning.
If outages are only to be expected once a year and you can tolerate 48 hours of outage, you don't need any on-call engineer. Most outages are caused by changes. You can test those, plan putting them into production, and have elevated monitoring afterwards to catch problems early.
The only problem remaining is hardware failures. And those are very rare as long as you do decent lifecycle maintenance.
As others have said before: not everyone is Google or Amazon and needs 99.999% uptime.
Pretty much anything. How about a Dell R340 with a dual-10G NIC and some SSDs? That's not just commodity, that's cheap; a commodity server would be a dual-Xeon, but that's overkill for serving 16 Gbps.
I think that is the problem with modern-day web dev. (Sorry.)
With cloud and SaaS, they are so abstracted from the hardware that their knowledge of basic hardware and servers, everything from CPU to storage I/O and networking, is close to zero.
You need highly skilled people who are comfortable in AWS or other cloud offerings as well. You have to take care of tons of things, set them up, etc. They are not one-button setups in real life when things get complicated.
If you are worried about getting your blog post to the top of Reddit or Hacker News (I've never been there myself), you can have a very modest web server or even a pay-per-request serverless sort of thing and pay $20 real quick to Cloudflare if you happen to get popular. It's the Bart Simpson method of high scalability[0]: for static content you can have global datacenter coverage in a couple of minutes or so if you use them for DNS to start with. It even works if the origin server goes down.
> you can have a very modest web server or even a pay per request serverless sort of thing and pay $20 real quick to Cloudflare if you happen to get popular.
I get the impression that a lot of the critics in this thread don't really understand Cloudflare, how cheap it is, or even the concept of CDNs in general.
$20/month for Cloudflare Pro is a steal for what you get. Spinning up a dedicated server in a single datacenter somewhere isn't going to give the same results, especially if your users are geographically distributed like in this case.
> I get the impression that a lot of the critics in this thread don't really understand Cloudflare, how cheap it is, or even the concept of CDNs in general.
You’re talking past the point here. It doesn’t matter how cheap it is if you’re fundamentally opposed to enabling Cloudflare to reach its meathooks further into the Internet.
This is no different from arguments about embedding google analytics or “just paying for windows” instead of using Linux.
I don't think the problem is a specific CDN. It's that everyone ends up using the same CDN, so when Cloudflare has problems, it affects everyone. Same with AWS: large swaths of the internet go down if AWS does, which sounds great for AWS in marketing material, but less great for the general usability of the web.
> What point? Nobody said anything about "Cloudflare to reach its meathooks" in the article or the above thread except you?
The OP mentioned "Cloudflare to reach its meathooks" in the thread, attacking those who haven't jumped onto Cloudflare's bandwagon by putting up a strawman that it's only due to ignorance.
OP clarified that misrepresentation by pointing out the risk of allowing a single company to control the CDN market specifically and serving web content in general.
I figure helping Cloudflare get its meathooks into the internet offsets all the really big companies that have their meathooks in, or at any rate doesn't worsen the real problem.
> (...) helping Cloudflare get its meathooks into the internet offsets all the really big companies (...)
What? No. Cloudflare reported a revenue of half a billion dollars, and already controls about half the CDN market.
Let's put things in perspective: in comparison with Cloudflare's business, AWS is a minor player and an underdog with less than half of Cloudflare's market share.
Cloudflare is by no means a small company or an upstart or a David among Goliaths. Cloudflare is in fact and by far the Goliath of the CDN world.
Just to understand you better: are you only talking about CDN activities from AWS here? Because I see websites talking about a quarterly revenue of tens of billions of dollars for AWS.
> Spinning up a dedicated server in a single datacenter somewhere isn't going to give the same results, especially if your users are geographically distributed like in this case.
Maybe not, but is the target audience that shells out $20/month really the type of people who have optimized their site to such an extent that shaving 50ms off the request latency by having your edge cache geolocated is the thing that makes the difference? Most of that group could probably do a lot of other optimizations that count for more.
> Maybe not, but is the target audience that shells out $20/month really the type of people who have optimized their site to such an extent that shaving 50ms off the request latency by having your edge cache geolocated is the thing that makes the difference?
The common mistake is to pick a server geographically close to yourself, only access it from low-latency connections, and then assume that everyone in the world is seeing the same thing.
Or to only visit your own site with everything already in the browser cache. If you're not seeing cold start loads, you're not seeing what every new visitor to your website is seeing.
Consider the Photopea.com website. The author explained in a comment below that he spends $60/month to host the site without a CDN. Several of us loaded the site and it took 2.5 - 5.0 seconds to load. He could sign up for a cheap Cloudflare account, reduce the size of his server (due to caching), and the load times for everyone would drop by a significant amount.
If you're hosting simple, static content like a blog for an audience that doesn't care about load times, then of course nothing matters. But for modern, content-rich websites (photos especially) it can actually be a substantial improvement to add a CDN even if you have a single fast server. You may not see it, but visitors from distant locations definitely will see a difference.
With some browser security policy that blocks part of the download, the homepage www.photopea.com clocks in at 3.80MB (so it should be much higher in practice). In this case, it's mostly JS, so designing your website properly (without JS, especially if the app itself is wasm not JS) would have much better savings than moving to CloudFlare CDN.
A CDN is more times than not the wrong answer to a real problem. Shave off your website and consider content-addressed protocols for big static asset download (like the textures from the article). If you run your website as a lightweight glorified Bittorrent index you'll notice your costs are suddenly a lot less, and you can still have a smaller "Download over the web" button as fallback.
> Consider the Photopea.com website. The author explained in a comment below that he spends $60/month to host the site without a CDN. Several of us loaded the site and it took 2.5 - 5.0 seconds to load
This is a conclusion I am extremely doubtful of.
Ping time New York <-> Tokyo is about 180ms. So let's say as a worst case the ping time to the single server is 180ms (it's probably not that bad), and let's say the latency to the Cloudflare edge server is 20ms.
So using cloudflare on a cache hit (best case), you save something like 160ms per roundtrip.
Which, don't get me wrong, is a huge saving and worth it (although this scenario is hugely exaggerated).
However, say you want to load the page in under 1 second instead of 5 seconds. In this scenario you would basically have to have 25 round trips to bring the site from 5 seconds to 1 second just on RTT savings from having a geolocated edge server. If your site needs 25 round trips to load, something else is clearly wrong. (And this is an exaggerated case; in the real world the benefit would probably be much less.)
To be clear, I'm not saying that geolocated edge caches are bad or useless. They are clearly a very beneficial thing. It's just not the be-all and end-all of web performance, and most people in the demographic we are talking about probably have much more important things to optimize (OTOH, using Cloudflare is cheap and doesn't require a lot of skill, so it is very low-hanging fruit).
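Spelling out that arithmetic as a quick sketch (the numbers are the illustrative ones from this comment, not measurements):

    # How many round trips edge caching alone would need to save to explain a 5 s -> 1 s win.
    rtt_origin = 0.180      # worst case: distant single origin, seconds
    rtt_edge = 0.020        # nearby CDN edge on a cache hit
    saving_per_rtt = rtt_origin - rtt_edge          # 0.160 s saved per round trip

    target_saving = 5.0 - 1.0                       # want to go from 5 s to 1 s
    print(round(target_saving / saving_per_rtt))    # -> 25 round trips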
> So using cloudflare on a cache hit (best case), you save something like 160ms per roundtrip.
Per packet. If you're doing a cold start, you'll pay that latency cost several times over: first the TCP handshake (3 roundtrips), and then the TLS handshake (2 more roundtrips). That's 800ms of extra latency before you even get to sending the first HTTPx request.
> In this scenario you would basically have to have 25 round trips to bring the site from 5 seconds to 1 second
You’re forgetting that the TCP protocol itself is bidirectional. High-latency connections will have lower throughput, especially at the beginning of transmission, because the data isn’t literally just streaming in one direction.
Anything over 100ms [1] is perceived as not-instant by a user. If you wait 2RTTs with 50ms per round trip, then you've already exceeded this threshold.
> last I checked Cloudflare was free too for caching static HTML assets?
If not free then very cheap, as I understood TFA: Wasn't that why they have two separate domains and serve static assets from one of them, to be able to use the cheapest Cloudflare tier for that domain = those assets?
> If you're going to centralize in Cloudflare you might as well just skip the hosting the website bit entirely and make a business account on Facebook as your host.
These responses are getting bizarre. Facebook pages have nothing to do with web hosting or Cloudflare.
Also, hosting on a single server in a single datacenter is, literally, the definition of centralized. Cloudflare distributes the content to a huge number of edge nodes which are spread around the world. How did we end up in this situation where people are calling the distributed solution centralized and suggesting a centralized solution as the alternative?
If cloudflare did that, then you do a simple DNS change and host your content somewhere else.
You are ALWAYS going to be contracting with a third party to provide your connection to the internet... why is trusting cloudflare not to block you riskier than trusting your ISP or the data center that has your server?
> If cloudflare did that, then you do a simple DNS change and host your content somewhere else.
You missed the point. It’s an illustration of how much of the global internet Cloudflare intermediates and can eavesdrop on or filter. Guess how many Tor users you fucked by putting Cloudflare between you and them. VPN users, etc.
Conflating Cloudflare's distributed architecture with distributed control is just silly. It is extremely centralized control, and the CEO of Cloudflare has already terminated the accounts of a business he had a personal distaste for on a whim.
"Non-commercial" might be a better way to understand this point of view. Instead of prioritizing profit (the reason for people using cloudflare, it's cheap and good) the idea is to minimize the damage done by large centralizing forces on the internet. So, in the above comment I suggest Facebook as an equal option because it is analogous to using Cloudflare. The intent was to get you to think like a human person and not a business owner or employee on the clock. It's short term gain for long term damage to the internet.
But then again, if Cloudflare terminates your account, the website is still up; it's just going to be slower, and you're going to pay more to serve the same number of users. There's no lock-in there that I can see.
>It is extremely centralized control, and the CEO of Cloudflare has already terminated the accounts of a business he had a personal distaste for on a whim.
As opposed to the CEO of Amazon/Rackspace/your favorite host here who doesn't have the ability to terminate your account? What are you saying? Or are there other non-profit web hosts and CDNs that I missed?
If you have a personal axe to grind against the CEO of Cloudflare, just say that.
Superkuh's point is that depending on any single service to protect/host/route your content is setting oneself up to be Parler'd or 8chan'd. It doesn't matter how good the technology is. If you don't have any control over it, you're one copyright strike or bad mood from a CEO away from being deplatformed.
There's no need to grind an axe to observe how past actions have set the course for the future, perhaps for the worse.
>If you don't have any control over it, you're one copyright strike or bad mood from a CEO away from being deplatformed.
Again, _as opposed to what_? Are you saying polyhaven should go multi-cloud and spend triple what they need? You aren't actually presenting any real solutions, you are just complaining about the cloudflare ceo.
I'm a guy who wants to host a service. You are telling me Cloudflare bad. What is the alternative, and how do I ensure the CEO of that service doesn't null route me?
>Again, _as opposed to what_? Are you saying polyhaven should go multi-cloud and spend triple what they need? You aren't actually presenting any real solutions, you are just complaining about the cloudflare ceo.
I haven't complained or suggested a damn thing in my previous comment. All I've provided is an extended summary of Superkuh's comments and supported those claims with evidence of past events. Exercising due diligence shouldn't be regarded as a controversial position.
>I'm a guy who wants to host a service. You are telling me Cloudflare bad.
I'm telling you that depending on a single service, whether that service is Cloudflare, YouTube, AWS, etc., is a bad idea. If you don't have a credible alternative provider you can migrate to at a moment's notice, your website and content are at risk.
>What is the alternative, and how do I ensure the CEO of that service doesn't null route me?
You can't ensure the CEO of a company doesn't null route you.
So the alternatives aren't better than Cloudflare; Superkuh just had an axe to grind specifically with Cloudflare. And there is no alternative solution that wrests control from a CEO having a bad day.
At the end of the day, he's still at the whims of Cloudflare/Bunny/Akamai, and if he wants to be fully in control he must spend millions building his own CDN.
It's not as if Cloudflare has major switching costs either.
Any alternative is better than everyone using Cloudflare. This would be true even if Cloudflare hadn't already demonstrated their untrustworthiness. It's true for LetsEncrypt even if LE is awesome and really improved the internet and there are other options. If people only use one thing in practice it is a locus of control.
"Why are you using this thing that solves your problems and does it cheaper than you making your own solution to your problems? You're leading to centralization of the internet!" Good luck changing human nature. Writing these comments here is helping, I'm sure of it. /s
To be fair, Parler and 8chan did deserve to get Parler'd and 8chan'd respectively. To also be fair, even if you are not Parler or 8chan, it is a valid concern.
Dealing with fraud and abuse has _long_ been a centralizing force on the internet. Think about email which is the way it is largely because of spam. We need to structurally stop spam not just shame people from embracing solutions that make their life easier.
It's interesting - I see Cloudflare as a rising force against network attacks more than for its CDN properties. It will become the de facto centralized network. Not sure if I like that philosophically, but practically, and as an engineer, most enterprises will choose to get their DDoS, WAF and Zero Trust products. Networks are the most vulnerable part of the internet infrastructure. Cyber warfare isn't just a talking point on a 60 Minutes episode; it is a real threat to large businesses, and they'll opt for centralized control over decentralized risk. They'll keep the Cloudflare CEO in check, if not the shareholders/BoD.
I've had my personal $5/mo Digital Ocean VPS Wordpress site hit the top of HN before. I kept an eye on htop, but it handled it just fine. Exciting times.
As long as you have a decent backend it's no problem. If you're using some python/ruby/JS thing you're probably going to need some kind of reverse proxy to keep up with top of HN. If you're using a Haskell/Rust/C++/etc. compiled backend you're probably fine.
$20 a month for a static site? That feels like a first world problem. I feel like everything is over-engineered; unless you need subsecond delivery of assets which are very large, a CDN doesn't even make sense. For a blog, what's the point? Your site is not going to receive heavy traffic every hour from global locations every day. If you are concerned about returning users, cache your assets on their machine. If your site is video heavy, I can understand. I run a Django site for a zoo on a $10 instance, and I have 2 others running on the same instance. Never had an issue with page speed, even on 2G or under load my instance didn't suffer. My storage on S3 is proxied via Nginx and cached on the user's device and in Nginx; I never even had downtime due to traffic. I use fail2ban for basic protection. If it comes to DDoS, I'm behind the Cloudflare free tier. $20 per month for a blog? Lol.
You missed the point, it's "pay $20 real quick to Cloudflare if you happen to get popular". There is a generous free tier, and for years I have paid nothing to Cloudflare, but I serve 2GB total every month to visitors, YMMV.
No, I didn’t miss the point; $20 is weeks' worth of meals in the majority of countries. People from those countries run successful blogs without paying a penny to Cloudflare. I serve much more than 2GB per month without all that on a dynamic site. Paying Cloudflare for that is like putting a tarp over a dumpster fire to quickly hide it. I would rather think about why my static site fails like shit when it doesn't even need server-side processing, and correct it. I'm not saying Cloudflare/CDNs are useless; they shine when you are serving huge assets every hour to lots of people globally, or want protection from bot traffic, or the other useful features they provide. Heck, you could even host a static site on S3 or Netlify for free and answer all the traffic in the world. Remember, Google returned subsecond results long before Cloudflare or global CDNs were in place.
One variable you miss is the price of your own time.
To pay USD $20 and unload a problem onto some 3rd party service takes under 30 minutes, while running a scalable, high-performance web server on a low budget is hard and time-consuming - even impossible for devs without sufficient devops/admin skills, which is sadly the majority.
Back in the day, as a student with no money and a time to spare I used to do it all myself too, guerrilla style. Nowadays tinkering with my private servers would mean taking time from my real job, and that just doesn't make sense financially, my time is way more valuable and scarce now.
That's why we have the economy of specialists in the first place. One can do everything in DIY fashion, but in our civilization it's usually cheaper to hire a plumber's or carpenter's services than to invest in learning the skills, buying the tools and then doing it, if it's not your primary source of income. It's no different with CDN services.
I never spent any time other than the initial deployment. Like I said, if your tap is leaking, find a competent plumber, or if you have the skill, fix the leak; don't pay for a water tanker every time you are out of water. I'm not sure why you need high-performance, scalable, multi-sharded and other big cloud tech to run a static site. I'm not against using Cloudflare; there is a use case for it. I can't stress enough why Cloudflare is such overkill for static sites. A cheap VPS can go a long way before you have to start worrying about not being able to serve traffic for a static site; it's not a blocking call. We would be wasting unnecessary resources if we don't fix the actual problem.
My site never failed, that's the thing, without paying $20. And many developers can actually do it too. If you can't serve decent traffic for a static site, I would rather fix the leaking tap once and for all than waste money on a water tanker every time I am out of water.
What does the $20 buy vs free plan? During a spike I was able to handle 30k pageviews per hour with Cloudflare free plan + $40 vps. Pretty much all pageviews were cached by Cloudflare and didn't hit my server. How does the $20 plan help?
Makes sense, that's what I mean too. There is a use case for CDNs, but it doesn't make sense to pay for one if it's a static site, which can be done much cheaper, and for free most of the time, unless you are running a Google-scale static site.
Interpreting ‘weeks’ as 2 weeks, the GDP per capita necessary to call that weeks' worth of meals is $520. There are only like 6 countries with a GDP per capita that low.
If you are only talking about food, it may stretch a bit further, but you’re still far away from the majority of countries.
As a designer, it's really easy for me to put something on Cloudflare... doing what HN does would take at least more than a few hours (and the knowledge) to set up properly.
I had a blog post on the front page of HN for more than 20 hours and my cheap VPS for 3€ a month could handle it perfectly, since my website is just a statically generated website with Hugo.
Looks like with some dead basic optimisations (free versions of WP Fastest Cache and Autoptimize), my Wordpress site can handle around 1500 requests per second on a $5 DigitalOcean VPS before it starts to slow down.
On the old site, running on a shared host with less optimisations it would crap out at less than 10!
Seems like I don't need to worry about this after all.
> This website blog.polyhaven.com/how-we-handle-80tb-and-5m-page-views-a-month-for-under-400/ is currently offline. Cloudflare's Always Online™ shows a snapshot of this web page from the Internet Archive's Wayback Machine. To check for the live version, click Refresh.
I tried looking, and I find it very irritating that in that whole web page, there is not one single link in the menu/footer/sidebar to the polyhaven.com home page so I can click and discover what it actually is. Not one.
This occurs on many company blogs as well, operating under a subdomain like blog.whatever.com.
To be clear, this is a very tangential and irrelevant nitpick and I understand it does not contribute to the content of the website itself.
Yes! - the same goes with support sites on different domains (support.whatever.com or whatever.zendesk.com etc) that appear in google results and don't have a link to the company.
I think this is a result of app stores enforcing the "links to external digital products" thing. That's why all these support pages, which need to be linked from the app, don't have a single link to the main site.
To the polyhaven.com home page? I don't think it did. They are all subpages, which adds another navigation step to see the root page, which is what I find irritating.
Not having a link forces people to navigate to the page on their own. For a lot of people that takes the form of a Google search. Lots of people googling specifically for your brand increases your search rankings.
I handle ~150TB and 26M page views for ~$500 by simply renting a few dedicated servers at hetzner. And if I didn't need quite a lot of processing power (more than average website), it would be much lower. I only need so many servers for the CPU power, not traffic.
This! Hetzner root servers: there is nothing else I know of on the market that comes close in price/power ratio when you use the "Serverbörse". I run multiple low-traffic websites, databases and a heavy-traffic ELK instance + my complete homelab in Proxmox / Docker for 48€ a month and there is even room for more. Highly recommended!
My app allows for "share nothing" architecture, basically using multiple DNS A records as load balancing. Currently it has 6 servers.
Even if 5 of them went down at the same time, the site would still work as intended (though it probably couldn't handle peak load with fewer than 3 or 4). If one or two are down, nothing happens.
Also completely reinstalling a server takes around an hour.
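For the curious, that's roughly what it looks like from the client side; a minimal sketch (placeholder hostname, not my actual setup) that just walks the DNS A records until one answers:

    # DNS round-robin from the client's point of view: several A records, try each in turn.
    import socket

    def fetch_any(host, port=443, timeout=3.0):
        # getaddrinfo returns every A/AAAA record the resolver knows about
        for family, socktype, proto, _, addr in socket.getaddrinfo(host, port, type=socket.SOCK_STREAM):
            try:
                sock = socket.socket(family, socktype, proto)
                sock.settimeout(timeout)
                sock.connect(addr)        # first server that accepts the connection wins
                return sock
            except OSError:
                continue                  # that server is down, try the next record
        raise ConnectionError(f"no reachable address for {host}")

    conn = fetch_any("example.com")       # placeholder hostname
    print("connected to", conn.getpeername())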
I do not have an HA setup BUT all of my Proxmox VMs are snapshot-backed-up by Proxmox Backup Server every night to my home NAS and to my office NAS. You can use one of many storage providers. SSHFS also works.
This is the cheapest and lowest-administration solution I have used to date.
For production usage I would recommend 3-4 similarly specced 28€ machines and running a replicated Proxmox cluster or Ceph Proxmox cluster.
As long as you treat servers as cattle, you can use Hetzner's own load balancer service and then you don't have a SPOF that you manage yourself. Their LBs are advertised as redundant / fault tolerant.
I don't know if it is possible to treat "Serverbörse" servers as cattle. They are all different. I know that k8s and Docker Swarm could in theory balance load between different machines, but I've never tried it in practice, and I have had some weird glitches in practice with different CPUs/motherboards/memory.
Also, the comment I was replying to mentioned a 48 euro budget, which is the price of a single server.
Yes, it may be bad for indiscriminate/dumb load balancing. Then again, this only works in the first place if the work units represent a somewhat equivalent and small-ish workload.
Once you place value on determinism (in regard to time spent on a task), you want a tightly specced distribution mechanism and/or a feedback loop to communicate busy state back to the LB.
I have two of the AX41-NVMe, and only one was lucky enough to get a "Intel Corporation I210 Gigabit Network Connection (rev 03)" which has the hardware timestamping.
In the Finland datacenter, it appears there is PTP running, though the offset is ~2.33 seconds off from NTP. Chrony says it's a false ticker, but I haven't really figured out how to get it configured correctly, nor have I asked Hetzner for help. I've mostly just played around with it.
I also managed to accidentally discover that a local ISP's stratum 1 server is extremely close to me, as in a few microseconds away (I accidentally put Hetzner's servers as a pool instead of a server and my NTP 'discovered' the stratum 1)... I'm not using it, but I've thought about reaching out to them to ask if I can use it if I'm very nice.
`/dev/ptp0` and `/dev/pps0` magically showed up and yeah, it appears to be stable. Chrony actually has it marked as a non-false ticker atm and is marked as a backup. Here's the stats:
That sounds very cool, did you happen to write about it? I currently run all this at home using a dedicated computer but electricity here isn’t cheap and I’d like to move everything to proxmox (although I’m struggling to figure out how to move all these more or less manually set up docker compose services and lxc containers)
Does DO even offer dedicated servers? I have used their vps in the past. It was ok, no issues, but more expensive than hetzners vps or dedicated for large traffic/cpu needs.
> This website blog.polyhaven.com/how-we-handle-80tb-and-5m-page-views-a-month-for-under-400/ is currently offline. Cloudflare's Always Online™ shows a snapshot of this web page from the Internet Archive's Wayback Machine. To check for the live version, click Refresh.
> To avoid this problem in the future, I decided to splurge a bit and go for a cloud solution where I wouldn’t have to worry about reliability, performance, scaling or integrity ever again: Google Firestore.
> Google Firebase is nice and convenient, but it is quite expensive. We could investigate some other managed database options in future.
I've only seen people get annoyed with Firestore over time and migrate out of it. People do end up worrying. At first, they seem to choose Firestore because it's heavily marketed and seems suitable for a new project. And then data modeling, high prices or scalability becomes a problem.
What is a good alternative to Firebase? I mean for a new project: user handling, registration, email sending and password recovery alone seem to take some effort. Also, being able to react to events is kind of trivial in Firebase. So is it really a bad choice (for the beginning)? Later you can always try to optimize.
So the problem here is that if you have all of those needs, then really soon in the life of the project you are going to need good tooling to handle them.
So choosing something which makes it "one-click" to set up but total madness to manage is a really short-term optimization, only worth it for a pure prototype which you will throw away no matter how successful.
If you know you need those things to reach success, then it is better to make the up-front investment to get good tools for those.
If you still want to go with a cloud provider, AWS Amplify has some interesting tooling. I've built products against both Amplify and Firestore (and Firebase). Yes, Firebase is a few days to a week faster to set up (integrated user management, as you say), but AWS gives more sophisticated control and is built around scripted deployments.
You pay for it, of course, and I'm not arguing AWS vs running your own server. I am saying if the choice is AWS or Firebase, that a few days researching the choice would give you knowledge you could use for launching the next 10 prototypes you have in mind.
This is undoubtedly better than most chuck-it-on-AWS types would achieve, but it's still quite a bit more expensive than could be achieved with marginally more effort. They are already on Digital Ocean - if they used their managed database offering, they could slash their Firestore bill and get rid of their Argo bill, bringing the cost sub $200 (but - massively reduced returns on investment of effort).
My thought is: why don't the authors use GitHub Pages to host their static site, with links to a dedicated server with an unmetered link, since their whole project is CC0? Downloading assets doesn't need to be low latency, so I don't think caching spatially near the user is important. Having a user login for Patreon is something that would be missing and require some more thought, I admit.
I think quite a lot of other people have mentioned in the thread that they are getting a lot of other "benefits" from using multiple services, but I don't see how these help solve the problem of data delivery besides taking advantage of the Cloudflare + Backblaze alliance which is $31 if their main website is a static one.
To look at a different angle, I would imagine the traffic isn't evenly distributed, and part of being happy with your offering will be if things don't grind to a halt during peaks.
Yes. It should be per month with peak requests per minute, or better, per second, since we are always optimising for the rare peak usage. Once you sum things out to monthly or even yearly, a lot of figures are meaningless.
No it’s not. 2 qps is pitiful. If anything $400 is crazy expensive. I guess most of it goes to traffic/storage costs, otherwise they are being ripped off.
According to the blog post half the money goes to their CDN (Cloudflare) which is a good decision. And about $100 for the database hosting.
I don't think they are overpaying for what they are getting. $400 is not a lot for a proper site that serves a lot of bytes. I just don't think it's that impressive either -- it's just yet another website serving static assets through a CDN, with low QPS too, so I don't see why it's noteworthy.
This website is one of the fastest web pages I’ve ever used. Although it’s simple, it immediately loads pages and pages of images. Like in a flash. I’m on mobile web and it’s such a snappy experience.
If Poly Haven is reading this: you nearly have an amazing FCP on the models browser but the external normalize.css file is killing you.[0] Self-hosting would drastically improve your 75p paint time.
I'd also encourage you load your fonts late via JS. Your main JS package competes right now with WOFF files from Google Fonts for priority and there's no need for that.
> I'd also encourage you load your fonts late via JS
can't tell if joke, so:
there are already enough sites that display content for a split second and then some script runs (or fails?) and there is either nothing on screen or an error message.
Just wanted to say that I’ve been a Photoshop user since 4.0, but in the last year I’ve found myself loading photopea more than loading photoshop. Fantastic product. My main reason is purely the load speed - I can get a basic file up and running far faster. Great work on it - I’m a big fan!
> I was running www.Photopea.com with 10M page views a month for $40 a year.
> Now, I upgraded to $60 a month. I never used any CDN.
Why not go back to the $40/year plan and spend $20/month on Cloudflare Pro?
I was able to load photopea.com in about 5000ms, uncached. That's not terrible, but it was slow enough that I wondered for a few seconds if the site was broken. A CDN would cut that load time massively, and it wouldn't even be a net cost increase because you could downsize your server.
I mean, a really cold cache from the above user is at 5000ms; that's pretty horrible. When I tried I got 2800ms. Take a look around: it seems to be mostly from fetching static content, which is a classic problem CDNs solve.
One doesn't even need pro. In our use, Cloudflare's pages.dev and workers.dev serves single-digit TB traffic with triple-digit million hits at $30/mo. We pay $0/mo for origin servers since there aren't any.
People have been citing $30/month. Even if you value your time at minimum wage, it's probably still cheaper than setting up your own Varnish (and that's assuming you don't care about improving the last little bit of latency with geolocated edge servers).
Can we not call companies silly names? It seems childish.
To answer the actual question - it's the obvious answer. They make a product that works well and is relatively cheap. Is it without drawbacks? Obviously not, nothing is. However, for a significant segment of the market the value proposition makes sense.
I appreciate you are replying to someone, but I feel like this doesn't quite make sense. You're running a website. Why would your web server be decentralized? HTTP is a client/server protocol after all.
Because caching is part of the HTTP spec. If content doesn't change on every request, you cache it with a timeout. If users are looking at your site from around the world, you want the cache to exist as close to them as possible. This is the job of the CDN. In Cloudflare's case, they sit between your web server and the user and can be that HTTP cache. If your site is popular, you might still have multiple versions of it across regions.
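To make that concrete, a minimal sketch of the origin side (Python's standard http.server; paths and max-age are illustrative, not anyone's real config), where the server simply declares which responses intermediaries may hold onto:

    # Mark static responses as cacheable; force revalidation for dynamic ones.
    from http.server import BaseHTTPRequestHandler, HTTPServer

    class Handler(BaseHTTPRequestHandler):
        def do_GET(self):
            body = b"<h1>hello</h1>"
            self.send_response(200)
            if self.path.startswith("/static/"):
                # static assets: let edges and browsers cache for a day
                self.send_header("Cache-Control", "public, max-age=86400")
            else:
                # dynamic pages: revalidate on every request
                self.send_header("Cache-Control", "no-cache")
            self.send_header("Content-Type", "text/html")
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)

    HTTPServer(("", 8080), Handler).serve_forever()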
Because the linked article is about Cloudflare? It's like asking why a comment thread on an article talking about apples is discussing apples. Or are you asking why so many people are _pro_ Cloudflare?
Because many competitors have taken an "Enterprise focused" approach. The choice of CDNs for more than media hosting and without $$$ commitment is quite small.
Cloudfront's bandwidth pricing is outrageously high at $0.08/GB (up to $0.12/GB for some regions). Meanwhile, Cloudflare doesn't even charge for bandwidth.
Yeah, I opened web dev tools in Firefox and looked at the request times. Seems strange not to use a CDN to improve the load time, particularly with users on a different continent.
I would love to use a CDN. But I am afraid of things I do not fully understand, and I am afraid of making things too complex.
If I update a single file, how long does it take until nobody in the world can access the old version anymore? Does it take seconds / minutes / hours? Also, some files should not be cached at all (like PHP requests).
I am afraid it would take me days or weeks to learn everything and to configure the CDN properly, and I am risking being offline for a part of the world during that time. Also, if there is a problem at the CDN, my website would be broken, too. If someone could help me, you can write me at support@photopea.com
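For the "how long until the old version is gone" worry: as far as I know, Cloudflare only caches static file types by default (PHP/HTML responses pass straight through), and updated files can be purged explicitly rather than waiting for the TTL. A rough sketch assuming Cloudflare's cache-purge API, with placeholder zone ID, token and file URL (uses the third-party 'requests' package):

    # Evict a specific URL from the edge cache after deploying a new version of it.
    import requests

    ZONE_ID = "your-zone-id"          # placeholder
    API_TOKEN = "your-api-token"      # placeholder

    resp = requests.post(
        f"https://api.cloudflare.com/client/v4/zones/{ZONE_ID}/purge_cache",
        headers={"Authorization": f"Bearer {API_TOKEN}"},
        json={"files": ["https://example.com/code/app.js"]},   # hypothetical file URL
    )
    print(resp.json())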
All of the Affinity products are top notch. I wanted a simple vector tool and ended up liking Affinity Designer so much I picked up a course to learn it more in-depth. The best part is their pricing model doesn't suck, so I'm happy to pay for their wares.
In November 2020, Gimp 2.10.14 didn't work well or at all on Big Sur. The issues got workarounds and bugfixes, but there are still complaints about performance. https://gitlab.gnome.org/GNOME/gimp/-/issues/5917
On a similar note - does anyone have a recommendation for a budget CDN for video? I don’t necessarily need encoding capabilities (though it’s a plus), but rather high storage/bandwidth options.
BunnyCDN (mentioned in the blog post) has this functionality. They're using it for image processing, but it works for videos too.
I use it myself, but just as a normal static file CDN, but they have dedicated tabs in their UI for video stuff. Their bandwidth pricing is also very reasonable ($5/TB with their "bulk" option and $10/TB for standard)
First of all, kudos, I’m a junkie for such optimizations myself. It’s a pleasure that your systems are optimized well, you sleep well and you’re proud :)
Two thoughts:
1) CloudFlare offers incredible service. What would otherwise need a team of netadmins/sysadmins can be handled easily via their UI. I use Argo in one project and yes, it is a very efficient tool. CloudFlare is becoming like AWS, in that knowing its tools is a skill. Maybe one day we'll see an official CloudFlare Solution Architect certification?.. :))
2) For such traffic-heavy websites I don't think there's a better combination than a static website hosted over a CDN, cached aggressively, with serverless as a backend for the dynamic part. If you configure it properly (cache optimization, rate limiting so you don't get a huge bill if DDoSed, etc…) this setup is set-it-and-forget-it, without the need to invest loads of resources into it.
I'm new to this stuff, but what the source article describes is essentially just that it is cheaper to run a static website with a CDN than without - right? I'm mostly curious because they described the cloudflare service as a cache layer rather than a CDN.
> They described the cloudflare service as a cache layer rather than a CDN.
Not sure how most CDNs work, but for Cloudflare, content is not actively pushed to edge nodes, it’s only pulled to an edge when needed and then cached there usually for an hour or two.
The caching is still distributed, and all of the tcp+https roundtrips are done to a local data center which makes things faster.
> what the source article describes is essentially just that it is cheaper to run a static website with a CDN than without - right?
Right. The cost savings comes from caching done by the CDN, and public static content is super easy to cache.
The distributed part makes things faster, but doesn’t necessarily save money.
It's annoying how you have to work around Cloudflare's weird Argo pricing like this. Paying per GB for already-cached data that takes the same route as non-Argo traffic makes no sense.
I believe it's also against Cloudflare's TOS to use it as an asset-hosting platform.
I often wonder why so many companies seem to spend tens of thousands of dollars per month on AWS services for their hosting. If most web apps nowadays are just CRUD apps, then why would you need to spend more than €300/month on hosting? All you'd need is data and the website itself, right?
I'm currently clicking through various European hosting services and they seem to offer great dedicated servers at good prices. I cannot wrap my head around these ridiculous costs I keep hearing about. A guy I know was telling me how they were spending tens of thousands of dollars per month on AWS at his company.
Is it because everyone is writing their stuff on node.js, putting 500kb of JS in every web page they serve, putting Docker everywhere when a chroot would suffice, microservices, Kubernetes, or hell even writing SAPs when we don't need them?
> If most web apps nowadays are just CRUD apps, then why would you need to spend more than €300/month on hosting? All you'd need is data and the website itself, right?
If you're building a basic CRUD app and you have low traffic numbers, you don't need to spend tens of thousands of dollars on AWS.
But it's not as simple as picking a dedicated server, setting it up, and hoping for the best. At minimum you need periodic backups, testing and staging environments, solutions for rate-limiting your API, and so on.
And the "everything is just a CRUD app" meme is just that: a meme. Usually engineers who repeat this have only ever worked on simple CRUD apps, so they don't understand what it's like to work on anything different.
> I'm currently clicking through various European hosting services and they seem to offer great dedicated servers at good prices. I cannot wrap my head around these ridiculous costs I keep hearing about.
If you're working on the types of problems that are a good fit for setting up a single dedicated server on a random hosting provider, you're not working on the same types of problems that necessitate $10K AWS bills.
> I am genuinely confused
I've worked on projects with $10K+ monthly AWS bills. It wasn't node.js or Docker or Kubernetes. It was the sheer volume of connections we had to maintain and data we had to process.
But even if we could reduce our $10K/month AWS bill to $1000/month with a lot of engineering effort and manual management of our own servers, what would we gain? If I had to hire a single additional devops person or engineer to help manage this custom solution, the entire savings would be wiped out. And then some!
Wasting AWS resources isn't smart, but trying to DIY your solution to everything rarely makes financial sense when you look at how much engineering effort it takes and how much engineers cost. If I can spend $10K/month to avoid hiring 1 additional engineer, it's a financial win. I do not care if someone thinks they can do the same thing in $1K/month with endless amounts of custom setup and maintenance. I don't want it.
It also reduces the number of moving pieces that we have to manage manually, which reduces the on-call burden, which keeps people happier.
I really don't understand the anti-cloud hate on HN. It doesn't mirror the real engineering world at all.
> Is it because everyone is writing their stuff on node.js, putting 500kb of JS in every web page they serve, putting Docker everywhere when a chroot would suffice, microservices, Kubernetes, or hell even writing SAPs when we don't need them?
My response may come off as rude, but that's not the intention. In the above quote you've vastly oversimplified all but the most simplistic environments. Someone could run all of the above for well under $10k/month - those are not the reasons why costs are high.
> putting 500kb of JS in every web page they serve
that's not much and CDNs serve that without issue or significant cost
> putting Docker everywhere when a chroot would suffice
Docker does not have much overhead over chroot, and chroot would be a large drop in security/isolation
> microservices
there are good and bad ways to do this - it does not need to cost much. Saying "microservices" is so broad as to have no meaning in this context
> Kubernetes
think about why someone might run kube. Again, not that expensive unless you're really small, where having master nodes would have a big impact on costs
> or hell even writing SAPs when we don't need them?
Do you mean SAP? SAP is the largest non-American software company by revenue. I don't think anyone likes working with SAP. You think people use it for no reason? There are reasons. Think about what those might be.
But those are just details - when building anything of a decent size, nothing is a simple CRUD app. There are always exceptions, limitations, biz logic, migration issues, schema issues. I don't care if you use Postgres, Mongo, Kafka or $SOMETHING_COOL.
Then there's data retention. It's very very easy to keep PB of data in s3 or other data stores for biz or compliance needs
Then AWS gives IAM, which is very helpful in teams > 5
> Is it because everyone is writing their stuff on node.js, putting 500kb of JS in every web page they serve, putting Docker everywhere when a chroot would suffice
Complexity inflates HN readers' CVs and keeps them employed.
>If most web apps nowadays are just CRUD apps, then why would you need to spend more than €300/month on hosting? All you'd need is data and the website itself, right?
Because most web apps aren't simply CRUD apps. It's a meme repeated by armchair engineers who believe they could build Twitter in a weekend.
Here you confuse 'most apps' with Twitter and Facebook. Most apps are CRUD apps; there are some apps that are not. Those apps are the most used, but they are not most apps by app count.
Only people here on HN seem to think that everyone is slinging k8s and 1000 layers of microservices to build the most complex things on earth; the rest are earning their paycheck by building some forms in PHP.
Sadly, many here are overengineering CRUD apps and then making them out to be something more than CRUD apps because somewhere in the future they might be (probably not). You can build 'a Twitter' in a weekend with Postgres and PHP on a few-dollars-a-month server, and it will probably even handle a good volume of users (more than Twitter did at launch; it was often down or very slow). It is a CRUD app at that point; the flow for user-authored posts is built into every trivial and complex framework these days. It will also most likely be bankrupt in a few months, because that is what happens to startups. I would rather spend money on building the business first and then scale the tech, which is exactly what Twitter did. On HN a lot of people seem to work backwards in that regard, but that probably has something to do with being a techie and with VC interest in scalable tech.
I don't think that most workloads are webapps and many webapps are not just CRUD. HN is not the web either, in case HN formed such a viewpoint for you.
Until you're spending the salary of a large fraction of your engineers on hosting costs, there are much bigger things to worry about than hosting costs. AWS is user friendly and a lot of engineers are familiar with it, so developer productivity is strongly in its favor.
Also, you know what you're getting security-wise with AWS, and no one will blame you when your website / service goes down because AWS is down.
Wasn't that like IBM's business motto at some point? No one ever got fired for going with IBM.
Then IBM definitely lost a lot of business to cloud modernization. Maybe the next logical step is the cheaper Cloudflare, which another comment mentioned is actually pretty profitable, and then I'm sure the next, slightly cheaper challenger will come in with just slightly less trust and reliability.
The important thing here is caching/static delivery
I've had this argument too many times to count. Every page view does not need to be dynamic per user request. Creating a sane cache policy reduces origin resources, servers, cost, etc... and, surprise, gives a better user experience at the same time.
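As a minimal sketch of what a sane policy can look like (illustrative Flask routes, not anyone's actual setup): give fingerprinted static assets a very long lifetime, and keep per-user pages out of shared caches entirely.

    # Illustrative cache policy: long-lived immutable assets, uncacheable per-user pages.
    from flask import Flask, make_response

    app = Flask(__name__)

    @app.route("/static/app.3f2a1c.js")        # filename contains a content hash
    def asset():
        resp = make_response("/* bundled js */")
        resp.headers["Content-Type"] = "application/javascript"
        # Safe to cache for a year: a changed file gets a new hashed name anyway.
        resp.headers["Cache-Control"] = "public, max-age=31536000, immutable"
        return resp

    @app.route("/account")
    def account():
        resp = make_response("<h1>Your account</h1>")
        # Per-user content: never let a shared cache (CDN) store it.
        resp.headers["Cache-Control"] = "private, no-store"
        return resp

    if __name__ == "__main__":
        app.run()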
How do 5M page views consume 80TB of traffic? That sounded like a lot; it works out to roughly 16MB per page view. That's a lot of JS and images on every page :)
A few years ago, when I hosted what felt to me like a popular site, I was serving > 150K page views a month from a 1.5Mbps ADSL uplink. With that said, back then you could gzip everything and there were no letsencrypt or CloudFlare walls across the board (they didn't exist yet).
Good thing bandwidth is easier to come by these days.
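A quick back-of-the-envelope on the figures above (assuming decimal terabytes; the 80 TB and 5M numbers come from the article):

    # Average transfer per page view from the article's figures.
    terabytes_per_month = 80
    page_views = 5_000_000

    bytes_total = terabytes_per_month * 1000**4
    per_view_mb = bytes_total / page_views / 1000**2
    print(f"{per_view_mb:.0f} MB per page view")   # ~16 MB, dominated by asset downloads rather than page weight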
'Please don't comment on whether someone read an article. "Did you even read the article? It mentions that" can be shortened to "The article mentions that."'
I don't think they're a store, everything is free and Patreon supporters get early access. I do occasional 3D rendering and the HDRIs are great. The downloads are fast enough to make you ask "whoa, how is this free".
It is annoying that the answer is basically "we paid someone [Cloudflare] to do it." In light of that I decided to run some of my own numbers and figure it out.
They are pushing about 245 Mbps out of Cloudflare (averaged over the month). Wholesale IP transit prices are anywhere from 10 cents to a dollar per Mbps depending on volume. Cloudflare dumps 40% of their traffic over peering, putting their price at $14.70/mo. Ignoring fixed capex of servers, Cloudflare is making about $25/mo on this customer.
Given the $11 Backblaze bill, I estimate about 2 TB of data.
A capable dedicated server with 2 TB of disk and 1 Gbps unlimited bandwidth will run about $30 in a major European metro, maybe double that for the US.
With a grand total of $370 vs. $60 worst case, they are spending 516% more to be "serverless."
Edit: Yes Cloudflare has more than one server. Double the price and put one in the US and you still come out ahead.
Edit 2: I'm not saying one way or the other is better. Just that the title is very clickbaity for a "put a credit card into a website" payoff.
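For anyone who wants to poke at the estimate, here is a small sketch reproducing the numbers above under the same assumptions (decimal terabytes, $0.10 per Mbps of transit at the low end, 40% of traffic leaving via settlement-free peering):

    # Reproducing the parent estimate: average bitrate and rough transit cost for 80 TB/month.
    tb_per_month = 80
    seconds_per_month = 30 * 24 * 3600

    avg_mbps = tb_per_month * 1000**4 * 8 / seconds_per_month / 1e6
    transit_mbps = avg_mbps * (1 - 0.40)      # 40% assumed to go out via free peering
    cost = transit_mbps * 0.10                # $/Mbps/month, low end of the quoted range

    print(f"~{avg_mbps:.0f} Mbps average, ~${cost:.2f}/month of transit")  # ~247 Mbps, ~$15/month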
> A capable dedicated server with 2 TB of disk and 1 Gbps unlimited bandwidth will run about $30 in a major European metro, maybe double that for the US.
Not even close to comparable. I think you're ignoring all of the functions they're paying for. They're not just hosting a few files. Also, your "worst case" is literally the most optimistic best case for a standalone single server, which completely disregards any redundancy and assumes that the $30/month server is truly unlimited/unmetered in every way. That's not a good assumption.
Cloudflare is a global CDN that will be fast for everyone regardless of location and will soak up bursts of traffic with ease. It's also fast in a location-independent way, which won't be true for a single server in a single datacenter somewhere.
Finally, the amount of effort it takes to do this with Cloudflare is trivial. The amount of effort it would take to maintain and optimize a standalone solution is not negligible.
Cloudflare seems like a very good deal to me, unless you value your time at $0 and you have access to these truly unlimited, high-performance $30/month servers.
> But they could deliver potentially ~300TB with a 1gbps unmetered link. Not that hard to come by, could even get to 3PB for $300/mo.
Cloudflare Pro is $20/month per domain.
Trying to run everything through a single server in a single datacenter just doesn't compare for globally-accessed websites like this, even with a 1Gbps unmetered link.
I must be missing something, are you able to elaborate on how CentralizedPro Flare for $20 solves the OP's problems?
I've accessed plenty of sites physically located in the USA from various EU countries, not to mention AUS and I didn't notice much difference as long as they had a good interconnect.
Maybe not the best for YouTube, but there's only a few of those.
Don't get me wrong, I see the appeal and fell for CF early on but not a fan anymore since they've grown into such a behemoth.
Good network connectivity makes a huge difference.
Back in the day when Joyent ran their public cloud, you could beat Cloudflare's free plan from Joyent's Japan datacenter by a tiny amount in most cases.
Anyway, those days are gone and Cloudflare provides great value and probably the best quality network available. So if you have the problem they solve then you would be a fool to not put some serious thought into considering them.
> Don't get me wrong, I see the appeal and fell for CF early on but not a fan anymore since they've grown into such a behemoth.
> Absolutely power corrupts absolutely.
This seems to capture the gist of the argument clearly. I hadn't realized cloudflare had grown large enough to fall into the "too big to not be evil" bucket already.
For the average person, why make it unnecessarily hard on yourself?
If it’s not your core competency and unless you possess the vast breadth and depth of skills to dig yourself out if something goes wrong, it seems like a distraction at best and a fatal mistake at worst, with the end result of saving 1-2 engineer hours worth of money per month.
For some self-hosting is a no-brainer, but I suspect that community is much smaller than many here would expect.
It seems disingenuous to count engineer hours at $200USD/hour for charitable donation-run projects. Is it charitable to funnel your donor money to CF? I'm sure CF is grateful for the donation..
Fact: Things break most often because of humans doing things - changes, like deployments and config modifications.
Set things up right to begin with, and in my experience you can leave them running for a long time without intervention.
> Is it charitable to funnel your donor money to CF? I'm sure CF is grateful for the donation..
Is the job getting done for the donors? I bet they don’t care about $300 if it means the website is highly available and the maintainer isn’t burned out from giving away $200/hr of opportunity cost all the time.
> Fact: Things break most often because of humans doing things - changes, like deployments and config modifications.
Totally agree, and sometimes success in your project forces your hand as you are pressed to add functionality or need to scale. If you can guarantee that you’re doing this work once, ignoring hardware failure or scaling, the scales may very well tip toward self-managed.
For a vast majority of projects, I think it just plainly makes sense to go this route, hence the success of these centralized hosts. It is the pragmatic option.
PS. For what it's worth, I'm sorry you're being downvoted simply for having a viewpoint that others disagree with.
I guess the question is what do you consider a long time?
Red Hat usually breaks setups every 3-4 years, which I personally consider way too frequent.
If I configure a server with automatic updates then I would like it to run with very little maintenance for at least a decade and a long time would be two plus decades.
While you are right CF provides a lot more, it really must be noted that Hetzner SB servers start at 28 EUR and they _do_ come with unlimited bandwidth and 2 x 3TB disks. For 30 EUR you can get a E3-1245 with 2x4TB disk. Just noting.
Yeh, but that Hetzner server is only in one location, so it doesn't reduce the RTT between your visitors and your server (particularly for visitors who are far away).
> Wholesale IP transit prices are anywhere from 10 cents to a dollar per Mbps depending on volume.
10 cents is far from the lower end for high quality IP transit.
Cloudflare doesn't pay anywhere near $14.70/mo for 80 TB/mo. Even I spend much less than that on 80 TB in a month. On top of that, peering makes it even cheaper for Cloudflare, as you said. Cloudflare is very likely using peering for much more than 40 % of the traffic now, so it's even cheaper for them.
The worst peering region for Cloudflare was North America in 2016 with 40 % peering according to Cloudflare blog posts.
Linode offers a 50 dCPU with 10Gbps out for $960. Or if you just need the bandwidth and don't need the CPU power, you can combine three $30 servers with 4Gbps each, and put them behind a $10 load balancer, for a total of 12Gbps out at $100/mo.
> The bulk of the traffic is serving static files.
The bulk of the data transferring is static files because they're hosting large assets, but the rest of the article is about their API, database, etc.
A single server in a single datacenter isn't comparable to a global CDN. They made it clear in the article that they value global latency, and they're willing to pay more for it.
That has mechanical spinning hard disks and a CPU that was released literally a decade ago.
You're not going to replace Cloudflare, Firebase, their API server, and a global CDN with a single 10-year old machine serving files from a mechanical spinning hard disk, hosted by a company that almost nobody has heard of.
Sure you are for something like serving large image files (like, say 3D asset files?)... I serve >30TB/month off a Hetzner server with rotating rust drives for like $50/month doing pretty much that, multi-terabyte image datasets and gigabyte-sized DL models. Plus a few websites, which you've probably seen. Zero optimization, just rsync and nginx with defaults, works fine. (It'd be way cheaper except I wanted another 20TB or so of rust to get room to add more datasets to serve - after all, the server is barely breaking a sweat.)
> I serve >30TB/month off a Hetzner server with rotating rust drives for like $50/month
You could probably do it with Cloudflare for a similar price, but that's not what this article is about. The article says that their Cloudflare (not Argo) bill is only $40/month, and that's only because they have two domains. It's $20/month per domain.
The $400/month figure isn't just for static file hosting. It includes their API, database, and global CDN. They make a point to say that low-latency, global CDN is a priority for them, which is why they're paying extra for Cloudflare Argo.
I'm not sure why everyone is comparing what they're doing to a single fileserver hosted somewhere, serving up static files without regard to global latency. It's apples and oranges.
> They make a point to say that low-latency, global CDN is a priority for them
Which, frankly, is dumb. File downloads depend on throughput, not time to first byte like a small JavaScript file. Once bits start flowing, it doesn't matter if they travel halfway around the globe.
It is like saying you are trying to optimize an ocean cargo ship for acceleration time.
> Which, frankly, is dumb. File downloads depend on throughput, not time to first byte like a small JavaScript file. Once bits start flowing, it doesn't matter if they travel halfway around the globe.
Round-trip latency has a significant impact on TCP throughput. It's not just about time to first byte.
They're also not a static file serving website.
If they were just serving static files and they didn't care about latency, they could pay $20/month for Cloudflare Pro and be done with it.
I don't understand why you continue to ignore the fact that this is for running a high-traffic, global, dynamic website, not just a static file server.
> Round-trip latency has a significant impact on TCP throughput. It's not just about time to first byte.
I've built three CDNs in my career, one at the Tbps scale. Latency only has two factors in throughput: time to scale the window size, and retransmits. Modern TCP stacks handle the former just fine, and the latter is only an issue with packet loss. You can also turn on HTTP/3 and remove your reliance on TCP entirely.
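The bound both sides are arguing about is just throughput ≤ window / RTT; a couple of illustrative numbers (placeholder values, not measurements from this site) show why the answer hinges on whether the window has scaled up:

    # Window-limited TCP throughput: throughput <= window / RTT.
    def max_throughput_mbps(window_bytes: int, rtt_ms: float) -> float:
        return window_bytes * 8 / (rtt_ms / 1000) / 1e6

    rtt = 150  # ms, roughly Europe <-> Australia

    # A classic unscaled 64 KB window is crippling at high RTT...
    print(max_throughput_mbps(64 * 1024, rtt))        # ~3.5 Mbps
    # ...while a window that a modern stack has scaled up barely notices the distance.
    print(max_throughput_mbps(8 * 1024 * 1024, rtt))  # ~447 Mbps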
> I don't understand why you continue to ignore the fact that this is for running a high-traffic, global, dynamic website, not just a static file server.
Because I actually looked at the site and read the article. The vast majority of the site is large static files. The frontend is a static JavaScript app that calls an API server. That API server is running on a $5/mo VM.
My offer still stands if you'd like a tour of a datacenter and see the sausage being made at scale. I'll throw in lunch and answer all your scalability questions if you like. But this thread is growing quite long and veering off topic.
You can use different TCP congestion control algorithms, not just cubic. BBR2 has no issues with high RTT even with packet loss and is fair to other TCP streams.
It serves a 25Mbps stream in real time to users in Japan and the US from a datacentre in Europe with no issues.
Yahoo, Google, and eBay were built with gasp spinning hard disks.
But fine, if you don't believe it I will happily extend an invitation to come see in person my racks of spinning rust and 10 year old servers in Los Angeles or Fremont that are running services that are used by millions of end users. My email is in my profile.
> Yahoo, Google, and eBay were built with gasp spinning hard disks.
Which they ditched as soon as physically possible. The internet has changed a lot since then. That much should be obvious.
You can't reduce this article to "millions of end users" and then equate it with every other service. There's more to this article/service than you're suggesting, and there's more to a website than just users/month.
What's wrong with a spinning hard disk? The files are super easy to read, we're talking milliseconds, and once done it stays in the page cache in RAM... that completely negates any SSD advantage. SSDs have little to no benefit in serving static files.
If you're really strict on this it's trivial to preload these files in RAM after compiling your assets so you don't have to wait for a request.
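A rough sketch of that preloading idea (the directory is a placeholder): reading the files once pulls them into the kernel's page cache, so later requests are served from RAM instead of the disk.

    # Warm the OS page cache by reading the hot assets once at startup.
    import os

    ASSET_DIR = "/var/www/assets"   # hypothetical directory of files to pre-warm

    def warm(path: str, chunk: int = 1 << 20) -> None:
        with open(path, "rb") as f:
            while f.read(chunk):    # each read pulls those pages into the page cache
                pass

    for root, _dirs, files in os.walk(ASSET_DIR):
        for name in files:
            warm(os.path.join(root, name))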
Higher up someone calculated that they have 2TB of data, considering their $11/mo Backblaze bill. I calculate $11/$0.004/GB and get 2750 GB.
If you have 2.75TB of files to serve and have a bunch of simultaneous requests for different files:
- everything doesn't fit into the RAM page cache, unless you have 3TB of RAM in your server
- for files that aren't cached, you have to read from the drive
- when you have a bunch of different requests for different files, you're going to have a lot of disk seeks. Seeks are expensive on rotating media.
Serving a large collection of static files at scale can be quite challenging. At my previous e-commerce company, back in the early 2000's, we did our own image serving with RAID1 multiply-mirrored drives (before SSDs!) so we could multiply the seek capacity by the number of drives.
> What's wrong with a spinning hard disk? The files are super easy to read, we're talking milliseconds, and once done it stays in the page cache in RAM
They're running a dynamic website with an API and other dynamic functions.
24 GB of RAM means a lot of ability to cache data in RAM, which the OS does already by default. It of course depends on the type of content you have but generally traffic follows a power law, so most content can be served from RAM directly. The write side I'm a bit more worried about, as e.g. SQLite issues a sync request on each transaction (not sure about the other SQL implementations, should be similar there tho), and those might require seeking.
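If SQLite's per-commit sync is the worry, the usual knobs are WAL mode and a relaxed synchronous setting; a minimal sketch, assuming you accept the documented durability trade-off (a power cut can lose the most recent commits, but the database file stays consistent):

    # SQLite write tuning for slow/spinning disks.
    import sqlite3

    conn = sqlite3.connect("app.db")

    # WAL turns many small random writes into sequential appends to a log file.
    conn.execute("PRAGMA journal_mode=WAL;")
    # NORMAL skips the fsync on every single commit.
    conn.execute("PRAGMA synchronous=NORMAL;")

    with conn:
        conn.execute("CREATE TABLE IF NOT EXISTS downloads(asset TEXT, ts REAL)")
        conn.execute("INSERT INTO downloads VALUES (?, ?)", ("example_asset", 0.0))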
The article explains that they have database and API usage. You can't run a high-throughput database on spinning disks unless you disregard data integrity.
It's not just a static fileserver, despite some of the comparisons in this thread.
> You can't run a high-throughput database on spinning disks unless you disregard data integrity
Of course you can. That's the way it was and still is done. I'd say one can't run a database with any data integrity unless there are disks spinning somewhere very near.
I'd take another box for like $40 there just for the replica, which would sit mostly idle anyways.
But piling on CDNs, clouds and various other opex and indeterminate risks, instead of some thinking outside of this little cloud shoebox, is just baffling.
Spinning disks lose data integrity? You would be surprised where the cloud servers store your backups, then. They are doing 80TB/month; you don't need 1GB/s NVMe for that (assuming everything is uncached).
How would you define actually capable? I can find servers with recent Xeon processors, good amounts of RAM and NVMe SSD with a gig uplink for around the 100 dollar mark.
> But, we can get around that because of a partnership between Backblaze and Cloudflare they call the Bandwidth Alliance.
This reads like a paid marketing post for Cloudflare.
It's astonishing how many times the author conflates browser-cacheable assets with cached assets on a CDN. When a browser downloads a static asset, if the web server is configured properly, those files will be cached and there won't be a need to re-download them for a very long time.
There is of course the issue of modern web frameworks like React generating a single massive js/css bundle that packs everything together all over again in a unique file, busting all previously cached versions across all users' browsers just because a comma was added to a sentence.
Keep your js/css files small, serve them from your own web server and set a reasonable expire header, no need to pay Cloudflare $40/mo to continue gatekeeping the internet.
They provide 3D asset downloads, most folks will only be downloading an asset a single time. Therefore, browser side caching doesn't really help, they need to cache the files on a CDN.
I don't know if Cloudflare Images will be cheaper than BunnyCDN. We migrated from Pro to Enterprise Cloudflare because the imaging API costs basically put us at the price where upgrading gave us a lot of added benefits and cheaper additional costs during our peak seasons.
How is 5M page views a month a lot? I just looked and my tiny old Hetzner server serves at least 30M per month. Presumably Hetzner includes the 1Gbit uplink for free, so that should be enough for 80TB (that averages out to roughly 0.25Gbit/s, though I don't have that much traffic). It costs $60.
I speculate it's because Cloudflare's TOS prohibits you from using their CDN to serve "video or a disproportionate percentage of pictures, audio files, or other non-HTML content" on typical plans. See section 2.8 on https://www.cloudflare.com/terms/ . Since PolyHaven seems to be a purveyor of 3D assets and they use Bunny to host "all of our images shown on the website (thumbnails, renders, previews, etc.)", I'm guessing a lot of their assets exceed Cloudflare's TOS.
PolyHaven mentions using Bunny's image optimization service, so that'd factor into the decision too.
Yep was also going to mention this. Obviously, Argo still helps with latency and some other things probably, but if they're paying the additional $160 just for tiered caching, they should be able to switch off Argo.
As someone building on a mostly free service with lots of content this is very helpful, thank you! Will probably migrate from S3 to Backblaze, seems like a great deal.
Depending on how much you access per month, Storj DCS[0] and Wasabi[1] can also be interesting choices.
- Storj has the lowest cost per TB, but charges for egress
- Wasabi doesn't charge for egress, but has a fair use policy where if you egress more than you store they can kick you off their platform[2]
- Wasabi is also a better fit only if you plan to keep your files around for 90+ days (they have a minimum object retention period)
- The bandwidth alliance is only available for HTML related content unless you're on a specific paid plan[3]
Self shill: I'm working on https://non.io, a reddit-meets-patreon platform that supports up to 8k video. Planning on launching in the next couple of months (just about got the stripe integration working for auto-payouts). One of my main reasons for making it was I was annoyed at YouTube lately. Would love feedback as I'm aiming to be a competitor.
You can sign in with
Username: hn@hn.com
Password: hackernews
-
The site is currently in demo mode, and the db will be wiped before launch - feel free to sign in and poke around. Also it's hosted in Australia currently, so site may be a little slow for those in the US.
We have an incredibly similar stack for attic.city, aside from the application layer. We lean heavily on CF (edge caching, Argo and page rules). But we’ve recently changed our image assets from S3 to CF R2. Anyone else put that side by side with Bunny?
Worked on several sites with a similar setup. Works great and is affordable. One thing to be aware of is that sometimes you can get a lot of cache misses. Therefore the back-end services must not stop working completely if flooded with requests.
I like the idea of Cloudflare as an OSPF-style file cache, but what about putting 80TB in a colo facility on a 1U box and paying $80/month to host that yourself? Is it cheaper to use Amazon S3?
I think you’ve got the 80TB item confused with storage. They are talking about 80 TB of transfer over the internet in a month. I doubt they have 80TB of assets laying around on Vercel. That would cost them an arm and a leg with Vercel I’d expect.
80TB for 5M page views works out to 16.7MiB per page view. I assume you're serving images and other assets on the page; otherwise you have room to optimize the page weight.
Yes, you could. The last time someone I know spoke to them, the standard enterprise plans started at $5000/mo and got you some very nice freebies. Cloudflare's also announced managed WebRTC APIs (in closed beta) which look promising for creating meeting rooms ala Zoom.us. I imagine, these are already available to tech-shops on enterprise plans.
Though, mux.com is decent too, from what I've heard; for certain workloads, LiveStream and Vimeo might be cheaper.
There's Peer5, PeerTube, and BitTorrent in the P2P space (among many such solutions).
I have never used mux.com (I have used Vimeo, AWS Elemental, and Cloudflare Streams in the past), but they are fairly well-known as the founders created the very popular video.js library. One key thing that differentiates mux.com from others is they're multi-cdn unlike AWS Elemental / Cloudflare Streams, say.
I think this is the sort of thing you want to ask their sales team.
I thought about doing this a while ago, after I left my job with an ISP. I would just have bought transit from them (and maybe made them regret their 10Gbps for $1000/month plan ;)
I want to pile on: Why is it that bandwidth today is so expensive for things like video? Is there some kind of good-ol-boys network of companies that is preventing bandwidth from becoming so cheap it's not worth metering? What does the future hold?
I am asking this because YouTube seems to have a massive monopoly both on technology as well as content/audience/network-effects. Even if you can get all the people of Youtube to shift over, you still need to solve the problem of bandwidth and the costs associated with that.
Bandwidth is only expensive through cloud providers, to try and squeeze you for it. If you go bare metal, dedicated, or just rack your own servers you can peer with a lot of providers and get essentially free bandwidth.
Cloudflare has their own hardware and so pays a "market price" for bandwidth (which is much lower than what clouds (over)charge) and their business model relies on other value-add products rather than just bandwidth. I wouldn't be surprised if they actually lose money on this deal, but I guess it sometimes pays off to provide loss leaders.
Cloudflare subsidizes the small guys in hopes they'll turn into big guys. Bandwidth isn't that expensive in the first place though, that's just a myth perpetuated by AWS/Azure/Google.
The biggest costs of server bandwidth are due to transmitting data over large distances (since the internet backbone operates on a sender-pays model when unbalanced traffic is involved in peering). By locating CDN nodes globally, Cloudflare can ensure data is sent near the last mile and thus inexpensive to send. Unless ISPs decide to shake it down the way they do with Netflix of course.
Sure, because we're hiring like crazy to build new things. As we said during our last earnings call, we intend to stay at break even as long as we can achieve extraordinary revenue growth and continue to deliver innovative new products.
The more relevant number for this comparison would be our gross margin — how much we have to spend on things like bandwidth and servers to service our customers, divided by the revenue those customers generate — which in Q3 2021 (our last reported quarter) was 78%. Which is… pretty good for a services business like ours.
I don't know the specifics of this customer, but I don't see anything that leads me to believe our margins would be out of the ordinary with them. There are a lot of scale economics in our business. In other words, because we service a lot of customers, we can definitely do things for less money than any one customer could hope to on their own.
It's super nice that you're here and your transparency is appreciated, but don't waste time on people like GP. Just walk your way and don't mind the naysayers.
My two cents from a regular guy to a billionaire :).
Noting they are operating at a slight loss, during a period where they want to drive growth, isn't "naysaying". The question is whether the current pricing is tied at all to the "operating at a slight loss". He replied to say it mostly wasn't.
Remember, for example, when Uber rides were really cheap?
You're right about that, but in this case I think they have that covered. Bandwidth is not really that expensive, and they already have the network and scale effects in their favor; so, if anything, I believe their opex (per customer or whatever) will continue to go down.
Bandwidth is cheap. Cloud providers are insanely expensive for bandwidth. Cloudflare is peering in a lot of locations, making it even cheaper for them.
When I tried CloudFlare, they injected a JavaScript into my site that was bigger than my entire site. That was a couple of years ago, but I can't bring myself to trust them anymore. I also don't like their growing control over the web.