The benchmarks appear to show AWS winning by a considerable margin in every test, and it is cheaper at all quoted price points, yet it barely edges out Azure in the cost-performance table. This appears to be because Azure's reserved pricing was used while AWS and GCP were priced on demand. That's misleading, as both GCP and AWS also offer discounts for yearly commitments.
Also, the RAM ratios for the machines differ across vendors. E.g., in the blog post, the AWS machines have 2 GB/vCPU vs. 4 GB/vCPU for GCE. Whether that matters does depend on the workload, though.
The article ends up feeling rather overwhelming. The main difference, and the advantage AWS holds, is that it uses the newer, more advanced (and more expensive) Neoverse V1 design, while GCP and Azure are based on the older, cheaper, less performant Neoverse N1. These gaps are largely down to how the chips were designed by ARM. One could argue that AWS also adds its own secret sauce, but so far that seems unlikely. A cursory search turns up a Phoronix article [0] with a much more in-depth comparison between V1 and N1 (via AWS's c7g vs. c6g instance types). There are also upcoming N2 and V2 designs; NVIDIA's Grace CPU is reportedly based on V2, which will be interesting to watch.
The 41% discount thrown in at the end for Azure, without any explanation, was also jarring. Maybe there truly is a promotional rate for Azure's ARM instances, but as another poster pointed out, it's more likely reserved pricing, which is available from all the providers.
There is no reason not to run AWS offerings like RDS and ElastiCache on ARM for instant cost savings. Performance is comparable to Intel/AMD, so I see no reason not to use ARM even if your own code isn't compatible.
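As an illustrative sketch: the family pairs below are real current-generation RDS classes, but the mapping table is mine, not an official API, so verify against AWS's instance-class docs before relying on it.

```python
# Hypothetical helper: translate a few common x86 RDS instance classes
# to their Graviton counterparts. Illustrative only; the family pairs
# are real, but coverage is deliberately incomplete.
X86_TO_GRAVITON = {
    "db.m6i": "db.m6g",
    "db.r6i": "db.r6g",
    "db.t3": "db.t4g",
}

def graviton_class(instance_class: str) -> str:
    """Translate e.g. 'db.r6i.large' -> 'db.r6g.large'.

    Unknown families are returned unchanged.
    """
    family, _, size = instance_class.rpartition(".")
    return f"{X86_TO_GRAVITON.get(family, family)}.{size}"

print(graviton_class("db.r6i.large"))  # db.r6g.large
```

The actual switch is then just a modify-instance call with the new class; for RDS it's typically an in-place change with a short restart.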
I have no doubt that AWS will also switch over services like DynamoDB, ELB and countless others that don't let you choose a machine type, to improve their cost basis.
It was mentioned in one of the topics during AWS re:Invent 2020. I just spent a minute searching the Internet for "aws graviton market share" and various sources cite everything from 10% to 15%.
Thanks, very interesting for sure. I have managed 1000s of AWS instances and never seen an ARM one. I guess it depends on your application stack and business.
Was hoping to see the results, but the images were all blurry dummy placeholders (some sort of lazy loading for images?). The images had a "click here for preview" label, but clicking it did nothing (I run ad blockers, so maybe some JS code was disabled).
But what the heck, this is not how webpages should work: click here to "unblur" the image? Why add such an extra step? To save bandwidth costs?
There seems to be a proper lazy-loaded image (with low-res + high-res) with a thumbnail (low-res) on top; and the thumbnail is removed with JS code.
This is probably meant as a fallback to look nicer on browsers which don't support lazy-loading, at the expense of breaking browsers which don't support JS or fail to execute it.
AFAIK, that's because the CDN provides the ability to serve the original images as WebP. I'll let the maintainers know about this case; may I ask which browser you're using?
this is a feature of asynchronously loading the "heavy" images so they don't block loading of the text and markup. The "click to unblur" appears because automatic loading failed, and as a fallback the page uses a CTA (i.e., an event explicitly triggered by the user) to retry loading the images. CTAs have a higher privilege for fetching content.
but without the virtualization overhead. Considering how much work Intel/AMD have put into virtualization, it would be interesting to see whether the overhead is larger or smaller on ARM.
Many (though not all) of those mitigations remain important even for single-tenant remote servers, lest you get your keys or other secrets exfiltrated.
Not to be forgotten: per-core shared tenancy. I'd love to be proven wrong, but I doubt AWS allocates a full core for every non-burstable Arm "vCPU" sold.
It was my understanding Amazon does in fact do this for most instance types, it's why the amount of RAM and other resources scales with CPU core count.
There are exceptions though in the very small "burstable" etc instances, like the micro etc.
EDIT: Seems I am wrong.
"Each vCPU is a thread of a CPU core, except for T2 instances and instances powered by AWS Graviton2 processors."
The exception for Graviton here is because every ARM vCPU is a full core, whereas an Intel/AMD vCPU is one hyperthread (i.e. half a core). They're not saying Graviton2 cores are shared. For Intel/AMD, you always get allocated both hyperthreads on a dedicated core; that's why hyperthreaded families start at 2 vCPUs. Only the "T" families feature shared cores.
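That vCPU-to-core arithmetic can be sketched as a tiny rule of thumb (the instance family names in the comments are just examples):

```python
# vCPU-to-physical-core rule described above, for non-burstable
# instance families: a Graviton vCPU is one full core; an x86 vCPU is
# one hyperthread, and two hyperthreads share each core.
def physical_cores(vcpus: int, graviton: bool) -> int:
    """Physical cores backing the vCPUs of a non-burstable instance."""
    if graviton:
        return vcpus        # one full core per vCPU
    return vcpus // 2       # two hyperthreads per core

print(physical_cores(16, graviton=True))   # 16 (e.g. a 16-vCPU c7g)
print(physical_cores(16, graviton=False))  # 8  (e.g. a 16-vCPU c6i)
```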
Google does; this was a real sticking point for me when I was evaluating cloud options. At the time, GCP was the only provider allocating (and affining) CPU cores to tenants.
I think the title is misleading: it is not a server performance comparison, just a comparison of a single application running a single kind of workload on a fixed dual-core setup.
It also lacks basic insight into the results. E.g., if both AWS and GCP are using ARM's Neoverse V1 design, why is there such a significant performance gap between those two servers? Maybe it was just caused by some bad software configuration in the stack, which could void all the relevant results?
When there are up to 64 cores available, why were only 2 cores used? Surely the other 62 cores are relevant, right?
Only Amazon, i.e. AWS uses Neoverse V1.
Google uses Neoverse N1.
Neoverse N1 is a much older and slower CPU, derived from Cortex-A76.
Neoverse V1 has a performance similar to Cortex-X1, but it also implements SVE (vector extension), which allows improved performance for floating-point applications.
The big ARM cores from the Cortex-X and Neoverse V series have more execution units, up to a double number, in comparison with the medium-size ARM cores from the Cortex-A and Neoverse N series.
For applications without much floating-point or large-integer computation (where the Intel/AMD CPUs are the best), the Cortex-A/Neoverse N cores have a performance somewhere between a thread and a core of the Intel/AMD CPUs, so you need a 128-core Neoverse N CPU to beat a 64-core AMD CPU (which has 128 threads).
For the same kind of applications, the Neoverse V cores have a performance similar to (slightly older) Intel/AMD cores, so a 64-core Neoverse V CPU can match a (previous generation) 64-core AMD CPU.
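That rule of thumb can be written out as a deliberately crude model. The per-thread weights below are the estimates stated above, not benchmark measurements:

```python
# Back-of-envelope throughput model, in arbitrary "x86-thread" units.
# Assumptions (from the comment above, not from benchmarks):
#   - one Neoverse N core ~ one x86 SMT thread (weight 1.0)
#   - one Neoverse V core ~ one x86 core       (weight 2.0)
def relative_throughput(cores: int, threads_per_core: int,
                        weight_per_thread: float) -> float:
    return cores * threads_per_core * weight_per_thread

amd_64c = relative_throughput(64, 2, 1.0)   # 64 cores, SMT2 -> 128 units
n1_128c = relative_throughput(128, 1, 1.0)  # needs 128 N1 cores to match
v1_64c = relative_throughput(64, 1, 2.0)    # 64 V1 cores already match

print(n1_128c >= amd_64c and v1_64c >= amd_64c)  # True
```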
So... the news here is that GCP seems to be significantly slower in practice, where all three products are priced at the same spot.
But... the weird question is why the Azure financials are funny. The listed prices are about the same, but Azure's "annual cost" seems to include a "41% off" discount that I can't find an explanation for. And then they use that discounted figure in the price-efficiency calculation, which seems... what? This is the kind of thing Comcast does in its marketing. What is that discount, and what's the justification for assuming that all users get it in perpetuity?
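To see how much an unexplained discount like that distorts a price-efficiency column, a quick sketch (all prices and scores below are hypothetical, not the article's):

```python
# How mixing pricing models skews a cost-performance table.
# Numbers are made up for illustration.
def cost_performance(score: float, hourly_price: float) -> float:
    """Benchmark score per dollar of annual compute."""
    return score / (hourly_price * 24 * 365)

# The same machine, priced two ways:
on_demand = cost_performance(score=100.0, hourly_price=0.10)
reserved = cost_performance(score=100.0, hourly_price=0.10 * (1 - 0.41))

# A 41% discount inflates apparent cost-performance by ~69%, so
# applying it to one vendor only makes the comparison apples to oranges.
print(round(reserved / on_demand, 2))  # 1.69
```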
Do you need a card to sign up for the free tier? I can't recall if they asked for one at initial registration, but I had to add a card and go through the fraud-detection BS (basically waiting close to a week to be approved) when I upgraded to the paid tier.
Busy playing with this at the moment (trying k8s)... their availability is quite patchy. Struggling to get 4x ARM instances even created, let alone reliably created via Terraform.
Probably not all that favorably, considering that macOS (more precisely, XNU and APFS) isn't really optimized for server use. Hell, to my knowledge most of the networking drivers are quite old, if not poached wholesale from early FreeBSD releases.
Running it on Asahi might tell a different story, but it's almost completely irrelevant considering how painful it is provisioning Macs as opposed to Graviton instances.
M1/M2 will probably be significantly ahead per core, especially price/performance-wise. But yeah, entirely different products: even if it might be worth it on paper, stacking 10 Mac minis is not a real alternative to something like an Altra Q80-30, even though it might be a bit cheaper.
Would be cool if Apple reintroduced the Xserve. Extremely unlikely, though, considering that server margins are way too low for Apple to even consider it, and they really can't bet that their CPUs will be ahead of everyone else's indefinitely.
Have you seen Monterey benchmarks compared against Linux on identical hardware? Performance for server applications and IO-constrained operations consistently favors Linux.
It comes down to the resulting cost and the resources required to reach the SLOs of your services. The major factors are the quality of arm64 support in your workloads and the specific optimizations available to them (e.g. code that relies on AVX-512 vs. something that benefits from SHA256 + AdvSIMD tricks).
Overall, compiled languages like Rust and C/C++ have very good support thanks to years of dev effort invested in LLVM/GCC for arm64. C# will also become one of the better-running platforms starting with .NET 7 (6 is OK, but 7 improves perf dramatically and vectorizes many code paths). Can't say anything about Java, though.
All in all, Graviton 2/3 on AWS is supposed to provide cheaper and denser compute which are its main selling points at the moment.
Azure for example will sell you an 8 vCPU x64 VM for more than an 8 vCPU ARM VM, but the ARM VM vCPUs are entire physical cores, whereas the Intel-compatible VM vCPUs are hyperthreads. So you get more cores and more cache for less money with ARM. For some workloads this makes a big difference.
While true, the Ampere cores are weaker than Intel/AMD ones. It remains an open question how much juice you get out of such hyperthreaded vCPUs at capacity vs. Ampere. Graviton 3, OTOH, is much closer in performance to what is usually offered in Xeon/Epyc.
My employer has been achieving decent (around 15%) cost savings on most services we migrate to ARM on AWS. Not all, though. Some of our services we had to revert to x64 because they didn't perform as well on ARM.
As always, you should measure and decide based on your own software and requirements. My personal experience has been that for throughput-oriented workloads the Graviton stuff tends to be cheaper than x86-64 EC2 instances, but the latest x86-64 instances have better straight-line single-core performance so they are more appropriate for critical latency-sensitive services, and cost more.
It would also be useful to compare AMD/Intel chips with ARM chips on security issues arising from speculative execution, etc.
The AMD/Intel server chips are used more intensively so they have greater scrutiny. Compensating for that, would it be possible to know which implementations are objectively better from a security standpoint?
If developers are using MacBooks, the main benefit is that you don't need to maintain both arm64 and x86 support for application/Docker images.
It's not simple to maintain support for both, and the only way to be sure an application works as expected is to run CI pipelines for both architectures, which can get quite expensive.
You don't have to pay the Intel premium, so you get better performance/$. I'd be willing to bet AWS uses Graviton internally for many of their services like S3, DynamoDB, etc. One thing to keep in mind is that Graviton doesn't use hyperthreading, so for the same vCPU count you get twice the number of physical cores compared to x86.
Assuming your workload can run on ARM, you should think of it as a different CPU generation or AMD vs Intel. For some applications, you might care about speed. Others, +/- 5ms isn't an issue. Then you see how much throughput each supports. Finally, you go through the total cost math. It's hard to make a blanket statement about which will be better for your usecase.
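A hedged sketch of that last step, the total cost math (every number below is made up for illustration):

```python
import math

# Fleet-level annual cost: how many instances you need to hit a target
# throughput, times the per-instance price. All inputs are hypothetical.
def annual_cost(required_rps: float, rps_per_instance: float,
                hourly_price: float) -> float:
    instances = math.ceil(required_rps / rps_per_instance)
    return instances * hourly_price * 24 * 365

# A slower-but-cheaper ARM instance can still win at the fleet level:
x86 = annual_cost(10_000, rps_per_instance=1_250, hourly_price=0.17)
arm = annual_cost(10_000, rps_per_instance=1_000, hourly_price=0.13)

print(arm < x86)  # True: more ARM boxes, but a cheaper fleet overall
```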
It's an auction price. I'm paying about half the normal cost on average.
The 'high' price was set as a ceiling in case my instances got killed because other AWS users outbid me. But the spot price has never risen that high.
The instances were killed anyway, so the cause must be some technical glitch, not the price.
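That "never outbid" claim is easy to check against the spot price history; a minimal sketch (the prices below are made up):

```python
# If the spot price never rose above the bid, the interruptions were
# not price-driven. The price history here is hypothetical.
def outbid_events(price_history: list, bid: float) -> int:
    """Count samples where the spot price exceeded our bid."""
    return sum(1 for p in price_history if p > bid)

history = [0.021, 0.019, 0.024, 0.022]  # made-up hourly spot prices
print(outbid_events(history, bid=0.10))  # 0 -> we were never outbid
```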
It's a CPU- and RAM-intensive process. Something in the VM governor kills VM processes if they exceed some threshold, and that's what is killing my instances.