I'm definitely not at the stage where I can say anything definitive about performance. For a SIMD-intensive task, a preliminary run on a 3.3Ghz eMag core gets about 400 megabytes/second of JSON parsing relative to 2.2 gigabytes/second on a 4.0Ghz Skylake. This may come down to having a maximum of 2 x 128-bit NEON operations per cycle vs 3 x 256 AVX2 operations per cycle, as well as some clock speed differences. Will eventually post about this at branchfree.org when I don my Nomex long johns (required whenever doing any benchmark that might imply to anyone that any processor runs faster or slower than any other processor).
I will give your benchmark a run once I have some time.
Min: 0.00171284 bytes read: 631514 Gigabytes/second: 0.368695
Do note that at this stage the benchmark is so preliminary as to be largely meaningless - I haven't really done much more than eyeball the results. But as a preliminary data point, that seems to be a stronger showing clock-for-clock than the eMag one.
Also building with gcc 8.2 not I needed to change -march=native to -march=armv8-a+crypto
Min: 0.00111424 bytes read: 631514 Gigabytes/second: 0.566768
I would also like to point out that the LX2160 is also a TDP of 32Watts compared to the 125W TDP of the eMag.
Geekbench scores: https://browser.geekbench.com/v4/cpu/compare/11678329?baseli...
Here are the results:
34m9.347s EPYC 7401P 24C/48T 2.2GHz (Packet c2.medium.x86 bare metal server)
75m14.661s eMAG 32C/32T 3.3GHz (Packet c2.large.arm bare metal server)
96m31.901s i5-8259U 4C/8T 2.30-3.80GHz (Intel NUC8I5, NVMe SSD)
139m52.184s ThunderX 96C/96T 2.0GHz (Packet c1.large.arm bare metal server)
194m52.745s A10-6800K 4C/4T 4.1GHz (Old self-built desktop, slow-ish SSD)
535m52.642s Celeron N3150 4C/4T 1.60-2.08GHz (Gigabyte Brix, SSD)
I would not build a high end gaming machine with it. Motherboard tray cable routing cutouts are not well positioned if you have a large, high feature motherboard. I knew this but as I used an mATX board I didn't care. Also, airflow is limited by the solid front panel as is always the case with 'quite' designs.
The supplied fans are excellent but only one front case fan is supplied; I knew this and obtained a second "be quiet!" 140mm fan for my build.
I have mixed feelings about the gauge of steel. Typically quiet focused cases of the sort I've been using for many years now rely in part on heavy steel. The steel in this case is comparatively thin; similar to what you get with OEM machines from Dell or HP. On one hand I miss the rigidity of prior cases, on the other I've been surprised at how happy I am with the reduction in weight.
The size is perfect. It's a little larger in every dimension than a traditional mid tower making assembly and changes easier.
This is a bit of a boutique product; Amazon doesn't have this exact model and it took a while for shipment through newegg; it shipped from the manufacturers US warehouse and took extra time.
I would buy it again.
I did a double take when I saw the Arm workstation but I suppose it's not really surprising. The market is full of windowed, LED riddled gaming cases on one hand and low end 'value' stuff on the other. There are few quality 'grown up' looking cases available. I had 'workstation' in mind when I went hunting for a case and I imagine that's what Ampere was thinking as well; we made thoughtful choices and ended up in the same place.
Mini/Mid/Full tower cases generally are designed to take advantage of heat wanting to raise. So the intakes are often large and low (140mm is not unusual), and the top rear for the exhaust. Even 200mm isn't unusual for the exhaust. Air moving efficient increases quickly with fan size. 1U fans often move at 15k rpm and make more noise and vibration than anything else. Desktop fans are often 1200 rpm or lower and just take a few watts to dump substantial heat.
As an example the Fractal Design Mini C (a small, quiet, under $100 case) has room for 2 x 140mm in the lower front, and 2 x 140mm on top. It's a smaller case, so there's only 120mm in the rear. With $80 ish for the case and a few extra fans (Fractal design isn't bad, but not quite class leading) you can easily dump a few 100 watts quietly.
Find a rack mount case that can move as much air as quietly is challenging and when possible often prohibitively expensive and/or crazy loud. Last time I build one to house a single socket motherboard and a GPU (much like a desktop) it included 4 delta fans that I needed ear protection on to be in the same room.
I just build a rack for my desk. I have not really put it under load, but I have to see the GPU, Power supply or Case fan spinning yet. For now I only have only once fan for the case. The only fan that moves, but silent is the CPU fan. The only thing that you can hear: The 10 TB HDD.
TL;DR: 32 Cores, up to 3.3 Ghz boost, 32MB L3 cache, 125W TDP
So you are right.
I used some cheap ($20 USD!) Xeons (E5-2620v2) and have 24 cores.. so it's doable for $500.
better :) we just announced it.
> work with Linaro on the work they did for the MCBin to get a UEFI plugin that uses a QEMU emulation layer for supporting booting Video Card BIOS
Linaro? I thought the EDK2 for the MCbin was maintained by Semihalf :)
Anyway, 16 cores and dual-channel RAM for $500 is a lot more interesting than 4 cores and single-channel for $269. A few questions:
Will the EDK2 tree be fully open source? There was that one blob on the MCbin…
Will the core be overclockable?
We are waiting for the final release of the EDK2 tree from NXP. I will refrain from promising anything until that is integrated.
The cores are not overclockable. 2.2Ghz will be the limit.
Looks like anyone who makes these 16-core A72 chips is making them for networking (this NXP one, Mellanox Bluefield is a somewhat similar idea I think). So you can't not pay the cost of the NIC that's on the chip already…
The reason to buy a workstation like this is not performance, but rather being able to natively develop ARM software locally. That's the only real killer feature it has over Intel/AMD/IBM processors, but a legitimate reason to buy it given the availability of ARM servers at cloud providers.
Daniel Lemire's single-threaded Mandelbrot benchmark: 15s on the Ampere eMAG (Skylark), 24s on a 4GHz Skylake. (Also wins in bitset_count.) It's definitely faster 1/2 of Sandy Bridge.
eMAG will have a disadvantage in SIMD, but for normal workloads (make -j32 on a huge project :D) it should be plenty fast.
Moreover, the benchmark source code  clearly uses OpenMP to parallelize the benchmark tasks.
Different benchmarks favor different processors… Here's another one, now multi-threaded
My quick run of `sysbench cpu` (something with prime numbers) shows:
Skylark 3.3GHz 32core (at Packet; Ubuntu 18.04):
events per second: 44287.87
events per second: 14632.61
Oh, my mistake; I thought it was 4 logical cores (/threads) vs 32 logical cores. If it's 16 vs 32 logical cores, POWER will positively destroy ARM.
long lived tree of depth 21 check: 4194303
pixels = 7
With sysbench cpu (prime number something), it's 3 times faster than my Ryzen (8c16t 3.85GHz).
if your build target is an arm environment (industrial, embeded?) or you are a uni teaching/researching arm isa, then i imagine it would be a pretty nice bit of kit.
GBPUSD is trading at 1.3