Just tested a i4i.32xlarge: $ lsblk NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINTS loo...

Aachen · 2024-02-20T18:20:12 1708453212

So if I'm reading it right, the quote from the original article that started this thread was ballpark correct?

> we are still stuck with 2 GB/s per SSD

Versus the ~2.7 GiB/s your benchmark shows (bit hard to know where to look on mobile with all that line-wrapped output, and when not familiar with the fio tool; not your fault but that's why I'm double checking my conclusion)

Nextgrid · 2024-02-20T18:12:40 1708452760

If you still have this machine, I wonder if you can get this bandwidth in parallel across all SSDs? There could be some hypervisor-level or host-level bottleneck that means while any SSD in isolation will give you the observed bandwidth, you can't actually reach that if you try to access them all in parallel?

highfrequency · 2024-02-28T21:11:19 1709154679

The aggregate throughput matches the advertised number of 22,400 MB/s: https://aws.amazon.com/blogs/aws/new-storage-optimized-amazo...

dekhn · 2024-02-20T19:06:55 1708456015

Can you addjust --blocksize to correspond to the block size on the device? And with/without --direct=1

zokier · 2024-02-20T18:57:49 1708455469

I wonder if there is some tuning that needs to be done here, it seems suprising that the advertised rate would be this much off otherwise.

jeffbee · 2024-02-20T19:18:15 1708456695

I would start with the LBA format, which is likely to be suboptimal for compatibility.

zokier · 2024-02-20T22:22:22 1708467742

somehow I4g drives don't like to get formatted

    # nvme format /dev/nvme1 -n1 -f
    NVMe status: INVALID_OPCODE: The associated command opcode field is not valid(0x2001)
    # nvme id-ctrl /dev/nvme1 | grep oacs
    oacs      : 0

but the LBA format indeed is sus:

    LBA Format  0 : Metadata Size: 0   bytes - Data Size: 512 bytes - Relative Performance: 0 Best (in use)

jeffbee · 2024-02-20T22:47:30 1708469250

It's a shame. The recent "datacenter nvme" standards involving fb, goog, et al mandate 4K LBA support.

zokier · 2024-02-20T23:56:33 1708473393

it'd be great if you'd manage to throw together quick blogpost about i4g io perf, there obviously something funny going on and I imagine you guys could figure it out much easier than anybody else, especially if you are already having some figures in the marketing.

dangoodmanUT · 2024-02-20T18:35:35 1708454135

that's 16m blocks, not 4k

wtallis · 2024-02-20T19:18:36 1708456716

Last I checked, Linux splits up massive IO requests like that before sending them to the disk. But there's no benefit to splitting a sequential IO request all the way down to 4kB.