They are doing it with custom silicon that has several times the area of 8x H100s. I'm sure they are doing some sort of optimization at execution/runtime, but the primary difference is the sheer transistor count.

https://cerebras.ai/product-chip/




To be specific, a single WSE-3 has the same die area as about 57 H100s. It's a big chip.
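The "about 57" figure checks out from the published die sizes (Cerebras lists the WSE-3 at roughly 46,225 mm²; the H100's GH100 die is roughly 814 mm²). A quick sketch of the arithmetic, using those approximate public figures:

```python
# Rough die-area comparison using approximate published figures:
# Cerebras WSE-3: ~46,225 mm^2; NVIDIA H100 (GH100 die): ~814 mm^2.
WSE3_AREA_MM2 = 46_225
H100_AREA_MM2 = 814

ratio = WSE3_AREA_MM2 / H100_AREA_MM2
print(f"WSE-3 area is about {ratio:.0f}x an H100 die")  # about 57x
```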


It is worth splitting out the stacked memory silicon on both sides of the comparison too (assuming Cerebras is set up with external DRAM). HBM stacks are over 10 DRAM layers now, so the total die area is a good bit more than the package's chip footprint, though the DRAM layers are built on different (and cheaper) process nodes than the logic die.
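The point above, that stacked memory pushes total silicon area well past the visible chip footprint, can be sketched with purely illustrative numbers (the stack count, layer count, and per-die areas below are placeholder assumptions, not vendor specs):

```python
# Illustrative sketch: total silicon area of an HBM-equipped accelerator.
# All figures below are placeholder assumptions, not official specs.
logic_die_mm2 = 814      # assumed GPU logic die area
hbm_stacks = 6           # assumed number of HBM stacks on the package
layers_per_stack = 12    # modern HBM stacks exceed 10 DRAM layers
dram_die_mm2 = 110       # assumed per-layer DRAM die area

memory_area = hbm_stacks * layers_per_stack * dram_die_mm2
total_area = logic_die_mm2 + memory_area
print(f"memory silicon: {memory_area} mm^2, total: {total_area} mm^2")
# Under these assumptions the stacked DRAM dies dominate total silicon,
# even though the package footprint is set mostly by the logic die.
```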


Amazing!



