Hacker News

I don't think this is correct. Five years of power usage for a 4090 comes to about $2,600, giving a TCO of ~$4,300. The RTX 6000 Ada starts at $6k for the card itself.

https://gpuprices.us
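The arithmetic behind those figures can be sketched as price-plus-electricity over the ownership horizon. The card prices, board powers, electricity rate, and 100% utilization below are assumptions (not stated in the comment), chosen to roughly reproduce the ~$2,600 / ~$4,300 numbers:

```python
HOURS_PER_YEAR = 24 * 365

def tco(card_price_usd, watts, years=5, usd_per_kwh=0.13, utilization=1.0):
    """Rough total cost of ownership: card price plus electricity."""
    kwh = watts / 1000 * HOURS_PER_YEAR * years * utilization
    return card_price_usd + kwh * usd_per_kwh

# Assumed: ~$1,700 for a 4090 at 450 W, $6,000 for an RTX 6000 Ada at 300 W.
print(round(tco(1700, 450)))  # ~4262, close to the ~$4,300 in the comment
print(round(tco(6000, 300)))
```

At anything less than 24/7 load the electricity term shrinks proportionally (the `utilization` parameter), which narrows the gap but doesn't close a ~$4,300 starting-price difference.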




To be fair, you need 2x 4090s to match the VRAM capacity of a single RTX 6000 Ada (2x 24 GB vs. 48 GB). There is also the rest of the system you need to factor into the cost. When running 10-16x 4090s, you may also need to upgrade your electrical wiring to support that load, you may need to spend more on air conditioning, etc.

I'm not necessarily saying that it's obviously better in terms of total cost, just that there are more factors to consider in a system of this size.

If inference is the only thing that is important to someone building this system, then used 3090s running at x8 or even x4 bifurcation is probably the way to go. Things become more complicated if you also want the ability to train or do other ML work, since then you will really want to hit PCIe 4.0 x16 on every single card.


With 2x 4090s you will have 2x the speed of an RTX 6000 Ada. So about the same energy per token.

Will need more space, true.
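The claim above reduces to a simple ratio: energy per token is power draw divided by token throughput, so doubling both leaves it unchanged. A sketch with illustrative numbers only (the watt and tokens/sec figures are made up, not measurements):

```python
def joules_per_token(watts, tokens_per_sec):
    # Energy per token is just instantaneous power divided by throughput.
    return watts / tokens_per_sec

# Hypothetical baseline: one card at some power and speed.
single = joules_per_token(300, 50)
# Two cards: twice the power, but (per the comment) twice the throughput.
doubled = joules_per_token(600, 100)
print(single == doubled)  # the ratio is unchanged
```

Whether the premise holds (that 2x 4090s really deliver 2x the throughput at exactly 2x the power of one RTX 6000 Ada) depends on the workload and how hard each card is pushed; the identity above is only as good as that assumption.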


Yeah, after digging more into RTX 6000 Ada cards, I don't see any way they'd be more economical even over many years, no matter how you slice it.



