Great benchmark, very interesting. However, I am not sure about extrapolating the H200 numbers from the Lambda benchmark. From my understanding, Lambda's benchmark and theirs used different models (Llama 405B vs. Mistral 123B), with different benchmarking setups and inference libraries. Since the study focuses on memory-hungry scenarios, I am really curious why they used the H100 instead of the H200.
Yes, it's a different model and backend, and obviously the extrapolation will never be as good as experimental values.
But:
1. We only used the multiplier value of 3.4, not the exact throughput from Lambda's experiment (rough sketch of the calculation after this list).
2. We also used the same input/output sequence lengths as Lambda's experiment.
3. Our extrapolated value is also in line with the H200's specs when compared to the MI300X.
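To make the extrapolation concrete, here is a minimal sketch of the calculation. Only the 3.4 multiplier comes from Lambda's experiment; the H100 throughput below is a placeholder, not a measured value from either benchmark.

```python
# Rough sketch of the H200 extrapolation (illustrative numbers only).
H100_MEASURED_TOK_PER_S = 1000.0   # placeholder: throughput measured on the H100 in our setup
LAMBDA_H200_MULTIPLIER = 3.4       # H100 -> H200 speedup ratio taken from Lambda's experiment

# Scale our own H100 measurement by Lambda's ratio, keeping the same
# input/output sequence lengths as Lambda's run.
h200_extrapolated = H100_MEASURED_TOK_PER_S * LAMBDA_H200_MULTIPLIER
print(f"Extrapolated H200 throughput: {h200_extrapolated:.0f} tokens/s")
```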