Hacker News new | past | comments | ask | show | jobs | submit | Muhtasham's comments login

Thanks for detailed analysis, would be curios to see FP8 comparison too, given vllm has some custom kernels

Unfortunately, neither VLLM nor TGI support FP8 on AMD yet. But once they do, we will look into it.

it’s serious crime


Super useful for spinning LLM fine-tuning jobs!


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: