Hacker News
Benchmarking LLM Inference Back Ends: vLLM, LMDeploy, MLC-LLM, TRT-LLM, and TGI (bentoml.com)
12 points by sherlockxu 3 months ago | 2 comments



Hello! We're the authors of this blog post. Please let us know if there are other models and inference backends you'd like us to benchmark next.


Great work



