Ask HN: What's the best LLM model that on a 24 GB VRAM GPU? | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		Ask HN: What's the best LLM model that on a 24 GB VRAM GPU?
		3 points by max93 2 hours ago \| hide \| past \| favorite \| 2 comments
		What’s the best model right now that outperforms Qwopus3.6-27B-v2-MTP-GGUF 8-bit on a 24 GB VRAM GPU? Looking for real reviews. I found 4 bit not usable in production.
		help

jr_isidore 1 hour ago | [–]

Good question. I was told in 2024 to get an RTX 3090 (24 GB VRAM), so I did, and nothing on HuggingFace was usable.

roscas 1 hour ago | [–]

I would love to see Laguna.xs-2 on that, because it's very good on cpu only.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact