Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
Ask HN: What's the best LLM model that on a 24 GB VRAM GPU?
3 points
by
max93
2 hours ago
|
hide
|
past
|
favorite
|
2 comments
What’s the best model right now that outperforms Qwopus3.6-27B-v2-MTP-GGUF 8-bit on a 24 GB VRAM GPU? Looking for real reviews. I found 4 bit not usable in production.
help
jr_isidore
1 hour ago
|
next
[–]
Good question. I was told in 2024 to get an RTX 3090 (24 GB VRAM), so I did, and nothing on HuggingFace was usable.
reply
roscas
1 hour ago
|
prev
[–]
I would love to see Laguna.xs-2 on that, because it's very good on cpu only.
reply
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
reply