Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
wkat4242
on June 16, 2024
|
parent
|
context
|
favorite
| on:
Cost of self hosting Llama-3 8B-Instruct
No, the ollama default quantisation is 4 bit
brrrrrm
on June 16, 2024
[–]
I meant 8b -> 8billion rather than 70b
wkat4242
on June 16, 2024
|
parent
[–]
Ah sorry!
Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: