Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
Practical Llama 3 inference in Java
(
github.com/mukel
)
4 points
by
mukel
28 days ago
|
hide
|
past
|
favorite
|
1 comment
mukel
28 days ago
[–]
Llama3.java: featuring .GGUF file format support, Q8_0 and Q4_0 quantizations, fast matrix/vector multiplication routines using Java's Vector API; served by a simple CLI with a --chat mode to interact with the Llama 3 models.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: