Practical Llama 3 inference in Java | Hacker News

		Practical Llama 3 inference in Java (github.com/mukel)
		4 points by mukel 28 days ago | 1 comment

mukel 28 days ago

Llama3.java features GGUF file format support, Q8_0 and Q4_0 quantizations, and fast matrix/vector multiplication routines built on Java's Vector API. It ships with a simple CLI, including a --chat mode for interacting with the Llama 3 models.
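
For readers unfamiliar with Q8_0, here is a rough sketch of the idea: weights are split into blocks of 32, and each block stores 32 signed 8-bit values plus one scale. This is not code from Llama3.java. The class name `Q8Sketch` and both methods are hypothetical, the scale is kept as a plain `float` (GGUF stores it as a 16-bit float), and the loops are scalar rather than using the Vector API, so that the example runs without incubator-module flags.

```java
// Hypothetical illustration of Q8_0-style block quantization; not the project's code.
final class Q8Sketch {
    // Q8_0 groups values into blocks of 32 that share one scale.
    static final int BLOCK_SIZE = 32;

    // Quantizes x (length must be a multiple of BLOCK_SIZE) into int8 values
    // plus one scale per block. Assumption: scales are stored as float here,
    // whereas GGUF files store them as 16-bit floats.
    static void quantize(float[] x, byte[] quants, float[] scales) {
        int blocks = x.length / BLOCK_SIZE;
        for (int b = 0; b < blocks; b++) {
            int base = b * BLOCK_SIZE;
            float maxAbs = 0f;
            for (int i = 0; i < BLOCK_SIZE; i++) {
                maxAbs = Math.max(maxAbs, Math.abs(x[base + i]));
            }
            float scale = maxAbs / 127f;
            scales[b] = scale;
            // Guard against an all-zero block, where the scale is 0.
            float inverse = scale == 0f ? 0f : 1f / scale;
            for (int i = 0; i < BLOCK_SIZE; i++) {
                quants[base + i] = (byte) Math.round(x[base + i] * inverse);
            }
        }
    }

    // Dot product of a quantized row with a float vector. The scale is applied
    // once per block instead of once per element.
    static float dot(byte[] quants, float[] scales, float[] v) {
        float result = 0f;
        for (int b = 0; b < scales.length; b++) {
            int base = b * BLOCK_SIZE;
            float blockSum = 0f;
            for (int i = 0; i < BLOCK_SIZE; i++) {
                blockSum += quants[base + i] * v[base + i];
            }
            result += scales[b] * blockSum;
        }
        return result;
    }
}
```

The inner loop over each block is the kind of code that a SIMD routine would replace: the 32 products can be computed a lane-width at a time and reduced, with the scale applied once at the end of the block.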
