
The quality does not yet quite match ChatGPT (even 3.5), but yes, it is possible.

Probably the fastest way to get started is to look into [0], which only requires a beta Chromium browser with WebGPU. For a more integrated setup, I am under the impression that [1] is the main tool used.

If you want to see what quality is possible before getting started, [2] is an online service by Hugging Face that hosts one of the best of the current generation of open models (OpenAssistant with 30B LLaMA).

[0]: https://mlc.ai/web-llm/

[1]: https://github.com/oobabooga/text-generation-webui

[2]: https://huggingface.co/chat



I downloaded a version of that OpenAssistant model for llama.cpp and it’s at least on par with GPT-3, or a little beyond. It’s at the level of being generally useful.



