Hacker News

IME, Ollama ran Mixtral on a 1070 fast enough.



Though it most probably does not run on the 1070 but rather on the CPU. It cannot fit on a 1070; it's not about speed, a 1070 cannot run it, period.


In llama.cpp you can offload some of the layers to the GPU with -ngl X, where X is the number of layers.
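A rough back-of-the-envelope sketch of how many layers you could offload. The numbers are assumptions, not measurements: Mixtral 8x7B at Q4_K_M is roughly 26 GiB as a GGUF, has 32 transformer layers, a GTX 1070 has 8 GiB of VRAM, and some VRAM is reserved here for context and scratch buffers.

```python
# All constants are assumptions for illustration, not measured values.
MODEL_GIB = 26.0     # assumed size of Mixtral 8x7B Q4_K_M GGUF
N_LAYERS = 32        # assumed transformer layer count
VRAM_GIB = 8.0       # GTX 1070 VRAM
RESERVED_GIB = 1.0   # assumed headroom for KV cache / scratch buffers

per_layer = MODEL_GIB / N_LAYERS
offloadable = int((VRAM_GIB - RESERVED_GIB) // per_layer)
print(f"~{per_layer:.2f} GiB per layer; try -ngl {offloadable} of {N_LAYERS}")
```

Under these assumptions only a small fraction of the layers fit on the card, which matches the point above: the bulk of the model still runs on the CPU, the GPU just speeds up the offloaded portion.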



