Hacker News

brucethemoose2 · 2023-10-09T18:38:27

I do. But my old laptop 2060 and (sometimes) my 3090 are running on the AI Horde instead:

https://lite.koboldai.net/#

I don't really see the point in hosting a tiny fraction of raw Falcon 180B or whatever when I can fully host a small finetune, of my choosing, that will perform better for what users actually want (which is mostly RP/Chat at the moment). And I can serve dozens of users on a single 2060 with a Mistral finetune.

Now, if it was a finetuning cluster, I would throw my 3090 at it without even blinking, but my understanding is that Petals doesn't finetune yet.

londons_explore · 2023-10-09T18:38:26

Plenty of people happily would...

I personally would do it in return for the electricity cost of the GPU's and machine. That's my marginal price to do so.

However, that offer is only valid if a project can guarantee my computer experience won't be degraded. So it needs to reliably run only during idle time, or, if it wants to run while I'm using the machine it can't make any application laggy or throw errors about not having enough GPU RAM.

I think there are probably millions of GPU's available out there with similar terms to mine.

gwern · 2023-10-09T18:40:44

I think this could use a bit more commentary, OP.

hexo · 2023-10-09T18:40:41

Benefit how exactly? This title/subject is way too biased.