What scale of company do you need to be to actually afford, and get a return on investment from, retraining base models on your own proprietary knowledge and docs? And what about the implications of continually retraining?

I was under the impression that you wouldn't. If you want access to proprietary knowledge, you would use RAG + LLM.

The only experience I have is firsthand: what my company is doing for our client base. We are doing continuous pretraining and the rest of the alignment training stack on about 10B private tokens + private customer data to produce private custom models for companies in the 500 to 3000 employee range. We built and operate a single-rack cluster that cost mid six figures in order to be able to do this.

These models get combined with RAG for highly specific technical doc authoring and other uses.

This is very helpful context on what works right now, thanks for sharing.

I don't think anyone has the answer to this question yet.

