felix089's comments | Hacker News

Thank you! We currently don't support direct labeling, but if you can extract the text, our platform helps you organize it for fine-tuning. What use case are you looking to train the model for?

Yes, you can fine-tune using plain text completions. You don't need structured conversations unless you want conversational abilities. Plain text works great if you want the model to generate text in a specific style or domain. It all depends on what you're trying to achieve.
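
For reference, a minimal sketch of what a plain-text completion dataset could look like, assuming a generic JSONL layout with a single text field per example (illustrative only, not necessarily our exact import format):

    {"text": "Q3 revenue grew 12% year over year, driven by ..."}
    {"text": "The board approved the updated risk policy, noting ..."}

Each line is one training example, and the model simply learns to continue text in that style and domain.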

Nice.

And about the cost of fine-tuning: is there a difference in price when only training the model on completions?


The cost depends on the number of tokens processed, so fine-tuning on completions costs the same per token as any other data.

Hi, current pricing for Llama 3.1 8B for example is: Training Tokens: $2 / 1M, Input and Output Tokens: $0.30 / 1M. We'll update pricing on the website shortly to reflect this.
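
As a rough worked example at those rates (assuming a single training pass): fine-tuning on a 10M-token dataset costs about 10 × $2 = $20 in training tokens, and 1M input plus 1M output tokens at inference adds roughly $0.30 + $0.30 = $0.60. Multiple epochs multiply the training-token count accordingly.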

Thanks!

To get the outcome you want, RAG (retrieval augmented generation) would be the way to go, not fine-tuning. Fine-tuning doesn't make the model memorize specific content like a book. It teaches new behaviors or styles. RAG allows the model to access and reference the book during inference. Our platform focuses on fine-tuning with structured datasets, so data needs to be in a specific format.

This is a very common topic, so I wrote a blog post that explains the difference between fine-tuning and RAG if you're interested: https://finetunedb.com/blog/fine-tuning-vs-rag
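
To make the difference concrete, here's a minimal RAG sketch in Python. It uses a crude word-overlap retriever instead of a real embedding model, so it only illustrates the workflow (chunk the book, retrieve relevant passages, put them in the prompt at inference), not how you'd build it in production:

    # Minimal RAG sketch (illustrative only, no real embedding model).
    def chunk(text, size=500):
        """Split the source text into fixed-size character chunks."""
        return [text[i:i + size] for i in range(0, len(text), size)]

    def score(question, passage):
        """Crude relevance score: word overlap (a real system would use embeddings)."""
        return len(set(question.lower().split()) & set(passage.lower().split()))

    def retrieve(question, chunks, k=3):
        """Return the k chunks most relevant to the question."""
        return sorted(chunks, key=lambda c: score(question, c), reverse=True)[:k]

    def build_prompt(question, book_text):
        """Stuff the retrieved passages into the prompt sent to the model."""
        context = "\n---\n".join(retrieve(question, chunk(book_text)))
        return f"Answer using only the context below.\n\nContext:\n{context}\n\nQuestion: {question}"

The model never has to memorize the book; it just reads the retrieved passages at inference time.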


I don't think it's a big deal, but you could use your own image or give credit to the OpenAI presentation on YouTube.

These days, I'd say the easiest and most effective approach is to put the whole book in the context of one of the longer context models.

Agreed, for this use case it's probably the easiest way to go.

(and most expensive)

Agreed too

Not really; for something like Gemini, the accuracy and performance are very poor.

The magic behind NotebookLM can't be replicated with fine-tuning alone. It's all about the workflow, from the chunking strategy to retrieval, etc.

For a specific, well-defined use case it's certainly possible to beat their performance, but things get harder when you try to build a general solution.

To answer your question, the format of the data depends entirely on the use case and how many examples you have. The more examples you have, the more flexible you can be.
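
For instance, a conversational dataset is usually a list of role-tagged messages per example, along the lines of the generic sketch below (the exact schema depends on the model and chat template, so treat the field names as illustrative):

    {"messages": [
      {"role": "user", "content": "Summarize this support ticket: ..."},
      {"role": "assistant", "content": "The customer reports that ..."}
    ]}

In JSONL each example would sit on a single line. With only a handful of examples you'd want them to match your target prompts closely, while a larger dataset can cover more varied phrasings.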


Very happy to hear! Please do reach out to us with any feedback or questions via founders@finetunedb.com

Other co-founder here. We offer more specific features around iterating on your datasets and include domain experts in that workflow. And I'd argue you don't necessarily want your datasets locked in with your foundation model provider like OpenAI, so you keep the option to test with, and potentially switch to, open-source models.

Thank you, and yes that is possible. Which model are you looking to fine-tune?

If that's the case then I'll try the platform out :) I want to fine-tune Codestral or Qwen2.5-coder on a custom codebase. Thank you for the response! Are there any docs or info about the compatibility of the downloaded models, i.e. will they work right away with llama.cpp?

We don't support Codestral or Qwen2.5-coder out of the box for now, but depending on your use case we could certainly add them.

We use LoRA for smaller models and QLoRA (quantized) for 70B+ models to improve training speed, so when you download the model weights, what you get is the adapter weights plus adapter_config.json. That should work with llama.cpp!
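
For anyone who prefers to merge the adapter into the base model first (e.g. before converting to GGUF for llama.cpp), a rough sketch with Hugging Face transformers + PEFT, assuming the download follows the standard PEFT adapter layout and using a hypothetical base model id, would be:

    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    base = "meta-llama/Llama-3.1-8B"      # hypothetical base model id
    adapter_dir = "./downloaded-adapter"  # folder with adapter_config.json + adapter weights

    model = AutoModelForCausalLM.from_pretrained(base)
    model = PeftModel.from_pretrained(model, adapter_dir)
    merged = model.merge_and_unload()     # fold the LoRA weights into the base model

    AutoTokenizer.from_pretrained(base).save_pretrained("./merged-model")
    merged.save_pretrained("./merged-model")  # this folder can then be converted for llama.cpp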


Thanks! We have a free tier with limited features. Our pro plan starts at €50 per seat per month and includes all features. Teams often collaborate with domain experts to create datasets. And for custom integrations, we offer custom plans on request.

More details here: https://docs.finetunedb.com/getting-started/pricing

Any specific features or use cases you're interested in?

