Is this a good test case for all of these competing open-source (and even closed-source) LLMs:
feed each model a list of YouTube/SoundCloud-quality “artist + song title” strings and ask it to clean them up, figure out how to split/parse them into CSV or JSON, and then identify each song’s genre (a hypothetical before/after is sketched below)
I want to make sure I’m not being too harsh when I criticize these as useless if they can’t do this “basic” task, because I’m pretty sure I got GPT-3.5 to do it reasonably well for about $0.50 with no token-cost optimization.
I’m just curious why people are so infatuated with, and putting so much effort into, all of these other open-source models if they can’t complete this basic task.
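For concreteness, here’s a hypothetical before/after for the task; the raw strings and genre labels below are invented for illustration:

    # Hypothetical raw strings of the kind you'd scrape from YouTube/SoundCloud.
    raw = [
        "Daft Punk - One More Time (Official Video) [HD]",
        "tame impala the less i know the better lyrics",
    ]

    # The cleaned, structured records you'd want the model to emit.
    expected = [
        {"artist": "Daft Punk", "title": "One More Time", "genre": "house"},
        {"artist": "Tame Impala", "title": "The Less I Know the Better",
         "genre": "psychedelic pop"},
    ]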
If you fine-tune them to be task-specific, they'll perform well. In my experience, that fine-tuning control loop is a better investment than "prompt engineering". (When I say task-specific, I mean very task-specific.)
GPT-4 over the API is too fine-tuned, which constrains its behavior: it fails to capture nuance in instructions. When you have the actual bag of weights, you can genuinely control your model. Having that control, and understanding the infrastructure it runs on, helps you meet actual SLAs.
And it's cheaper, if you're not backed by infinite venture money.
Could you explain what you mean by fine-tune? For example, I don't have ground-truth answers for what the parsed songs + identified genres should look like in JSON. Are you saying I'd have to train the model on known answers, and then maybe it could predict with some accuracy going forward?
I don't see how this warrants all the extra excitement around LLAMA2, etc.
I still haven't found a "good enough" test case for my own personal niche.
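To make the "known answers" idea concrete: here is a minimal sketch of what a supervised fine-tuning dataset could look like, where the examples, labels, and file name are all hypothetical. The idea is that you hand-label (or have GPT-3.5 label) a batch of input/output pairs and train on those:

    import json

    # Hypothetical labeled pairs: messy input string -> the JSON you want back.
    # A few hundred of these is a typical starting point for task-specific tuning.
    examples = [
        {
            "prompt": "Parse: Daft Punk - One More Time (Official Video) [HD]",
            "completion": json.dumps(
                {"artist": "Daft Punk", "title": "One More Time", "genre": "house"}
            ),
        },
    ]

    # One JSON object per line ("JSONL") is the common format for tuning pipelines.
    with open("train.jsonl", "w") as f:
        for ex in examples:
            f.write(json.dumps(ex) + "\n")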
I think whether it's worth investing in OSS/DIY depends a lot on the scale of what you're trying to do. If you're one person looking to do a one-off task like organizing some of your own music, then you're correct that it's probably not worth the time and effort to get an open-source model to do it for you. Just pay $0.50 and be done with it! But if you want to build an app that does that for people, and you want to host it for free/cheap, the costs could add up quickly. And especially if you're a company with a language task that will have lots of users, the up-front R&D cost can definitely be worth it to save on usage costs.
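A rough back-of-envelope on that scaling point, with entirely made-up numbers:

    # Hypothetical costs: hosted API vs. renting a GPU for an open-source model.
    api_cost_per_request = 0.002    # USD per request, invented figure
    requests_per_month = 1_000_000  # invented traffic for a popular app
    gpu_rental_per_month = 600.0    # USD per month, invented figure

    api_bill = api_cost_per_request * requests_per_month  # $2,000/mo at these numbers
    print(f"API: ${api_bill:,.0f}/mo vs. self-hosted: ${gpu_rental_per_month:,.0f}/mo")

The crossover point obviously depends on real traffic and real prices, but the shape of the argument is that per-request API costs grow linearly with usage while self-hosting is roughly flat.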
I was trying to argue “the open source models can’t do it in their current state”
I’m curious what a single person like myself could do, time-investment-wise, to “tweak” LLAMA2 into handling a task it can’t do by default (something like the LoRA sketch below).
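The usual single-person-scale answer is parameter-efficient fine-tuning. Here is a minimal sketch using the Hugging Face transformers and peft libraries; the model ID is the real Llama-2 checkpoint (which requires access approval from Meta), but the hyperparameters and the train.jsonl hand-off are illustrative assumptions, not a recommendation:

    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import LoraConfig, get_peft_model

    # Assumes you've been granted access to the Llama-2 weights on the Hub.
    base = "meta-llama/Llama-2-7b-hf"
    tokenizer = AutoTokenizer.from_pretrained(base)
    model = AutoModelForCausalLM.from_pretrained(base)

    # LoRA trains small adapter matrices instead of all 7B weights, so the
    # job can fit on a single GPU. r and alpha are common defaults, not tuned.
    config = LoraConfig(
        r=8,
        lora_alpha=16,
        lora_dropout=0.05,
        target_modules=["q_proj", "v_proj"],
        task_type="CAUSAL_LM",
    )
    model = get_peft_model(model, config)
    model.print_trainable_parameters()  # typically well under 1% of the weights

    # From here you'd run a standard supervised training loop (e.g. the
    # transformers Trainer) over prompt/completion pairs like train.jsonl above.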