I use scientific language models professionally. I skimmed the paper and was immediately disappointed.
- They benchmarked against general models like GPT-3 but not against well-established models trained for specific tasks, like SPECTER[0] or SciBERT[1]. SPECTER outperformed GPT-3 on tasks like citation prediction two years ago. Nobody seriously uses general LLMs on science tasks, so nobody who actually wants to use this cares about your benchmarks. I want to see task-specific models compared to your general model; otherwise, what's going to happen is I either need to run my own benchmarks or, much more likely, I shelve your paper and never read it again. If you underperform somewhat, that's fine! If you don't compare to science-specific models, all you're claiming is that training on science data gives better science results... that's not exactly an impressive finding. Fine-tuning is a separate thing, I get it, but pleeeeeease just give the people what they want.
- Not released on Hugging Face. No clue why not; on the back end this appears to be based on OPT and Hugging Face compatible, so I'm really confused (see the loading sketch below).
- Flashy website. Combine 1 and 2 with a well-designed website talking about how great you are, and most of my warning lights got set off. Not a fan.
@authors, if you're lurking, please release more relevant benchmarks for citation prediction etc. Thanks.
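To be concrete about point 2: if the weights really are OPT-compatible and were put on the hub, loading them would be the same two-line affair as any other OPT checkpoint. A minimal sketch, using the publicly released facebook/opt-1.3b as a stand-in since the Galactica weights aren't up:

```python
# Minimal sketch: loading an OPT-compatible causal LM through the standard
# Hugging Face API. "facebook/opt-1.3b" is a stand-in checkpoint; an
# OPT-compatible Galactica release would plug into the same two calls.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("facebook/opt-1.3b")
model = AutoModelForCausalLM.from_pretrained("facebook/opt-1.3b")

inputs = tokenizer("The Krebs cycle is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```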
My big disappointment, as always with models released by Facebook, is that they're under a non-commercial license, which makes them effectively useless for anything.
They have something like this on the website:
> We believe models want to be free and so we open source the model for those who want to extend it.
IANAL, but it would seem to me this license covers the model itself and not the output of the model.
This is a copyright license for the model, so I think that should just mean you can't sell the model or a derivative of the model.
I guess when it's released you have to fill out some form or tick some box to accept a license agreement, which in practice is a contract saying you won't use it for commercial purposes. But if you were to just download it from somewhere, your only restrictions would be on redistributing it, not on its use.
Open source is not just a term with margin for interpretation: to be open source, you must comply with the 10 rules defined by the Open Source Initiative. Restricting commercial usage goes against rule 6.
You can call it readable source or whatever, but it's not open source as defined by OSI.
"6. No Discrimination Against Fields of Endeavor
The license must not restrict anyone from making use of the program in a specific field of endeavor. For example, it may not restrict the program from being used in a business, or from being used for genetic research."
> to be open source, you must comply with the 10 rules
The Open Source Initiative didn't invent this expression. They worked hard to promote their idea of it and its application. They did a lot of good, but they aren't an authoritative source when it comes to its definition.
The reality is that the vast majority of software developers do not consider a strict conformance to the 10 OSI criteria as being necessary to apply the term "open source".
Maybe they're all just wrong, but it's worth considering why.
> the vast majority of software developers do not consider a strict conformance to the 10 OSI criteria as being necessary to apply the term "open source"
[citation needed]
My counter claim, without citation, is that I actually believe (from experience) that the vast majority of 'open source' projects are in fact released under licenses that already comply with the 10 OSI criteria, and are therefore 'approved' OSI licenses. This is easily witnessed by looking at the licenses of the majority of open source projects — or perhaps even just the most popular ones.
That would seem to go against your claim regarding 'most developers'.
But it's not actually a debate about 'most developers'; it's about the OSS projects out there, not individual devs, no?
Can I ask how you use scientific language models professionally? Or do you have any articles/reviews on how they are being used, and how people see their potential and shortcomings?
Not going to get into details on my own work here, but I'll comment generally on use-cases.
I think a good way to think about scientific language models is that they're useful in exactly the same ways general language models are, but in a very narrow domain (stuff having to do with scientific papers & patents, for the most part).
Use-cases that are possible/useful today:
- Annotation of scientific texts: is this paper about computer science?
- Scientific search: please give me researchers or papers most similar to an input query (a minimal sketch follows this list).
- Helping PhD students graduate (only kind of kidding)
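For the search use-case, a minimal sketch of what this looks like in practice, assuming the allenai/specter checkpoint on the Hugging Face hub and plain cosine similarity over its embeddings (an illustration, not how any particular product does it):

```python
# Sketch: ranking papers against a query with SPECTER-style embeddings.
# The paper titles and query are toy data; any document encoder would
# slot into the same embed-and-compare pattern.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("allenai/specter")
model = AutoModel.from_pretrained("allenai/specter")

papers = [
    "SciBERT: A Pretrained Language Model for Scientific Text",
    "SPECTER: Document-level Representation Learning using Citation-informed Transformers",
    "A Survey of Deep Learning for Image Segmentation",
]
query = "citation-aware embeddings for scientific papers"

def embed(texts):
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        out = model(**batch)
    # SPECTER uses the first-token ([CLS]) embedding as the document vector.
    return out.last_hidden_state[:, 0, :]

query_emb = embed([query])
paper_embs = embed(papers)
scores = torch.nn.functional.cosine_similarity(query_emb, paper_embs)
for score, title in sorted(zip(scores.tolist(), papers), reverse=True):
    print(f"{score:.3f}  {title}")
```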
Use-cases I think will be possible/useful in the foreseeable future:
- Scientific question answering: e.g. ask the model to explain a chemical process
- Scientific advice or guidance: e.g. ask what method might be appropriate in a situation.
- Text completion/editing/etc.: e.g. help me write my paper. You could probably do more of this today if more $ were invested in science models; we're likely ~5 years behind whatever is going on in the "normal" language space.
As far as potential/shortcomings go, I'm really pessimistic. I don't think large language models for science are very useful outside of bespoke projects, or ever will be for people doing serious science. The main issue is that these models are way too general: if you have a specific science problem you want to solve, it's almost always going to be better to train a model to specifically address that problem. You would never, for example, ask a model like Galactica to do what AlphaFold does. Eventually you might be able to, but it's never going to outperform a specific model, so if you're a researcher trying to get the best results, why would you use it?
I should also add, scientists really care about precision. When summarizing a news story exact words might not be that big a deal, but if you're trying to summarize a scientific paper getting a word wrong can REALLY matter. The bar these models need to clear before scientists trust them with tasks where precision matters is likely much, much higher than in other domains.
I think the most likely outcome is that ~75% of LLM use for scientific text outside of academic research papers will be for search related products. That's definitely a place where they can make a big difference: help people find and understand cool papers that are relevant to their research.
My big disappointment is that the model does not provide sources and recommended reading, which is something we can now do and which would increase the usefulness of the model significantly.
I tried it on two topics I am a domain expert in, both using the suggested "lecture notes on …" prompt. It produced rhetorically nice-sounding sentences with little actual content that quickly dissolved into nonsense. I guess to an outside observer that might appear similar to what often happens in academia :)
There is no doubt in my mind that Galactica fine-tuned on these specific datasets will outperform all these previous models. But yeah, someone should definitely do that and perform the benchmarks.
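For the curious, a rough sketch of what that fine-tuning would look like with the Hugging Face Trainer; the checkpoint (facebook/opt-125m) and the toy two-sentence "dataset" are stand-ins, not anything the paper provides, and a real run would use one of the actual task datasets:

```python
# Sketch: fine-tuning an OPT-style causal LM on a task-specific text corpus
# with the Hugging Face Trainer. Checkpoint and corpus are stand-ins.
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

checkpoint = "facebook/opt-125m"  # stand-in for any OPT-compatible model
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint)

# Toy in-memory corpus standing in for a real task dataset.
corpus = Dataset.from_dict({"text": [
    "Paper A cites Paper B because both study citation graphs.",
    "Paper C does not cite Paper D; their topics are unrelated.",
]})
tokenized = corpus.map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=128),
    batched=True, remove_columns=["text"],
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="opt-finetuned", num_train_epochs=1,
                           per_device_train_batch_size=2),
    train_dataset=tokenized,
    # mlm=False makes the collator build causal-LM labels from the inputs.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```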
I've been vaguely following all the AI news on text-to-image and text generation that comes out of promos. But I have no idea how a benchmark for text would work. Is benchmarking subjective? Is it based on accuracy of information? How do you actually measure a benchmark for something like this?
Different benchmarks are performed for different tasks. As there are a lot of things you can use language models for, there are a lot of benchmarks.
With respect to subjectivity it really depends on the task - some tasks are quite amenable to objective classification. One common task for science language models is citation prediction: do these two papers share a citation link? Obviously that's a really simple accuracy metric to report.
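As a toy sketch of how that gets scored (the labels and predictions below are made up purely for illustration):

```python
# Sketch: scoring binary citation-link prediction as plain accuracy.
# `gold` and `predicted` are illustrative stand-ins for a real benchmark's
# labels and a real model's outputs.
gold      = [1, 0, 1, 1, 0, 0, 1, 0]  # 1 = the two papers share a citation link
predicted = [1, 0, 0, 1, 0, 1, 1, 0]  # model's yes/no guesses, same order

correct = sum(g == p for g, p in zip(gold, predicted))
accuracy = correct / len(gold)
print(f"accuracy = {accuracy:.2%}")  # 75.00% on this toy example
```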
Often things are not so simple. An example might be keyphrase extraction - standard practice there is to have grad students sit down with a highlighter and use the terms multiple students agree on (a simplification, but not by much). From there it just gets messier. Are you reporting the accuracy of all keywords identified, or of all sentences correctly processed? What about sentences with multiple keywords? What about sentences with no keywords? Very messy; appropriate metrics can be a real topic of debate.
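To make the highlighter step concrete, a toy sketch: build the gold set from phrases at least two annotators agree on, then score a model's extractions with precision/recall/F1. All names and numbers here are invented for illustration.

```python
# Sketch: agreement-filtered gold keyphrases and precision/recall/F1.
# Annotator labels and model output are toy data.
from collections import Counter

annotations = [
    {"language model", "citation prediction", "transformer"},  # annotator A
    {"language model", "transformer", "scientific text"},      # annotator B
    {"language model", "citation prediction", "benchmark"},    # annotator C
]

# Keep only phrases that at least two annotators highlighted.
counts = Counter(phrase for ann in annotations for phrase in ann)
gold = {phrase for phrase, n in counts.items() if n >= 2}

predicted = {"language model", "benchmark", "transformer"}  # model output

true_positives = len(predicted & gold)
precision = true_positives / len(predicted) if predicted else 0.0
recall = true_positives / len(gold) if gold else 0.0
f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
print(f"gold = {sorted(gold)}")
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```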
[0] https://arxiv.org/abs/2004.07180
[1] https://arxiv.org/abs/1903.10676