Most Jobs I've worked at would've forbidden the use of personal subscriptions like that, as you'd be effectively uploading their intellectual property to foreign actors.
This can end with more then a termination - as in being literally liable/on the hook for serious contract violations.
So ymmv, you may want to take care with such an approach
People usually make the determination by reading at least part of the text and then find multiple smoking guns / llm-isms
The comment you responded to did not have those.
Fwiw, the article we're commenting on was likely not LLM written. The sentence structure is too convoluted, no LLM would've generated it like that - unless very carefully prompted ... But at that point it's no longer pure AI slop (imo).
Isn't that precisely the reason why we introduced the term hallucination? Because llms have historically always made up bullshit of they cannot answer directly... If they now nailed this to maybe the model not respond instead of responding incorrectly, then a lot of previously unusable usecases would become feasible.
So I feel like that's exactly the right metric and the way to track it wrt hallucinations.
The point is that it's not a useful metric on its own. For example, redirecting from /dev/null also achieves a zero hallucination rate.
We want the hallucination rate to decrease while the overall answer rate of queries remains sufficiently high. For more specifics, look into ROC and AUC.
Also if I were to guess the damages because of sci hub is higher than Anthropic training the models. I don't think I know anyone who didn't bought a book because the summary is available or they can ask about it to AI.
All AI companies should be forced to re-train their models without the offending materials, and this should also extend to all LLMs distilled from models exposed to copyrighted works. Also cover code under licences such as GPL as well. Not to mention patents and designs. This whole LLM business is a giant IP laundromat.
I see, i actually like these tells. It let's us easily distinguish garbage from someones thoughts.
And you can also see how brainrotten someone's gotten when they start accidentally sneaking in these tells into their normal communication.
As a matter of fact, after a full workday in which I'm essentially forced to read LLM garbage for 9h a day... I sadly notice myself adding the same fluff pointlessness to how I express myself.
like I caught a viral contagion that's actively siphoning my humanity away.
And expectedly, when coming back to those opinions with a less infected mindset, I frequently have to reevaluate these thoughts later on
There never was.
There were just a few people profiting from ads trying to gaslight you into believing there was.
reply