If the marketing naming is to be believed, in 1.4nm vs 4nm you'd be able to fit ~twice the transistors in your chip. That's twice the cores, twice the cache... That usually makes it faster.
That's what noise ordinances are for, and noise ordinances still allow you to engage in free speech. You just have to keep it reasonably quiet during the night.
This you? What made your stance on this subject change?
fuzzbazz 10 months ago | parent | context | prev | next [–] | on: Brazilian court orders suspension of X
How can you even have democracy without freedom of speech?
How can you freely choose who to vote without free exchange of information?
OP did not at all imply they felt that way, rather was implying an argument on behalf of the drafters.
God, this forum is becoming more like reddit every day with folks ignoring commenting rules and assuming bad faith. (of which ,I am doing myself right now.)
Can't verify from the low quality photo in the article.
This [1] press release from the Southern Environmental Law Center hints to a possible reason - they may be migrating from small to bigger turbines:
> Aerial images obtained by SELC revealed 35 turbines at the site in March (...) while the company has removed some smaller-sized turbines, it has recently installed three larger turbines
From a quick web search I can find that there are book review sites that allow users to enter and rate verbatim "quotes" from books. This one [1] contains ~2000 [2] portions of a sentence, a paragraph or several paragraphs of Harry Potter and the Sorcerer's Stone.
Could it be plausible that an LLM had ingested parts of the book via scrapping web pages like this and not the full copyrighted book and get results similar to those of the linked study?
This is in fact mentioned and addressed in the article. Also, there is pretty clear cut evidence Meta used pirated book data sets knowingly to train the earlier Llama models
The fact that Meta torrented Books3 and other datasets seems to be by self-admission by Meta employees who performed the work and/or oversaw those who themselves did the work, so that is not really under dispute or ambiguous.
My comparison was illustrative and analogous in nature. The copyright cartel is making a fruit of the poisonous tree type of argument. Whatever Meta are doing with LLMs is doing the heavy lifting that parity files used to do back in the Usenet days. I wouldn’t be surprised if BitTorrent or other similar caching and distribution mechanisms incorporate AI/LLMs to recognize an owl on the wire, draw the rest just in time in transit, and just send the diffs, or something like that.
The pictures are the same. All roads lead to Rome, so they say.
What are your thoughts on the origin of the LLaMA leak? It's interesting that the training data was torrented, and so was the leak. Perhaps we will never know? For the OSINT folks, not a lot to go on, or maybe a lot, depending?
I didn’t ask for info, I asked for your views. I gave you all the info anyone has publicly, so you have enough to comment.
I suspect that it was a limited hangout self-own by Meta to claim that they aren’t responsible, and then they are doing research on a leaked LLM that they developed, but then was leaked, so they can claim that the subsequent research is not tainted by the fruit of the poisonous tree legal doctrine. Or, their torrent client or other software on the same machine had 0-days and they got hacked by someone on the Books3 swarm or knowledgeable of what IPs were connecting to it.
I appreciate your posts and I am replying to you to humbly ask you to post more. :P
I'm not really sure what you are insinuating? You think Meta leaked LAMA so they could claim, legally that they are in the clear for copyright violation? Sorry, I just don't really get what you want me to opine about.
If that is what you are asking, I don't think that's what happened. It's far more likely that it was just leaked or grabbed by a hacker
I just thought the whole situation was interesting. You commented about the current LLM research being clean, while being based on prior LLMs which were perhaps less clean, so I thought that it was a curious coincidence how torrents kept popping up.
All written text is copyrighted, with few exceptions like court transcripts. I own the copyright to this inane comment. I sincerely doubt that all copyrighted material is scrubbed.
There is no way to decompile an LLM's weights and obtain a somewhat meaningful, reproducible source, like with a program binary as you say. In fact, if we were to compare both in this way that would make a program binary more "open source".
These names show a "right to forget" notice when you google them. Crashing or completely bailing out when they're about to be generated by the LLM sure looks like a glitch, or somebody forgot that multiple people can have the same name. Anyways, these are now being Streissand efected due to this behavior, the opposite of the intended result of the legislation.