Unauthorized copying (aka pirating) is definitely a copyright violation.
That appears to be a huge problem with the large models and training. They don't secure legal access to the materials they train on, and thus fail to compensate authors for their work.
AKA students are required to buy or otherwise obtain legal access to their text books(like checking the book out of the library).
Training AI should play the same rules humans students have to follow.
Obtaining copies of pirated works is not infringement. Unauthorized sharing is infringement but being on the receiving end of sharing is not (even if one is an active participant).
Unauthorized copying (aka pirating) is definitely a copyright violation.
That appears to be a huge problem with the large models and training. They don't secure legal access to the materials they train on, and thus fail to compensate authors for their work.
AKA students are required to buy or otherwise obtain legal access to their text books(like checking the book out of the library).
Training AI should play the same rules humans students have to follow.