The GPL license is license-viral, the BSD license is just attribution-viral, as I understand it.
For BSD licensed code, you must reproduce the license and attribution for that code, but the rest of the code can be licensed as you want. For GPL code there are additional requirements.
I don't think so, which makes the resulting amalgamation in violation of the original source in that respect for both of them. Beyond that, even if you add the license and attribution manually, the GPL is still not satisfied, and may require the entire source it was used in be released depending on whether you've shipped that product to customers or used it in specific ways. That's an entirely different class of problem than just needing to go back and add some comments to your code to note where some of it came from.
Yes, technically they do. But if the problem is discovered, it is very easy to comply with the BSD terms by adding proper attribution. With GPLed code it is much more difficult.
The default shouldn't be "you can violate people's licenses until you notice or get caught". Sure, in practice it probably won't cause anyone legal trouble: it would be really hard for, say, the author of some BSD-licensed code to prove that a user of Copilot actually caused them monetary damages by not providing attribution.
But this isn't really a world I want to live in; we shouldn't treat other people's work with such entitlement. Copilot should be able to trace the provenance of a suggestion it makes, and notify the author of their licensing obligations.
If that's not possible, then Copilot should not exist.
> If that's not possible, then Copilot should not exist
Github and Microsoft should have trained it on their own code because they could make it public domain. Using other peoples code with various licences and many different requirements is just going to be a complete shitshow.
All of these issues have existed since GPT-2, maybe earlier. I remember the AI dungeon guys narrowly avoided a lawsuit from training on copyrighted novels. I find it amusing that the tech community was very pro GPT, but anti-copilot. Now that it affects them personally. Especially ironic because this place is usually very skeptical of copyright matters.
All AIs are trained on copyrighted data scraped from the internet. What you guys want effectively amounts to making most AI illegal. At least outside of big tech companies with large private datasets. Is that the world you prefer to live in?
Yes, in this case the licence is violated by default, by all code that CoPilot generates.
The point was that you can fix attribution errors pretty simply once they are uncovered, its a lot harder to pull GPL code out of your project to prevent you from having to relicence.
I don’t see how the GPL is special in the discussion around Copilot.