
What if I train an AI model on exactly one copyrighted work and all it does is spit that work back out?

e.g., if I upload Marvels_Avengers.mkv.onnx and it reliably reproduces the original (after all, it's just a fact that the first byte of the original file is 0xF0, etc.)






A work that is “substantially similar” to a copyrighted work infringes that work, under US law, no matter how it was produced. (Note: Some exceptions apply, and you have to read a lot of cases to get an idea of what courts find “substantially similar”.)

> no matter how it was produced

IIRC, this is wrong. Independent creation is a valid (but almost impossible to prove) defense in US copyright law.

This example is not an independent creation, but your reasoning seems wrong.


If the sole purpose of your model is to copy a work, then that's copyright infringement.

Oh, in this case, the model can either reproduce the work exactly, or it can play tic-tac-toe depending on how you prompt it.

We can change "sole purpose" to "primary purpose", and I'd argue something that happens 50% of the time counts as a primary purpose.


