Hacker News new | past | comments | ask | show | jobs | submit login

When you perform lossy compression, you feed it one file at a time, not every file in existence.



If you concatenate images into a stream container (say as tar) and then compress the stream, the compression coding will (generally) cross over the individual images. True, that's generally not lossy compression.

But concatenating images is also how you create video. Lossy video compression does typically cross over frames. So I don't actually see a difference. If you want to think about mkv or mp4 instead of zip it's still the same concept.

There's nothing stopping you from putting every available image into a video and figuring out how to compress it lossily.

Maybe there's some bounds for how much information was lost? Obviously piping everything into /dev/null destroys the input. And piping /dev/random from a true random source creates information. So somewhere between that and lossless compression there's the nebulous "plagarism" threshold. And then there's another threshold that is copyright infringement that's considered "fair use".

But the general structure of the "AI" this is about are fundamentally storage and retrieval.


What does any of this have to do with creating a new expression?


What makes anything new? Is anything created by "AI" actually new? How much entropy is in a prompt vs in the output?


>What makes anything new?

In copyright law? It's not being a copy




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: