Nightshade: An offensive tool for artists against AI art generators

ink404 · 2024-01-19T17:48:58 1705686538

Paper is here: https://arxiv.org/abs/2310.13828

542458 · 2024-01-20T15:38:26 1705765106

This seems to introduce levels of artifacts that many artists would find unacceptable: https://twitter.com/sini4ka111/status/1748378223291912567

The rumblings I'm hearing are that this a) barely works with last-gen training processes b) does not work at all with more modern training processes (GPT-4V, LLaVA, even BLIP2 labelling [1]) and c) would not be especially challenging to mitigate against even should it become more effective and popular. The Authors' previous work, Glaze, also does not seem to be very effective despite dramatic proclamations to the contrary, so I think this might be a case of overhyping an academically interesting but real-world-impractical result.

[1]: Courtesy of /u/b3sn0w on Reddit: https://imgur.com/cI7RLAq https://imgur.com/eqe3Dyn https://imgur.com/1BMASL4

kmeisthax · 2024-01-20T23:31:43 1705793503

The screenshots you sent in [1] are inference, not training. You need to get a Nightshaded image into the training set of an image generator in order for this to have any effect. When you give an image to GPT-4V, Stable Diffusion img2img, or anything else, you're not training the AI - the model is completely frozen and does not change at all[0].

I don't know if anyone else is still scraping new images into the generators. I've heard somewhere that OpenAI stopped scraping around 2021 because they're worried about training on the output of their own models[1]. Adobe Firefly claims to have been trained on Adobe Stock images, but we don't know if Adobe has any particular cutoffs of their own[2].

If you want an image that screws up inference - i.e. one that GPT-4V or Stable Diffusion will choke on - you want an adversarial image. I don't know if you can adversarially train on a model you don't have weights for, though I've heard you can generalize adversarial training against multiple independent models to really screw shit up[3].

[0] All learning capability of text generators come from the fact that they have a context window; but that only provides a short term memory of 2048 tokens. They have no other memory capability.

[1] The scenario of what happens when you do this is fancifully called Habsburg AI. The model learns from it's own biases, reinforcing them into stronger biases, while forgetting everything else.

[2] It'd be particularly ironic if the only thing Nightshade harms is the one AI generator that tried to be even slightly ethical.

[3] At the extremes, these adversarial images fool humans. Though, the study that did this intentionally only showed the images for a small period of time, the idea being that short exposures are akin to a feed-forward neural network with no recurrent computation pathways. If you look at them longer, it's obvious that it's a picture of one thing edited to look like another.

scheeseman486 · 2024-01-21T04:58:29 1705813109

Hey you know what might not be AI generated post-2021? Almost everything run through Nightshade. So given it's defeated, which is pretty likely, artists have effectively tagged their own work for inclusion.

hkt · 2024-01-21T06:01:11 1705816871

It is a great shame that we have come to a no-win situation for artists when VCs are virtually unable to lose.

ToucanLoucan · 2024-01-21T15:48:23 1705852103

I mean that's more or less status quo isn't it? Big business does what it wants, common people can get fucked if they don't like it. Same as it ever was.

hkt · 2024-01-21T19:36:31 1705865791

That's exactly right. It is just the variety of new ways in which common people get fucked that is dispiriting, with seemingly nothing capable of moving in the opposite direction.

kmeisthax · 2024-01-21T17:58:57 1705859937

Why wouldn't an artist just generate AI spam and Nightshade it?

visarga · 2024-01-21T17:28:39 1705858119

Modern generative image models are trained on curated data, not raw internet data. Sometimes the captions are regenerated to fit the image better. Only high quality images with high quality descriptions.

og_kalu · 2024-01-22T04:19:15 1705897155

I wouldn't call what Stable Diffusion et al are trained on "high quality". You need only look through the likes of LAION to see the kind of captions and images they get trained on.

It's not random but it's not particularly curated either. Most of the time, any curation is done afterwards.

visarga · 2024-01-22T16:28:49 1705940929

Have you seen the BLIP paper? It's a bit old now, but it introduced a curation method.

https://arxiv.org/abs/2201.12086

KTibow · 2024-01-21T05:23:11 1705814591

Correct me if I'm wrong but I understand image generators as relying on auto-labeled images to understand what means what, and the point of this attack to make the auto-labelers mislabel the image, but as the top-level comment said it's seemingly not tricking newer auto-labelers.

michaelbrave · 2024-01-21T09:56:45 1705831005

not all are auto labelled, some are hand labelled, some are initially labelled with something like clip/blip/booru and then corrected a bit by hand. The newest thing though is using llm's with image support like GPT4 to label the images, which kind of does a much better job most of the time.

Your understanding of the attack was the same as mine, it injects just the right kinds of pixels to throw off the auto-labellers to misdirect what they are directing causing the tags to get shuffled around.

Also on reddit today some of the Stable Diffusion users are already starting to train using Nightshade so they can implement it as a negative model, which might or might not work, will have to see.

webmaven · 2024-01-21T05:00:43 1705813243

Even if no new images are being scraped to train the foundation text-to-image models, you can be certain that there is a small horde of folk still scraping to create datasets for training fine-tuned models, LoRAs, Textual Inversions, and all the new hotness training methods still being created each day.

GaggiX · 2024-01-21T06:26:51 1705818411

If it doesn't work during inference I really doubt it will have any intended effect during training, there is simply too much signal and the added adversarial noise works on the frozen and small proxy model they used (CLIP image encoder I think) but it doesn't work on a larger model and trained on a different dataset, if there is any effect during training it will probably just be the model learning that it can't take shortcuts (the artifacts working on the proxy model showcase gaps in its visual knowledge).

Generative models like text-to-image have an encoder part (it could be explicit or not) that extract the semantic from the noised image, if the auto-labelers can correctly label the samples then the encoded trained on both actual and adversarial images will learn to not take the same shortcuts that the proxy model has taken making the model more robust, I cannot see an argument where this should be a negative thing for the model.

ptdn · 2024-01-21T04:55:27 1705812927

The context windows of LLMs are now significantly larger than 2048 tokens, and there are clever ways to autopopulate context window to remind it of things.

jerbear4328 · 2024-01-21T03:49:43 1705808983

[3] sounds really interesting - do you have a link?

ittseta · 2024-01-21T04:48:15 1705812495

https://www.nature.com/articles/s41467-023-40499-0 https://deepmind.google/discover/blog/images-altered-to-tric...

Study on the Influence of Adversarial Images on Human Perception

brucethemoose2 · 2024-01-20T18:44:15 1705776255

Yeah. At worst a simple img2img diffusion step would mitigate this, but just eyeballing the examples, traditional denoisers would probably do the job?

Denoising is probably a good preprocessing step anyway.

achileas · 2024-01-21T18:28:24 1705861704

It’s a common preprocessing step and I believe that’s how glaze (this lab’s previous work) was defeated.

pimlottc · 2024-01-20T19:22:17 1705778537

I can’t really see any difference in those images on the Twitter example when viewing it on mobile

vhcr · 2024-01-20T19:57:50 1705780670

The animation when you change images makes it harder to see the difference, I opened the three images each in its own tab and the differences are more apparent when you change between each other instantly.

SirMaster · 2024-01-21T01:14:00 1705799640

But that’s not realistic?

If you have to have both and instantly toggle between them to notice the difference, then it sounds like it’s doing its job well and is hard to notice the difference.

bowsamic · 2024-01-21T08:45:45 1705826745

What kind of artist is not going to be bothered with seeing huge artifacting on their work? Btw for me it was immediately noticeable even on mobile

SirMaster · 2024-01-22T14:57:49 1705935469

If it's huge, then why are multiple people commenting that they don't see a difference?

RedXPlex · 2024-01-22T20:28:04 1705955284

Kid me found 13 FPS in games to be a smooth and cursive experience. Current me thinks 60 FPS is laggy.

Standards differ. I saw glazed images in the wild, was wondering why they have so much JPEG artifacts, until I saw the post of one of those anti-AI + glaze images on his profile.

bowsamic · 2024-01-22T19:18:59 1705951139

That is a great mystery, to me it's as clear as if someone pasted a cartoon dog onto the image, it's extremely blatant and impossible to ignore by my normal human pattern recognition.

SirMaster · 2024-01-22T20:39:29 1705955969

I'm looking at them on my iPhone 14 Pro and I am having a hard time seeing any meaningful difference that changes the way the artwork registers with me.

I can't really imagine a case where if I had only seen the AI edited one I would have any different reaction or response to viewing the piece of art compared to having only seen the original one.

battles · 2024-01-21T05:09:15 1705813755

The person who drew it would definitely notice.

dontupvoteme · 2024-01-20T20:29:23 1705782563

One of the few times a 'blink comparator' feature in image viewers would be useful!

fenomas · 2024-01-21T04:36:16 1705811776

At full size it's super obvious - I made a side-by-side:

https://i.imgur.com/I6EQ05g.png

trimethylpurine · 2024-01-21T06:35:16 1705818916

I still don't see a difference. (Mobile)

fenomas · 2024-01-21T06:52:37 1705819957

Here's a maybe more mobile friendly comparison:

https://i.imgur.com/zUVn8rt.png

But now that I double-check, I was comparing with the images zoomed to 200%. On desktop the artifacts are also noticeable at 100%, but not nearly as bad as in my previous comment.

Rewrap3643 · 2024-01-21T12:34:08 1705840448

Have you done a color blindness test before? Red-green is the most common type and the differences here are mostly shades of green.

trimethylpurine · 2024-01-23T02:02:12 1705975332

I typically read HN in bed where the brightness is at the minimum setting. I turned the brightness up and I see it.

Detrytus · 2024-01-21T22:14:52 1705875292

Second picture looks like you were looking at it through a dirty window, there's lot of pale white stains, or light reflections, it's really blurry.

blincoln · 2024-01-22T22:34:09 1705962849

Look at the forehead and arms. The processed version looks like it's been run through a posterization filter.

bowsamic · 2024-01-21T08:46:25 1705826785

What phone are you using? It’s extremely obvious on my iPhone

josefx · 2024-01-20T22:31:09 1705789869

Something similar to jpeg artifacts on any surface with a normally smooth color gradient, in some cases rather significant.

0xcde4c3db · 2024-01-20T19:42:04 1705779724

I didn't see it immediately either, but there's a ton of added noise. The most noticeable bit for me was near the standing person's bent elbow, but there's a lot more that becomes obvious when flipping back and forth between browser tabs instead of swiping on Twitter.

Keyframe · 2024-01-20T19:37:56 1705779476

look at the green drapes to the right, or any large uniform colored space. It looks similar to bad JPEG artifacts.

pxc · 2024-01-20T19:36:25 1705779385

I don't have great vision, but me neither. They're indistinguishable to me (likewise on mobile).

Gigachad · 2024-01-21T03:15:36 1705806936

I was on desktop and it looks like pretty heavy jpeg compression. Doesn't completely destroy the image, but it's pretty noticeable when blown up large enough.

jquery · 2024-01-21T04:10:34 1705810234

It's really noticeable on desktop, like compressing an 800kb jpeg to 50kb. Maybe on mobile you won't notice, but on desktop the image looks blown out.

milsorgen · 2024-01-20T19:37:52 1705779472

It took me a minute too but on the fast you can see some blocky artifacting by the elbow and a few spots elsewhere like curtain upper left.

charcircuit · 2024-01-20T20:21:01 1705782061

The gradient on the bat has blocks in it instead of being smooth.

gedy · 2024-01-20T19:03:46 1705777426

Maybe it's more about "protecting" images that artists want to publicly share to advertise work, but it's not appropriate for final digital media, etc.

sesm · 2024-01-20T19:20:14 1705778414

In short, anti-AI watermark.

johnnyanmac · 2024-01-20T23:15:33 1705792533

Yeah. It may mess with the artist's vision but the impact is still way more subtle than other methods used to protect against these unwanted actions.

Of course I'm assuming it works to begin with. Sounds like a game of cat and mouse. And AI has a lot of rich cats.

kjs3 · 2024-01-21T23:59:01 1705881541

Seems obvious that the people stealing would be adjusting their process to negate these kinds of countermeasures all the time. I don't see this as an arms race the artists are going to win. Not like the LLM folks can consider actually paying their way...the business plan pretty much has "...by stealing everything we can get our hands on..." in the executive summary.

h0p3 · 2024-01-21T02:32:27 1705804347

Sir /u/b3nsn0w is courteous, `/nod`.

GaryNumanVevo · 2024-01-20T19:49:37 1705780177

The artifacts are a non-issue. It's intended images with nightshade are intended to be silently scrapped and avoid human filtering.

minimaxir · 2024-01-20T20:14:51 1705781691

The artifacts are extremely an issue for artists who don't want their images damaged for the possibility of them not being trained by AI.

It's a bad tradeoff.

GaryNumanVevo · 2024-01-20T20:20:26 1705782026

Nightshaded images aren't intended for portfolios. They're mean to be uploaded enmasse and scraped later.

AJ007 · 2024-01-20T21:15:52 1705785352

To where? A place no one sees them and they aren't scraped?

filleduchaos · 2024-01-20T21:30:33 1705786233

I think the point is that they're akin to a watermark.

Even before the current AI boom, plenty of artists have wanted to showcase their work/prove that it exists without necessarily making the highest quality original file public.

Diti · 2024-01-20T21:52:58 1705787578

Most serious artists I know (at least in my community) release their high-quality images on Patreon or similar.

pgeorgi · 2024-01-20T22:07:12 1705788432

For example in accounts on image sites that are exposed to suspected scrapers but not to others. Scrapers will still see the real data, but they'll also run into stuff designed to mix up the training process.

the8472 · 2024-01-20T22:00:51 1705788051

do you mean scrapped or scraped?

GaryNumanVevo · 2024-01-20T22:33:06 1705789986

scraped

soulofmischief · 2024-01-21T02:34:49 1705804489

> The artifacts are a non-issue.

According to which authority?

gfodor · 2024-01-20T19:53:47 1705780427

Huge market for snake oil here. There is no way that such tools will ever win, given the requirements the art remain viewable to human perception, so even if you made something that worked (which this sounds like it doesn’t) from first principles it will be worked around immediately.

The only real way for artists or anyone really to try to hold back models from training on human outputs is through the law, ie, leveraging state backed violence to deter the things they don’t want. This too won’t be a perfect solution, if anything it will just put more incentives for people to develop decentralized training networks that “launder” the copyright violations that would allow for prosecutions.

All in all it’s a losing battle at a minimum and a stupid battle at worst. We know these models can be created easily and so they will, eventually, since you can’t prevent a computer from observing images you want humans to be able to observe freely.

AJ007 · 2024-01-20T21:20:12 1705785612

The level of claims accompanied by enthusiastic reception from a technically illiterate audience make it sound, smell, and sound like snake oil without much deep investigation.

There is another alternative to the law. Provide your art for private viewing only, and ensure your in person audience does not bring recording devices with them. That may sound absurd, but it's a common practice during activities like having sex.

Gormo · 2024-01-21T04:08:48 1705810128

That doesn't sound like a viable business model. There seems to be a non-trivial bootstrap problem involved -- how do you become well-known enough to attract audiences to private venues in sufficient volume to make a living? -- and would in no way diminish demand for AI-generated artwork which would still continue to draw attention away from you.

wraptile · 2024-01-21T06:15:36 1705817736

The thing is people want the benefits of having their stuff public but not bear the costs. Scraping has been mostly a solved problem especially when it comes to broad crawling. Put it under a login, there, no more AI "stealing" your work.

csydas · 2024-01-21T08:50:31 1705827031

I don't think that's true at all. Images and text get reposted with or without consent, often without attribution. It wouldn't make it right for the AI companies to scrape when the original author doesn't want that but someone else has ignored their wishes and requirements. Basically, what good is putting your stuff behind login or some other restrictive viewing method if someone just saves the image/text? I think it's still a relatively serious problem for people creating things. And without some form of easy access to viewing, the people creating things don't get the visibility and exposure they need to get an audience/clients.

This is one the AI companies should offer the olive branch on IMO, there must be a way to use stenography to transparently embed a "don't process for AI" code into an image or text or music or any other creative work that won't be noticeable by humans, but the AI would see if it tried to process the content for training. I think it would be a very convenient answer and probably not be detrimental to the AI companies, but I also imagine that the AI companies would not be very eager to spend the resources implementing this. I do think they're the best source for such protections for artists though.

Ideally, without a previous written agreement for a dataset from the original creators, the AI companies probably shouldn't be using it for training at all, but I doubt that will happen -- the system I mention above should be _opt-in_, that is, you must tag such content that is free to be AI trained in order for AI to be trained on it, but I have 0 faith that the AI companies would agree to such a self-limitation.

edit: added mention to music and other creative works in second paragraph 1st sentence

edit 2: Added final paragraph as I do think this should be opt-in, but don't believe AI companies would ever accept this, even though they should by all means in my opinion.

amlib · 2024-01-21T12:22:25 1705839745

Here are my 2 cents, I think we will need some laws specifying two types of AI models, ones trained with full consent (opt-in) for its training material and ones without. The first one would be like Adobe's firefly model where they allegedly own everything they trained it with, or something where you go around asking for consent for each thing in your training corpus (probably unfeasible for large models). Maybe things in the public domain would be ok to train with. In this case there are no restrictions and the output from such models can even be copyrighted.

Now for the second type, representing models such as Stable Difusion and Chat GPT, it would be required to have their trained model freely available to anyone and any resulting output would not be copyrightable. It may be a more fairer way of allowing anyone to harness the power of AI models that contain essentially the knowledge of all man kind, but without giving any party an unfair monopoly on it.

This should be easily enforceable for big corporations, else it would be too obvious if they are trying to pass one type model as another or even keep the truth about their model from leaking. It might not be as easy to keep small groups or individuals from breaking those rules, but hey, at least it evens the playing field.

946789987649 · 2024-01-21T08:38:45 1705826325

Is that login statement strictly true? Unless the login is paid, there's no reason we can't get to (if not already there) the point where the AI scraper can just create a login first.

wraptile · 2024-01-21T12:08:47 1705838927

No, eforcing click-wrap legal agreements is actually possible. With basic KYC the scraper would instantly open up itself for litigation and no internet art piece is frankly worth this sort of trouble.

Tade0 · 2024-01-21T08:51:45 1705827105

But then you can rate-limit to a point where scraping everything will take a considerable amount of time.

Of course the workaround would be to have multiple accounts, but that in turn can be made unscalable with a "prove you're human" box.

csydas · 2024-01-21T09:02:36 1705827756

you are not incorrect that this would help mitigate, but it still misses a few key points I think regarding why artists are upset about AI generation

- This is still vulnerable to stuff like mturk or even just normal users who did get past the anti-bot things pulling and re-uploading the content elsewhere that is easier for the AI companies to use

- The artists' main contention is that the AI companies shouldn't be allowed to just use whatever they find without confirm they have a license to use the content in this way

- If someone's content _does_ get into an AI model and it's determined somehow (I think there is a case with a news paper and chatGPT over this very issue?), the legal system doesn't really have a good framework for this situation right now -- is it copyright infringement? (arguably not? it's not clear) is it plagiarism? (arguably yes, but plagiarism in US court system is very hard to proof/get action on) is it license violation? (for those who use licenses for their art, probably yes, but it's the same issue as plagiarism -- how to prove it effectively?)

Really what this comes down to is that the AI companies use the premise that they have a right to use someone else's works without consent for the AI training. While your suggestions are technically correct, it puts the impetus on the artists that they must do something different because the AI companies are allowed to train their models as they currently do without recourse for the original artist. Maybe that will be ruled true in the future I don't know, but I can absolutely get why artists are upset about this premise shaping the discussion on AI training, as such a premise negates their rights as an artist and many artists have 0 path for recourse. I'm pretty sure that OpenAI wouldn't think about scraping a Disney movie from a video upload site just because it's open access since Disney likely can fight in a more meaningful way. I would agree with artists who are complaining that they shouldn't need to wait for a big corporation to decide that this behavior is undesirable before real action is taken, but it seems that is going to be what is needed. It might be reality, but it's a very sad reality that people want changed.

Art9681 · 2024-01-21T01:46:31 1705801591

This would just create a new market for art paparazzis who would find any and all means to inflitrate such private viewings with futuristic miniature cameras and other sensors and selling it for a premium. Less than 24 hours later the files end up on hundreds or thousands of centralized and decentralized servers.

I'm not defending it. Just acknowledging the reality. The next TMZ for private art gatherings is percolating in someone's garage at the moment.

jurassic · 2024-01-21T10:53:11 1705834391

I find this difficult to believe; no matter how small your camera is, photography is about light. Art reproduction photography is surprisingly hard to do if you care about the quality of the end result. Unless you can surreptitiously smuggle in a studio lighting setup, tripod, and color checker card… sure you can take an image in secret, but not one that is a good representation of the real thing.

nihilius · 2024-01-22T09:36:42 1705916202

You could just build a stabilizer system and stand really still for 1 second. Then expose for a longer time. Photography is Apertrue, ISO, and exposure time. This will gather enough light to do a proper exposure even in a dimm lit venue. Anything darker and every viewer will have a hard time seeing the private art. ANother thing would be to crank up the ISO and denoise it later. Its much more lossy but with this you could get lower exposure times.

jurassic · 2024-01-23T06:14:39 1705990479

I hear what you’re saying, but I think maybe we just have different standards for what counts as acceptable quality.

Robotbeat · 2024-01-22T06:00:06 1705903206

It’s about number of photons and aperture. In principle this could be very hard to detect, especially once people get good at multiple distributed apertures that are coherent with one another.

offices · 2024-01-22T11:16:06 1705922166

For comparison, The Ocean Full of Bowling Balls did this and successfully remained hidden for half a century, only failing because it was published somewhere else.

gfodor · 2024-01-20T21:32:54 1705786374

True I can imagine that kind of thing becoming popular.

thfuran · 2024-01-20T21:53:23 1705787603

>There is no way that such tools will ever win, given the requirements the art remain viewable to human perception

On the other hand, the adversarial environment might push models towards a representation more aligned with human perception, which is neat.

aqfamnzc · 2024-01-21T00:54:40 1705798480

The ol' Analog Gap. https://en.m.wikipedia.org/wiki/Analog_hole

Reubend · 2024-01-21T01:05:52 1705799152

> Huge market for snake oil here.

This tool is free, and as far as I can tell it runs locally. If you're not selling anything, and there's no profit motive, then I don't think you can reasonably call it "snake oil".

At worst, it's a waste of time. But nobody's being deceived into purchasing it.

autoexec · 2024-01-21T05:55:25 1705816525

If this is a danger from "snake oil" of this type, it'd be from the other side, where artists are intentionally tricked into believing that tools like this mean that AI isn't or won't be a threat to their copyrights in order to get them to stop opposing it so strongly, when in fact the tool does nothing to prevent their copyrights from being violated.

I don't think that's the intention of Nightshade, but I wouldn't put past someone to try it.

Biganon · 2024-01-21T11:39:25 1705837165

There's an academic paper being published.

Snake oil for the sake of getting published is a very real problem that does exist.

golol · 2024-01-21T14:32:15 1705847535

Religion is also deceptive and snake-oil even if it does not involve profit driven motivations.

NoahKAndrews · 2024-01-21T22:40:55 1705876855

It very often does involve such motivations, though I agree with your larger point.

golol · 2024-01-22T10:50:25 1705920625

I should've said even when it does not involve... English can be funny.

spaceman_2020 · 2024-01-21T07:07:31 1705820851

This is the hard reality. There is no putting this genie back in the bottle.

The only way to be an artist now is to have a unique style of your own, and to never make it online.

hutzlibu · 2024-01-21T08:37:18 1705826238

"and to never make it online."

So then of course, you also cannot sell your work, as those might put it online. And you cannot show your art to big crowds, as some will make pictures and put it online. So ... you can become a literal underground artists, where only some may see your work. I think only some will like that.

But I actually disagree, there are plenty of ways to be an artist now - but most should probably think about including AI as a tool, if they still want to make money. But with the exception of some superstars, most artists are famously low on money - and AI did not introduce this. (all the professional artists I know, those who went to art school - do not make their income with their art)

BeFlatXIII · 2024-01-21T13:27:58 1705843678

GP almost certainly mean "make physical art." Pictures of that can get online, but it's not the real thing.

sabedevops · 2024-01-21T10:28:02 1705832882

Can you elaborate on how they supplement their income?

hutzlibu · 2024-01-21T13:53:02 1705845182

Every other source of income? So other, art-unrelated jobs.

jedberg · 2024-01-21T05:14:21 1705814061

Everything old is new again. It's the same thing with any DRM that happens on the client side. As long as it's viewable by humans, someone will figure out a way to feed that into a machine.

honkycat · 2024-01-21T18:54:29 1705863269

"A law, ie, leveraging state backed violence to deter the things they don’t want."

We all know what a law is you don't need to clarify. It makes your prose less readable.

gfodor · 2024-01-21T23:42:28 1705880548

Other people pointed out they appreciated this prose. It’s easy to forget what exactly people are asking for when they talk about regulating the training of machine learning models.

jMyles · 2024-01-21T16:54:23 1705856063

> leveraging state backed violence to deter the things they don’t want

I just want to say: I really appreciate the stark terms in which you've put this.

The thing that has come to be called "intellectual property" is actually just a threat of violence against people who arrange bytes in a way that challenges power structures.

mihaaly · 2024-01-22T09:44:42 1705916682

I heard that flooding the net with AI generated art would do much much more harm to generative AI than this whatever is this. Yes, this must be some snake oil salesman, those take it seriously turn AIs own weapon against AI.

vmirnv · 2024-01-21T18:43:40 1705862620

I'm thinking — is it possible to create something on a global level similar to what they did in Snapchat: some sort of image flickering that would be difficult to parse, but still acceptable for humans?

nihilius · 2024-01-22T09:45:56 1705916756

Sorry i do not use Snapchat and with googeling "Snapchat image flickering" i did not find a good result. Could you elaborate this a bit more or provide me with a link where this is described? Thank you very much. :)

int_19h · 2024-01-23T04:56:53 1705985813

If humans can process it, you can train a model to do the same.

elzbardico · 2024-01-22T00:25:16 1705883116

You don’t need it to visible. You only need it to be scrapped to poison the models. I think that’s the idea.

AlfeG · 2024-01-21T05:45:49 1705815949

My guess. Is that at some poi t of time You will not be able to use any generated image or video in commercial. Because of 100% copyright claim for using parts of copyrighted image. Like YouTube those days. When some random beeps matches with someone music...

abrarsami · 2024-01-21T06:16:13 1705817773

It should be like that. I agree

minimaxir · 2024-01-20T20:03:17 1705780997

A few months ago I made a proof-of-concept on how finetuning Stable Diffusion XL on known bad/incoherent images can actually allow it to output "better" images if those images are used as a negative prompt, i.e. specifying a high-dimensional area of the latent space that model generation should stay away from: https://news.ycombinator.com/item?id=37211519

There's a nonzero chance that encouraging the creation of a large dataset of known tampered data can ironically improve generative AI art models by allowing the model to recognize tampered data and allow the training process to work around it.

smrtinsert · 2024-01-24T02:49:16 1706064556

Great lora post, thanks for sharing this again! Not sure how I missed as I'm especially interested in sd content.

eigenvalue · 2024-01-20T22:30:25 1705789825

This seems like a pretty pointless "arms race" or "cat and mouse game". People who want to train generative image models and who don't care about what artists think about it at all can just do some basic post-processing on the images that is just enough to destroy the very carefully tuned changes this Nightshade algorithm makes. Something like resampling it to slightly lower resolution and then using another super-resolution model on it to upsample it again would probably be able to destroy these subtle tweaks without making a big difference to a human observer.

In the future, my guess is that courts will generally be on the side of artists because of societal pressures, and artists will be able to challenge any image they find and have it sent to yet another ML model that can quickly adjudicate whether the generated image is "too similar" to the artist's style (which would also need to be dissimilar enough from everyone else's style to give a reasonable legal claim in the first place).

Or maybe artists will just give up on trying to monetize the images themselves and focus only on creating physical artifacts, similar to how independent musicians make most of their money nowadays from touring and selling merchandise at shows (plus Patreon). Who knows? It's hard to predict the future when there are such huge fundamental changes that happen so quickly!

johnnyanmac · 2024-01-20T23:41:22 1705794082

>Or maybe artists will just give up on trying to monetize the images themselves and focus only on creating physical artifacts, similar to how independent musicians make most of their money nowadays from touring and selling merchandise at shows (plus Patreon).

As is, art already isn't a sustainable career for most people who can't get a job in industry. The most common monetization is either commissions or hiding extra content behind a pay wall.

To be honest I can see more proverbial "Furry artists" sprouting up in a cynical timeline. I imagine like every other big tech that the 18+ side of this will be clamped down hard by the various powers that be. Which means NSFW stuff will be shielded a bit by the advancement and you either need to find underground training models or go back to an artist. .

Gigachad · 2024-01-21T03:20:17 1705807217

>need to find underground training models

It's not particularly that hard. The furry nsfw models are already the most well developed and available models you can get right now. And they are spitting out stuff that is almost indistinguishable from regular art.

raincole · 2024-01-21T04:46:17 1705812377

> This seems like a pretty pointless "arms race" or "cat and mouse game".

If there is any "point" of this, it's that's going to push the AI models to become better at capturing how humans see things.

jMyles · 2024-01-21T16:57:43 1705856263

> musicians make most of their money nowadays from touring and selling merchandise at shows

Be reminded that this is - and has always been - the mainstream model of the lineages of what have come to be called "traditional" and "Americana" and "Appalachian" music.

The Grateful Dead implemented this model with great finesse, sometimes going out of their way to eschew intellectual property claims over their work, in the belief that such claims only hindered their success (and of course, they eventually formalized this advocacy and named it "The Electronic Frontier Foundation" - it's no coincidence that EFF sprung from deadhead culture).

mihaaly · 2024-01-22T09:40:33 1705916433

It is a funny appearance (weird viewpoint) that artists are furious loosing their monopily in stealing and cloning components from other artists, recomposing into a similar but new thing.

And that OpenArt on the analogy of OpenSource is a non-existing thing (I know, I know, different things, source code is not for the generic audience and can be hidden on will, unlike art, just having some generative thoughts artefact here ;) )

hackernewds · 2024-01-20T22:34:07 1705790047

the point is you could circumvent one nightshade, but as long as the cat and mouse game continues there can be more

marcinzm · 2024-01-20T19:33:50 1705779230

This feels like it'll actually help make AI models better versus worse once they train on these images. Artists are basically, for free, creating training data that conveys what types of noise does not change the intended meaning of the image to the artist themselves.

r3trohack3r · 2024-01-20T19:44:56 1705779896

The number of people who are going to be able to produce high fidelity art with off the shelf tools in the near future is unbelievable.

It’s pretty exciting.

Being able to find a mix of styles you like and apply them to new subjects to make your own unique, personalized, artwork sounds like a wickedly cool power to give to billions of people.

kredd · 2024-01-20T22:06:12 1705788372

In terms of art, population tends to put value not on the result, but origin and process. People will just look down on any art that’s AI generated in a couple of years when it becomes ubiquitous.

petesergeant · 2024-01-21T02:04:34 1705802674

> population tends to put value not on the result, but origin and process

I think population tends to value "looks pretty", and it's other artists, connoisseurs, and art critics who value origin and process. Exit Through the Gift Shop sums this up nicely

Aerroon · 2024-01-21T04:08:07 1705810087

I disagree. I definitely value modern digital art more than most historical art, because it just looks better. If AI art looks better (and in some cases it does) then I'll prefer that.

kredd · 2024-01-21T06:17:48 1705817868

That’s totally fine, everyone’s definition of art is subjective. But general value of an art as a piece will just still be zero for AI generated ones, just like any IKEA / Amazon print piece. You just pay for the “looks pretty”, frame and paper.

Aerroon · 2024-01-21T08:47:05 1705826825

>You just pay for the “looks pretty”, frame and paper.

But you pay that for any piece of art though? You appreciate it because you like what it looks like. The utility of it is in how good it looks, it's not how much effort was put into it.

If you need a ditch you're not going to value the ditch more if the worker dug it by hand instead of using an excavator. You value it based on the utility it provides you.

kredd · 2024-01-21T15:26:32 1705850792

That analogy doesn’t work for art, since worker’s ditch is result based. There are no feelings like “i like this ditch”, “experience of a ditch” or “i’m curious how this ditch was dug”.

Again, i’m not saying buying a mass made AI art will be wrong. Just personally speaking, it will never evoke any feelings other than “looks neat” for me. So its inherent “art value” is close to 0 as I can guess its history is basically someone put in a prompt and sent it to print (which I can do myself on my phone too!). It’s the same as looking at cool building pics on my phone (0 art value) versus actually seeing them in person (non-0), mostly because the feelings I get from it. That being said, if it makes others happy, it’s not my place to judge.

redwall_hp · 2024-01-20T22:36:14 1705790174

This is already the case. Art is a process, a form of human expression, not an end result.

I'm sure OpenAI's models can shit out an approximation of a new Terry Pratchett or Douglas Adams novel, but nobody with any level of literary appreciation would give a damn unless fraud was committed to trick readers into buying it. It's not the author's work, and there's no human message behind it.

torginus · 2024-01-21T11:40:50 1705837250

Thing is there are way more good books written, than any single person can consume in their lifetimes. An average person like me, reading a mixed diet of classics, obscure recommendations and what's popular right now, I still don't feel like I'm making a dent in the pile of high quality written content.

Given all that, the purpose of LLMs should be to create tailor made content to everyone's tastes. However, it seems the hardcore guardrails put into GPT4 and Claude prevent it from generating anything enjoyable. It seems, even the plot of the average Star Wars movie is too spicy for modern LLM sensibilities, never mind something like Stephen King.

int_19h · 2024-01-23T05:01:42 1705986102

That's where you spin up a local LLaMA instance. The largest models that are still runnable on consumer grade hardware actually beat GPT-3.5 at this point. And there are numerous finetunes all over the "spiciness" spectrum.

Aerroon · 2024-01-21T04:11:21 1705810281

Novels aren't about a message. They're entertainment. If the novel is entertaining then it's irrelevant whether there is or isn't a message in it. Besides, literature enthusiasts will invent a message for a popular story even if there never was one.

Also, I'm sure that you can eventually just prompt the model with the message you want to put into the story, if you can't already do that.

rondini · 2024-01-22T14:55:38 1705935338

I sounds like you don't value art as the purest form of human expression but you'll never be able to convince others to think like you with logic. For my part I think you fundamentally misunderstand the value of creativity but I know I won't change your mind either.

Aerroon · 2024-01-22T22:52:28 1705963948

If it was really about the message, then why waste all the time with the rest of the novel? Describe the message in a sentence or two. You could read an entire library of books worth of messages in a few days.

But that wouldn't be helpful. It would've been memorable, because novels aren't just about the message.

portaouflop · 2024-01-21T04:23:08 1705810988

I haven’t read anything “shit out” by any LLM that even nearly approaches the level of quality by the authors you named — would very much like to see something like that - do you have any evidence for your claims?

AFAICT current text generation is something approaching bad mimicry at best and downright abysmal in general. I think you still need a very skilled author and meaty brain with a story to tell to make use of an LLM for storytelling. Sure it’s a useful tool that will make authors more effective but we are far from the point where you tell the LLM “write a story set in Pratchetts Discworld” and something acceptable or even entertaining will be spit out - if such a thing can even be achieved.

Theodores · 2024-01-21T02:58:57 1705805937

https://en.wikipedia.org/wiki/Labor_theory_of_value

According to Marx, value is only created with human labour. This is not just a Marxist theory, it is an observation.

There may be lots of over-priced junk that makes you want to question this idea. But let's not nit-pick on that.

In two years time people will not see any value in AI art, quite correctly because there is not much human labour in creating it.

mesh · 2024-01-21T03:34:15 1705808055

In two years time, no one will know what was created with AI, what was created by humans, or what was created by both.

Gormo · 2024-01-21T04:12:25 1705810345

> According to Marx, value is only created with human labour. This is not just a Marxist theory, it is an observation.

And yet it's completely and absolutely wrong. Value is created by the subjective utility offered to the consumer, irrespective of what inputs created the thing conveying that utility.

thiagoharry · 2024-01-23T18:21:40 1706034100

You are using marginal utility value theory. Parent comment is using labor value theory. In fact, there are also other value theories in economy. It's a mostly philosophical choice, and like other philosophical choices, it's not possible to accuse one of them of being wrong. It's a matter of choosing your philosophy, and understanding different philosophies.

Gormo · 2024-01-25T02:16:42 1706149002

> You are using marginal utility value theory. Parent comment is using labor value theory.

Yes, I'm aware. This is precisely why I'm stating the prior comment to be "absolutely wrong". Marginal utility is a substantially valid model, LTV is not.

> In fact, there are also other value theories in economy. It's a mostly philosophical choice, and like other philosophical choices, it's not possible to accuse one of them of being wrong.

Sure it is. These aren't theories in a normative sense, they're models of causality for manifest phenomena. They're closer to scientific theories than they are to philosophical axioms. LTV simply doesn't bear out with observation.

jquery · 2024-01-21T04:26:20 1705811180

Labor theory of value is quite controversial, many economists call it tautological or even metaphysical. I also don't really see what LTV has to say about AI art, if anything, except that the economic value generated by AI art should be distributed to everybody and not just funneled to a few capitalists at the top. I would agree with that. It's true that more jobs get created even as jobs are destroyed, but it's also true that just as our ancestors fought for a 40 hour work week and a social safety net, we should be able to ask for more as computers become ever so productive.

int_19h · 2024-01-23T05:04:28 1705986268

In Marx's time, you needed humans to perform any kind of labor. Even machines needed operators. But there's nothing about LTV that would make it a hard requirement. The point of Marx's claim is that without someone performing labor using capital, there wouldn't be any value for the owner of said capital to pocket. This is just as true if you replace workers with AI.

petesergeant · 2024-01-21T05:13:05 1705813985

> This is not just a Marxist theory, it is an observation.

Yeah? Well, you know, that's just like uh, your opinion, man

MacsHeadroom · 2024-01-20T22:25:16 1705789516

Nope, but I already look down on artists who refuse to integrate generative AI into their processes.

mplewis · 2024-01-20T22:34:07 1705790047

Can you share some of the art you’ve made with generative AI?

_svoh · 2024-01-20T22:36:14 1705790174

Cool, who are you?

MisterBastahrd · 2024-01-20T22:54:27 1705791267

People who use generative AI in their processes are not artists.

blacklion · 2024-01-20T23:52:18 1705794738

And people who use Photoshop are?

There is somewhat famous digital artist from Russia - Alexey Andreev. Google it, he has very distinctive style of realistic technique and surrealistic situations, like landing big manta ray on the deck of aircraft carrier. Or you can see his old works in his 5-years-not-updates LJ [1].

Now he uses generative AI as one of his tools. As Photoshop, as different (unrealistic!) brushes in Photoshop, as other digital tools. His style is still 100% recognizable and his works don't become worse or more "generic". Is he still artist? I think so.

Where will you draw the line?

[1] - https://alexandreev.livejournal.com/

nottorp · 2024-01-22T13:47:02 1705931222

For every Alexey Andreev there are about 248518 blogs with "AI" generated illustrations that you just scroll past without noticing :)

smackeyacky · 2024-01-21T04:41:06 1705812066

I don’t think this is quite right. I think paraphrasing The Incredibles has a better take:

When everybody is an artist, then nobody will be one.

davely · 2024-01-21T02:37:53 1705804673

I use generative AI to rubber duck and help improve my code.

Am I no longer a software engineer?

password54321 · 2024-01-20T23:27:14 1705793234

This is true. They are just taking a sample from a generated latent space, just like taking a photo of something doesn't make you an artist.

blacklion · 2024-01-20T23:53:20 1705794800

So, there is no artists in, for example, street photography? Picture must be altered to become art, or staged?

Was it irony? :)

password54321 · 2024-01-21T00:01:04 1705795264

They are photographers. Here is the definition of an artist so you can have better clarity on what an artist is:

"A person who creates art (such as painting, sculpture, music, or writing) using conscious skill and creative imagination"

int_19h · 2024-01-23T05:06:40 1705986400

Your definition assumes that photography is not art and/or doesn't involve conscious skill and creative imagination. That's not the consensus, to put it mildly.

password54321 · 2024-01-23T13:13:24 1706015604

Everything is art then and we are all artists. Happy? Also consensus != correct.

aqfamnzc · 2024-01-21T04:04:05 1705809845

I took gp as satire. But maybe not haha.

falcolas · 2024-01-20T22:32:32 1705789952

> Being able to find a mix of styles you like and apply them to new subjects to make your own unique, personalized, artwork sounds like a wickedly cool power to give to billions of people.

And in the process, they will obviate the need for Nightshade and similar tools.

AI models ingesting AI generated content does the work of destroying the models all by itself. Have a look at "Model Collapse" in relation to generative AI.

23B1 · 2024-01-20T20:45:04 1705783504

It'll be about as wickedly tool as the ability to get on the internet, e.g. commoditized, transactional, and boring.

sebzim4500 · 2024-01-20T22:05:18 1705788318

I know this is an unpopular thing to say these days, but I still think the internet is amazing.

I have more access to information now than the most powerful people in the world did 40 years ago. I can learn about quantum field theory, about which pop star is allegedly fucking which other pop star, etc.

If I don't care about the law I can read any of 25 million books or 100 million scientific papers all available on Anna's Archive for free in seconds.

r3trohack3r · 2024-01-21T02:01:42 1705802502

As Jeff Bezos recently said on the Lex podcast: one of the greatest compliments you can give an inventor is that they’re invention will be taken for granted by future generations.

“It won’t be any more wickedly cool than the internet” - saying something won’t be any more wickedly cool than the most profound and impactful pieces of infrastructure human civilization has erected is a pretty high compliment.

__loam · 2024-01-20T20:09:40 1705781380

And we only had to alienate millions of people from their labor to do it.

r3trohack3r · 2024-01-20T20:35:16 1705782916

Absolutely agree we should allow people to accumulate equity through effective allocation of their labor.

And I also agree that we shouldn’t build systems that alienate people from that accumulated equity.

DennisAleynikov · 2024-01-20T20:21:47 1705782107

Yeah, sadly those millions of people don’t matter in the grand scheme of things and were never going to profit off their work long term

easyThrowaway · 2024-01-22T08:53:16 1705913596

Where the "grand scheme of things" are the quarter reports of a few AI-invested companies?

r3trohack3r · 2024-01-20T20:43:15 1705783395

What a bummer of a thing to say.

Those millions/billions of people matter a great deal.

DennisAleynikov · 2024-01-21T02:25:35 1705803935

They matter but not under the current system. Artists are a rarely paid profession, and there are professional artists out there but there’s now a huge amount of people that will never contact an artist for work that used to only be human powered. It’s not personal for me. I understand that desire to resist the inevitable but it’s here now.

For what it’s worth I never use midjourney or dalle or any of the commercial closed systems that steal from artists but I know I can’t stop the masses from going there and inputting “give me pretty picture in style x”

__loam · 2024-01-21T12:11:29 1705839089

Resistance is important imo. If this happens and we, who work in this industry, say nothing, what good are we. It's only inevitable if it's socially acceptable.

mensetmanusman · 2024-01-20T20:21:55 1705782115

Is this utilitarianism?

BeFlatXIII · 2024-01-21T13:33:36 1705844016

Worth it.

password54321 · 2024-01-20T23:22:05 1705792925

Not really. There is a reason why we find realistic painting to be more fascinating than a photo and why some still practice it. The effort put in by another artist does affect our enjoyment.

wruza · 2024-01-21T04:15:42 1705810542

For me it doesn’t. I’m generating images, realistic, 2.5d, 2d and I like them as much. I don’t feel (or miss) what you described. Or what any other arts guy describes, for that matter. Arts people are different, because they were trained to feel something a normal person wouldn’t. And that’s okay, a normal person without training wouldn’t see how much beauty and effort there is in an algorithm or a legal contract as well.

dartharva · 2024-01-21T05:12:08 1705813928

The word "we" is doing a lot of heavy lifting here. A large majority of consumers can't even tell apart AI-generated from handmade, let alone care who or what made the thing.

password54321 · 2024-01-21T12:49:51 1705841391

Yeah, that's just information you made up on the spot.

chris-orgmenta · 2024-01-20T20:34:50 1705782890

I want progressive fees on copyright/IP/patent usage, and worldwide gov cooperation/legislation (and perhaps even worldwide ability to use works without obtaining initial permission, although let's not go into that outlandish stuff)

I want a scaling license fee to apply (e.g. % pegged to revenue. This still has an indirect problem with different industries having different profit margins, but still seems the fairest).

And I want the world (or EU, then others to follow suit) to slowly reduce copyright to 0 years* after artists death if owned by a person, and 20-30 years max if owned by a corporation.

And I want the penalties for not declaring usage** / not paying fees, to be incredibly high for corporations... 50% gross (harder) / net (easier) profit margin for the year? Something that isn't a slap on the wrist and can't be wriggled out of quite so easily, and is actually an incentive not to steal in the first place.)

[*]or whatever society deems appropriate.

[**]Until auto-detection (for better or worse) gets good enough.

IMO that would allow personal use, encourages new entrants to market, encourages innovation, incentivises better behaviour from OpenAI et al.

Dylan16807 · 2024-01-21T02:01:44 1705802504

> And I want the world (or EU, then others to follow suit) to slowly reduce copyright to 0 years* after artists death if owned by a person, and 20-30 years max if owned by a corporation.

Why death at all?

It's icky to trigger soon after death, it's bad to have copyright vary so much based on author age, and it's bad for many works to still have huge copyright lengths.

It's perfectly fine to let copyright expire during the author's life. 20-30 years for everything.

wraptile · 2024-01-21T06:25:37 1705818337

Extremely naive to think that any of this could be enforced to any adequate level. Copyright is fundamentally broken and putting some plasters on it is not going to do much especially when these plasters are several decades too late.

alentred · 2024-01-20T18:26:09 1705775169

With this "solution" it looks like the world of art enters the cat-and-mouse game the ad blockers were playing for the last decade or two.

isodev · 2024-01-20T18:39:13 1705775953

I just tested it with Azure AI image classification and it worked - so this cat is yet to adapt to the mouse’s latest idea.

I still feel it is absolutely wrong to roam around the internet and scrape images (without consent) in order to power one’s cash cow AI. I hope more methods to protect artworks (including audio and other formats) become more accessible.

HKH2 · 2024-01-21T01:53:13 1705801993

Artists copy from each other all the time. Arguably, culture exists because of copying (folk stories by necessity); copyright makes culture top-down and stagnant, and you can't avoid it because they have the money to shove it right in your face. Who wants trickle-down culture?

blibble · 2024-01-21T03:10:41 1705806641

it's not an artist, it's a piece of software

in the same way bittorrent or gzip is

HKH2 · 2024-01-21T12:51:01 1705841461

Sure. The person using it has intent. Now we have come to a point in which intent alone is art. Let there be light.

KTibow · 2024-01-20T18:46:29 1705776389

I might be missing something because I don't know much about the architecture of either Nightshade or AI art generators, but I wonder if you could try to have a GAN-like architecture (an extra model trying to trick the model) for the part of the generator that labels images to build resistance to Nightshade-like filters.

the8472 · 2024-01-20T19:25:36 1705778736

It doesn't even have to be a full GAN, you only need to train the discriminator side to filter out the data. Clean reference images + Nightshade would be the generator side.

brucethemoose2 · 2024-01-20T18:41:37 1705776097

What the article doesn't illustrate is that it destroys fine detail in the image, even in the thumbnails of the reference paper: https://arxiv.org/pdf/2310.13828.pdf

Also... Maybe I am naive, but it seems rather trivial to work around with a quick prefilter? I don't know if tradition denoising would be enough, but worst case you could run img2img diffusion.

reply

GaryNumanVevo · 2024-01-20T19:24:27 1705778667

The poisoned images aren't intended to be viewed, rather scraped and pass a basic human screen. You wouldn't be able to denoise as you'd have to denoise the entire dataset, the entire point is that these are virtually undetectable from typical training set examples, but they can push prompt frequencies around at will with a small number of poisoned examples.

minimaxir · 2024-01-20T19:56:24 1705780584

> You wouldn't be able to denoise as you'd have to denoise the entire dataset

Doing that requires much less compute than training a large generative image model.

brucethemoose2 · 2024-01-20T20:23:41 1705782221

I guess the idea is that the model trainers are ignorant of this and wouldn't know to preprocess/wouldn't bother?

That's actually quite plausible.

BugsJustFindMe · 2024-01-21T02:47:18 1705805238

> I guess the idea is that the model trainers are ignorant of this

Maybe they're ignorant of it right up until you announce it, but then they're no longer ignorant of it.

brucethemoose2 · 2024-01-21T18:40:33 1705862433

Right, but they aren't necessarily paying attention to this.

I am not trying to belittle foundational model trainers, but a lot goes on in ML land. Even groups can't track every development.

GaryNumanVevo · 2024-01-20T20:23:14 1705782194

> the entire point is that these are virtually undetectable from typical training set examples

I'll repeat this point for clarity. After going over the paper again, denoising shouldn't affect this attack, it's the ability of plausible images to not be detected by human or AI discriminators (yet)

jamesu · 2024-01-20T15:29:55 1705764595

Long-term I think the real problem for artists will be corporations generating their own high quality targeted datasets from a cheap labor pool, completely outcompeting them by a landslide.

jdietrich · 2024-01-20T20:56:25 1705784185

In the short-to-medium term, we're seeing huge improvements in the data efficiency of generative models. We haven't really started to see self-training in diffusion models, which could improve data efficiency by orders of magnitude. Current models are good at generalisation and are getting better at an incredible pace, so any efforts to limit the progress of AI by restricting access to training data is a speedbump rather than a roadblock.

ufocia · 2024-01-20T20:13:00 1705781580

It will democratize art.

sussmannbaka · 2024-01-21T08:31:12 1705825872

Art is already democratized. It has been for decades. Everyone can pick it up at zero cost. Even you!

The poorest people have historically produced great art. Training a model, however? Expensive. Running it locally? Expensive. Paying the sub? Expensive.

Nothing is being democratized, the only thing this does is devaluing the blood and sweat people have put into their work so FAANG can sell it to lazy suckers.

int_19h · 2024-01-23T05:11:37 1705986697

Time is not a zero cost thing, and especially not for the poorest people.

23B1 · 2024-01-20T20:48:25 1705783705

then it won't be art anymore, it'll just be mountains of shit

sorta like what the laptop did for writing

jrflowers · 2024-01-21T01:15:48 1705799748

This is a good point. There hasn’t been any writing since the release of the Gateway Solo in 1995

Quanttek · 2024-01-20T15:11:44 1705763504

This is fantastic. If companies want to create AI models, they should license the content they use for the training data. As long as there are not sufficient legal protections and the EU/Congress do not act, tools like these can serve as a stopgap and maybe help increase pressure on policymakers

popohauer · 2024-01-20T15:27:19 1705764439

It's going to be interesting to see how the lawsuits against OpenAI by content creators plays out. If the courts rule that AI generated content is a derivative work of all the content it was trained on it could really flip the entire gen AI movement on its head.

luma · 2024-01-20T15:35:38 1705764938

If it were a derivative work[1] (and sufficiently transformational) then it's allowed under current copyright law and might not be the slam dunk ruling you were hoping for.

[1] https://en.wikipedia.org/wiki/Derivative_work

kevingadd · 2024-01-20T15:38:10 1705765090

"sufficiently transformational" is carrying a lot of water here. At minimum it would cloud the issue and might expose anyone using AI to lawsuits where they'd potentially have to defend each generated image.

ufocia · 2024-01-20T20:29:45 1705782585

Sufficiently transformational only applies to copyrightability, but AI works are not copyrightable under current US law, so it's a non-issue.

popohauer · 2024-01-20T20:15:02 1705781702

Oh, interesting, I didn't realize that's how it worked. Thanks for the additional context around this. Guess it's not as upending as I thought it could be.

ufocia · 2024-01-20T20:27:44 1705782464

Not if it is AI generated. So far only humans can be original enough to warrant copyrights, at least in the US .

BTW, the right to prepare derivative works belongs to the copyright holder of the reference work.

I doubt that many AI works are in fact derivative works. Sure, some bear enough similarity, but a gross majority likely doesn't.

torginus · 2024-01-21T11:51:47 1705837907

My biggest fear is that the big players will drop a few billion dollars to silence the copyright holders with power go away, and new rules are put in place that will make open-source models that can't do the same essentially illegal.

int_19h · 2024-01-23T05:24:38 1705987478

If the courts do rule that way, I would expect a legislative race between different countries to amend the relevant laws. Visual generative AI is just too lucrative a thing.

BeFlatXIII · 2024-01-21T13:35:27 1705844127

…then I'll keep enjoying my Stable Diffusion and pirated models.

Kuinox · 2024-01-20T15:13:46 1705763626

> they should license the content they use for the training data

You mean like OpenAI and Adobe ?

Only the free and open source models didn't licensed any content for the training data.