The screenshots you sent in [1] are inference, not training. You need to get a N...

scheeseman486 · on Jan 21, 2024

Hey you know what might not be AI generated post-2021? Almost everything run through Nightshade. So given it's defeated, which is pretty likely, artists have effectively tagged their own work for inclusion.

hkt · on Jan 21, 2024

It is a great shame that we have come to a no-win situation for artists when VCs are virtually unable to lose.

ToucanLoucan · on Jan 21, 2024

I mean that's more or less status quo isn't it? Big business does what it wants, common people can get fucked if they don't like it. Same as it ever was.

hkt · on Jan 21, 2024

That's exactly right. It is just the variety of new ways in which common people get fucked that is dispiriting, with seemingly nothing capable of moving in the opposite direction.

kmeisthax · on Jan 21, 2024

Why wouldn't an artist just generate AI spam and Nightshade it?

visarga · on Jan 21, 2024

Modern generative image models are trained on curated data, not raw internet data. Sometimes the captions are regenerated to fit the image better. Only high quality images with high quality descriptions.

og_kalu · on Jan 22, 2024

I wouldn't call what Stable Diffusion et al are trained on "high quality". You need only look through the likes of LAION to see the kind of captions and images they get trained on.

It's not random but it's not particularly curated either. Most of the time, any curation is done afterwards.

visarga · on Jan 22, 2024

Have you seen the BLIP paper? It's a bit old now, but it introduced a curation method.

https://arxiv.org/abs/2201.12086

KTibow · on Jan 21, 2024

Correct me if I'm wrong but I understand image generators as relying on auto-labeled images to understand what means what, and the point of this attack to make the auto-labelers mislabel the image, but as the top-level comment said it's seemingly not tricking newer auto-labelers.

michaelbrave · on Jan 21, 2024

not all are auto labelled, some are hand labelled, some are initially labelled with something like clip/blip/booru and then corrected a bit by hand. The newest thing though is using llm's with image support like GPT4 to label the images, which kind of does a much better job most of the time.

Your understanding of the attack was the same as mine, it injects just the right kinds of pixels to throw off the auto-labellers to misdirect what they are directing causing the tags to get shuffled around.

Also on reddit today some of the Stable Diffusion users are already starting to train using Nightshade so they can implement it as a negative model, which might or might not work, will have to see.

webmaven · on Jan 21, 2024

Even if no new images are being scraped to train the foundation text-to-image models, you can be certain that there is a small horde of folk still scraping to create datasets for training fine-tuned models, LoRAs, Textual Inversions, and all the new hotness training methods still being created each day.

GaggiX · on Jan 21, 2024

If it doesn't work during inference I really doubt it will have any intended effect during training, there is simply too much signal and the added adversarial noise works on the frozen and small proxy model they used (CLIP image encoder I think) but it doesn't work on a larger model and trained on a different dataset, if there is any effect during training it will probably just be the model learning that it can't take shortcuts (the artifacts working on the proxy model showcase gaps in its visual knowledge).

Generative models like text-to-image have an encoder part (it could be explicit or not) that extract the semantic from the noised image, if the auto-labelers can correctly label the samples then the encoded trained on both actual and adversarial images will learn to not take the same shortcuts that the proxy model has taken making the model more robust, I cannot see an argument where this should be a negative thing for the model.

ptdn · on Jan 21, 2024

The context windows of LLMs are now significantly larger than 2048 tokens, and there are clever ways to autopopulate context window to remind it of things.

jerbear4328 · on Jan 21, 2024

[3] sounds really interesting - do you have a link?

ittseta · on Jan 21, 2024

https://www.nature.com/articles/s41467-023-40499-0 https://deepmind.google/discover/blog/images-altered-to-tric...

Study on the Influence of Adversarial Images on Human Perception