I think a lot of folks are missing the point by thinking this needs to be super-robust to be useful.
This is a hedge against courts deciding scraping data for training purposes is valid.
Maybe you're allowed to scrape data, but with this, you are now applying a filter (creating a derivative work) to defeat a copyright protection mechanism, both clearly prohibited under current law (US jurisdiction at least). For any serious player, scraping this opens your business up to huge lawsuits. For any serious player making tools, you'll specifically avoid defeating these techniques. For any minor player, you'll now have to go to the backwaters of the internet for tools to do this that you hope won't steal your bitcoins.
Every notable artist will only upload their art to sites that offer something like this, paid at first, but when the costs are low enough, pretty much every site that wants art content will offer it.
This isn't a technical solution to this problem, it's a political solution that happens to use tech.
For anyone looking to train on a specific style, this algorithm is useless. For any organization looking to scrape images on a massive scale, it doesn't matter as there are more than enough unaltered images out there.
It's a political solution that capitalizes on fear, yet does not offer tangible protection. Even watermarking is more effective.
It works because SD (and DALL-E 2) don't infer only from the priors of their image training dataset; they also infer and mix up concepts coming from the text embedding, which was itself trained on images (previously, as CLIP or OpenCLIP).
So CLIP can have picked up an association that a named artist is usually synonymous with, for example, "broad strokes, moody lighting", and that is then fed into the diffusion model, which doesn't know the artist but DOES know what broad strokes and moody lighting are.
But sure, if CLIP doesn't know about the artist's name either, it won't work, of course.
By the way, you can still just describe the particulars of the artist you want to mimic by text as well. There is not THAT much information in a style, and you won't need to feed an image into the system.
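For what it's worth, a minimal sketch of that "describe the style by text" idea, assuming the Hugging Face diffusers library and a standard SD 1.5 checkpoint (the model id and prompt are illustrative, not anything taken from Glaze or this thread):

```python
# Illustrative only: prompting a diffusion model with a textual style
# description instead of an artist's name. The checkpoint id and prompt
# are assumptions, not anything taken from Glaze or this thread.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# The style travels entirely through the text embedding (CLIP);
# no reference image is ever fed into the system.
prompt = ("portrait of a lighthouse keeper, broad visible brush strokes, "
          "moody low-key lighting, muted earth tones, impasto oil painting")
image = pipe(prompt, num_inference_steps=30).images[0]
image.save("style_by_description.png")
```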
I guess all of this, with artists trying to protect their online works by watermarking or glazing, will only be a very short speedbump, for better or worse. If a human can do a one-shot style transfer from a single glance at a work, the next round of AIs will as well, and won't be hampered by noise added to the works. You might even have "style extraction" tools that work like ChatGPT, in that you iteratively instruct by text commands what to do to get closer, without ever letting the AI look at an image.
> There is not THAT much information in a style and you won't need to feed an image into the system.
I think it really depends on the style. Some styles are simple, others are more complex. How that translates to an AI understanding it though, I have no idea. But in terms of brush strokes I don't think it's fair to say every style can be described as "broad strokes, moody, dark". Some very simple styles, maybe.
The recently released game "Atomic Heart" has a robotic style that didn't exist in the training dataset of SD, and yet I have seen similar robotic styles generated for it. Of course, I cannot tell if it was a fine-tuned model made for that purpose.
But I do feel that unless your style is very unique, with no roots in existing styles, it would not be possible to "protect" it technologically.
That's the purpose of the training, no? To generalize the model so that it can produce images it has never seen before. That includes images in styles it has never seen before.
Whether or not models have generalized to that point is a different question, but if they do (and let's be honest, a lot of artistic styles aren't that unique or different), the only difference is that the model cannot conjure up the style from the artist's name in the input; instead one would have to describe the style in other ways.
> Maybe you're allowed to scrape data, but with this, now you are applying a filter (creating a derivative work) to defeat a copyright protection mechanism, both clearly prohibited in current law (US jurisdiction at least)
Are you sure it works like this? Because SD training usually starts with resizing, cropping, and encoding to latent space. If applying a filter is clearly prohibited, the current approach of "resizing, cropping, encoding" is surely too, right?
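For context, a hedged sketch of what that standard preprocessing typically looks like in an SD fine-tuning setup; the library calls, checkpoint id, and 512 resolution are assumptions about a typical pipeline, not the specific one under discussion:

```python
# Illustrative sketch of typical SD fine-tuning preprocessing:
# resize, crop, normalize, then encode to latent space with the VAE.
# Any per-pixel perturbation has to survive all of these steps.
import torch
from PIL import Image
from torchvision import transforms
from diffusers import AutoencoderKL

preprocess = transforms.Compose([
    transforms.Resize(512),              # resize shorter side to 512
    transforms.CenterCrop(512),          # crop to the training resolution
    transforms.ToTensor(),
    transforms.Normalize([0.5], [0.5]),  # map pixels to [-1, 1]
])

vae = AutoencoderKL.from_pretrained("stabilityai/sd-vae-ft-mse")

img = Image.open("scraped_artwork.png").convert("RGB")
pixels = preprocess(img).unsqueeze(0)
with torch.no_grad():
    # 0.18215 is the usual SD latent scaling factor
    latents = vae.encode(pixels).latent_dist.sample() * 0.18215
```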
To a): 17 USC § 1201(a)(3): a technological measure "effectively controls access to a work" if the measure, in the ordinary course of its operation, requires the application of information, or a process or a treatment, with the authority of the copyright owner, to gain access to the work.
Maybe? It certainly prevents reusing the work in that specific manner without authorisation. It doesn't prevent just seeing the work though, so it's a bit up to interpretation.
To b): US-like copyright law effectively applies in any WTO-conforming country. Anything that implements WIPO functions roughly the same way. Fun DMCA fact: there is no fair use provision, so any use is likely to be criminal. Another fun DMCA fact: apparently ripping out spyware and republishing is entirely legal. Huh.
I am not a lawyer, so the following is only my opinion.
> It certainly prevents reusing the work in that specific manner without authorisation.
Well, technically speaking, it doesn't prevent it. It just messes up the results of the work being used in that manner. And what if someone builds a training workflow that can just ingest such changed images and use them without being negatively affected by the changes?
> so it's a bit up to interpretation.
Interpretation that would likely have to be decided in court, and likely in a very drawn out and very very very expensive manner, with uncertain outcome.
A machine-readable tag that simply says "no one is allowed to use this for training AI" sounds much easier to argue in court to me.
> It just messes up the results of the work being used in that manner. And what if someone builds a training workflow that can just ingest such changed images and use them without being negatively affected by the changes?
That is what encryption does as well. You can certainly attempt to watch DVDs without permission. It won't be very enjoyable. And what if someone builds a viewing application which can just watch those DVDs anyway? You see, if this is legally protected, building that workflow is circumvention and very, very illegal. Defining "effectively restricts" is left intentionally up to interpretation because there is no clear line between messing up the result and preventing access.
Disclaimer (again): I am not a lawyer, so this is only my opinion.
Encryption prevents usage of the data. This doesn't. The data can still be viewed without any special software, device, password, key, etc. A sufficiently robust ingestion engine could still use it for training. In fact, even an unprepared engine can train on it, it only messes up the outcome.
Honest question: is it harder to argue legally that I DRM-protected my works if I publish them in a form that needs an encryption key/software/device, or if I publish them for everyone to see after changing some pixels around?
> if this is legally protected
"If" is the important term here. It was mentioned above that "maybe" this counts in the same way as a Copyright protection measure.
I don't argue against that. Maybe it does. That is for lawmakers, courts and similar legal experts to decide.
My opinion, as someone who isn't a lawyer, is that it would be EASIER to get courts to agree that machine-readable tagging, simply disallowing the usage of works for training, is similar to DRM measures, and that ignoring it should be punished the same way as circumventing copyright protection mechanisms.
The added bonus for users: such tagging is easier to implement, easier to update, and there already exists prior law covering it (see several European countries) providing legal guidelines. Plus, artists wouldn't have to mangle their works to implement it, and it is usable with all forms of data, not just images.
> it would be EASIER to get courts to agree on that machine-readable tagging, simply disallowing usage of works for training, is similar to DRM measures
It should be. It'd be really great if it were. Sadly, lawyers wrote the DMCA. It has to be a measure which actually restricts access in an effective enough way; just saying "don't touch this" isn't a measure, because it doesn't effectively prevent the usage. 17 USC § 1201(a)(3) and all that.
If training on image sets isn't copyright infringement, "don't use this" doesn't count: it's a license, and you're not infringing on it. If it is copyright infringement, "don't use this" is the default and you can't use anything without explicit permission, effectively requiring datasets to only include CC0 images. Since the first one is far more likely to be true, you instead use 17 USC § 1201(a), which prohibits circumvention of technical protection measures, by creating what is hopefully a technical protection measure. Is it foolproof? It was never meant to be. It's an attempt at best. But it's better than relying on a law which is incredibly likely to never apply to dataset scraping.
Preventing the usage of the data for a specific application vs total restriction is the big issue here. Is it enough to qualify as a technical protection measure? Maybe, maybe not. The courts may agree, or they may not. But it's fairly established in conversation that dataset scraping isn't infringement, so tagging it doesn't really work.
> The new law, that must be applied by all EU countries by 7 June 2021 (Directive (EU) 2019/790 on copyright and related rights in the Digital Single Market or ‘DSM Directive’), in its Article 4 provides an exception from the rights of the database owner mentioned above in case of ‘reproductions and extractions of lawfully accessible works and other subject matter for the purposes of text and data mining’ unless ‘the use of works and other subject matter referred to in that paragraph has not been expressly reserved by their rightholders in an appropriate manner, such as machine-readable means in the case of content made publicly available online’.
Again, I'm not a lawyer, but to me that seems like it's up to lawmakers to do their homework and update existing laws to deal with the reality that
a) data mining exists and is useful for lots of things
b) people want to make their works available publicly, and therefore ...
c) people publishing works need a workable, stable and reliable way to tell others whether they are okay with their work being scraped and used for analysis/training/etc. or not
And as I said above, ideally such a solution doesn't require changing the published data in some way, and works for all kinds of data.
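As a sketch of what "machine-readable" could mean on the scraper side, here is a hypothetical dataset-builder check; the "noai" directive name and the header/meta-tag locations are purely illustrative assumptions, not an established standard:

```python
# Hypothetical scraper-side check for a machine-readable "do not train" signal.
# The directive name "noai" and the places it is looked for are illustrative
# assumptions, not an established standard.
import requests
from bs4 import BeautifulSoup

def training_use_reserved(page_url: str) -> bool:
    resp = requests.get(page_url, timeout=10)
    # Header-based reservation, e.g. X-Robots-Tag: noai
    if "noai" in resp.headers.get("X-Robots-Tag", "").lower():
        return True
    # Meta-tag-based reservation, e.g. <meta name="robots" content="noai">
    soup = BeautifulSoup(resp.text, "html.parser")
    for tag in soup.find_all("meta", attrs={"name": "robots"}):
        if "noai" in (tag.get("content") or "").lower():
            return True
    return False

# A dataset builder would simply skip any page for which this returns True.
```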
That's what would be ideal; this is about making use of existing laws. Besides, laws like that don't pop up for no reason. There would need to be a real, concentrated push for them, and there won't be, because those who stand to lose the most here are not a unified voice, and those who are unified voices stand to gain the most. Larger tech companies will not complain about scraping because they benefit from it; smaller creators will, but they have no voice.

The DMCA passed exclusively to protect the record and movie industries, and the DSM Directive exists to remove barriers on digital services within the EU, but there is nobody who has a voice and wants this. In the EU especially, the voices of companies and organizations count as much as, if not slightly more than, actual votes; non-profits and for-profits alike are intentionally an integral part of decision-making processes, with the intention of making a fairer society. That's exactly why there isn't really any reason to expect those ideals to be reached, and why we have to make do with what has already been passed.
> I think a lot of folks are missing the point thinking this needs to be super-robust to be useful.
It does, though.
> This is a hedge against courts deciding scraping data for training purposes is valid.
For that hedge to work, though, it needs:
(1) Not to substantially degrade the art it is used on, and
(2) [To protect anyone other than a major player against other major players] Not to be trivially bypassed in a way that can be incorporated into automated workflows.
> now you are applying a filter (creating a derivative work) to defeat a copyright protection mechanism, both clearly prohibited in current law (US jurisdiction at least).
If the Fair Use exception applies to training an LLM (which is itself creating a derivative work, before even considering Fair Use), then it's extremely clear that applying a filter to incoming works as part of that process (even if not if the filtered work were used for any other purpose) will also be protected. So the derivative-work angle is useless.
The circumvention measure thing might technically work (in that it interjects a violation into a workflow that would otherwise be Fair Use), but as a practical matter that doesn’t matter for most users or against most violators. Moreover, to the extent its primary effect would be to adversely impact otherwise noninfringing use, it would be a textbook case of a justification for the Librarian of Congress issuing a DMCA exemption, which would then negate the legal utility entirely.
> For any serious player making tools, you’ll specifically avoid defeating these techniques. For any minor player you’ll now have to go to the backwaters of the internet for tools to do this that you hope won’t steal your bitcoins.
The proof-of-concept defeats that have already been demonstrated use the same tools that are used for AI image generation and for training models on artist styles in the first place.
> Every notable artist will be only upload their art to sites that offer something like this
The samples I’ve seen of the damage this does to art suggest that this isn’t the case.
> This isn’t a technical solution to this problem, it’s a political solution that happens to use tech.
> The circumvention measure thing might technically work (in that it interjects a violation into a workflow that would otherwise be Fair Use), but as a practical matter that doesn’t matter for most users or against most violators. Moreover, to the extent its primary effect would be to adversely impact otherwise noninfringing use, it would be a textbook case of a justification for the Librarian of Congress issuing a DMCA exemption, which would then negate the legal utility entirely.
That's a big if. Sure, the Librarian of Congress could justifiably issue an exemption, but I think it is far from guaranteed that they would.
I think you are underestimating what a chilling effect FUD related to anti-circumvention could have. Just consider all the stuff related to DVDs that went down back in the day, e.g. Dmitry Sklyarov being arrested at DEF CON (even if it didn't stick). The uncertainty could definitely have a major chilling effect.
> I think you are underestimating what a chilling effect FUD related to anti-circumvention could have.
I think you are overestimating it, because you are overestimating the degree to which the present legal context is similar to when DeCSS was an issue; because you are underestimating the degree to which the financial and political power imbalance between the xxAAs and the other side was key to the effect back then; because you are misunderstanding who is doing the thing this most affects, before countermeasures (it's not the easily deterrable parties); and because you are (based on the circumvention POCs) overestimating the degree to which any controllable, specialized circumvention tool is needed to circumvent this type of “protection”.
Has the legal situation changed significantly? The political situation has, and perhaps that is more important, but it's not like the relevant laws have been struck down.
I agree that power imbalance is a significant factor that is different. Fair point.
Why would these parties not easily be deterred by legal concerns? There are a variety of groups in this space; many of them are corporations or startups, and usually they are more averse to legal risk than your average pseudo-anonymous citizen.
Using section 1201 of the DMCA (anti-circumvention) to slam the door shut is an UNBELIEVABLY scummy move. An outright shameful concession.
The statute is essentially "we don't care if what you're doing is legal or not, and we're not going to wait for a court to decide; we're playing this trump card to make what you're doing illegal regardless of its copyright status."
It's equally scummy whether it's over AI training, preventing someone from using material from a DVD under fair use, or suing Joe Blow for refilling his inkjet cartridges.
Artists are literally already being affected by their art being scraped to train these models. According to my artist friends, commissions are down, and there's reports of professional positions being replaced as well.
Do you not think that the response is justified in the face of the livelihood of artists being actually stolen from them as we speak?
> According to my artist friends, commissions are down, and there's reports of professional positions being replaced as well.
To be fair, the economy is in a slump. Reduced commissions and positions being cut might have more to do with that than with AI. That said, the market for “generic” art is probably going to be taken over by AI.
It's hard to understand what's going on without witnessing it: kids are collecting ~50 files and throwing them into a packaged fine-tuning tool, or worse yet, just putting an image through img2img to wash off giveaway features, and claiming to be "artists". And those "arts" aren't that well received by audiences. There are some low-budget fringe use cases, but those are low-effort, low-return stuff. People say the AIs are getting better, but to me they are just trying to be optimistic.
Nothing is being taken over by AI; it's just that young clinical-psychopath types are being enabled, only to be burned later at the expense of everyone. That's kind of wrong.
But are their works even included in any of the datasets?
Also: if I want to get, say, cover art for my book, why should I hire a human if Stable Diffusion works just as well? I'm also not interested in ripping off a particular artist's style. Do those artists really think that I'm going to hire them after they opt out of being included in ML datasets? Nope; I'll just experiment with SD a bit more to get the style that I want.
The robot can't do the job. The robot needs the artist to train. The robot can't train on its own output because it doesn't understand what it's supposed to be drawing.
Artists don't need to collaborate with the tool ruining their livelihoods either.
Ruining the lives of actual creatives is making the creative world a worse place right now, so spare me the thought for the long run.
It doesn’t, though. The models that are transforming the markets now would continue to do so, in much the same way, if no additional art from working “traditional” artists was used to train either base models or fine tunes.
Sure, if this had existed before the models (which it couldn’t have, since it incorporates one of them directly) and had been applied to all existing art (including public domain art), and if it were completely effective with no immediate countermeasure, it could have obstructed or delayed things. But now it’s trying to unring the bell, and it can’t, even if it worked flawlessly on a technical level.
> The robot can't train on its own output because it doesn't understand what it's supposed to be drawing.
Actually, a very significant source of training data for new training of models is…output of existing models curated by the trainer. The robot can be trained on its own output.
> Artists don’t need to collaborate with the tool ruining their livelihoods either.
Ironic statement, given that they do even to use Glaze, which incorporates exactly the tool they are blaming in order to do its work. In fact, the entire poisoning attack is done by injecting other artists’ styles into their work, in the hope of confusing style extraction. That is, the tool literally does exactly the thing it is intended to prevent, as its preventive measure.
And what did the artist do at art school and in their formative years? Just learn strokes with zero reference to works of art or culture?
Besides, a diffusion model trained only on public domain works is going to be just as dangerous to their job; these models seem to generalize well enough, and if their style is anywhere close to anything that existed pre-1920, AI will spit out exactly the same thing they do, with no recourse.
And to get to the point: actual creators can use these tools to increase their output tenfold or more. A reduction in commission prices is to be expected, as is the expectation that they will use the tools to produce more.
They are not being outcompeted by AI; they are being outcompeted by a new crop of artists embracing AI tools to work at a greater volume, who can thus sustain a lower price point. More art will be produced, not less, and those in true danger are the artisans who aren't embracing the AI industrial revolution.
Humans come and go but AI is forever and shouldn’t be treated as if they’re the same. Human society is for humans but we’re creating immortal models that can be forever improved and will outperform all current and future humans. So what, the current generation embraces it and the next gets replaced by it entirely?
Seems tech won’t be satisfied till its devalued the entire human experience of growth and effort and replaced it with quick answers and cheap results.
I’m sorry, does it need to be? Does being a Luddite invalidate the argument I’m making, or perhaps you can see that all technology has trade-offs, and perhaps this moves the needle too far in one direction? By all means, continue using "Luddite" as a slam-dunk argument not to consider the impact of this technology on society versus previous forms of automation.
Edit:
Then one day, when the species goes extinct because we’ve got our robo-waifus, we can exclaim with our final breaths that the Luddites were wrong. All technology is great, no issues whatsoever.
Edit2:
I’m kidding of course. We’ll either automate baby making and/or have the means to live forever so there’s no incentive to reproduce. Silly luddites and their antiquated notions of human relationships and death. What will they complain about next, human cloning or brainwashing? They’ll probably bring out some trope like “stop playing god” or “stop bending people to your will”. The technology exists so it must be used. If I don’t brainwash people someone else will right? The ones who fail to adapt to the changing world will just have to do whatever I say.
I’d like that to be true, but the argument that people who champion AI always make is that those who embrace the technology will outcompete those who don’t. This technology isn’t free, certainly not globally. It sounds like power will concentrate in the hands of early adopters. If AI makes everyone productive, then the value of labor goes down. I’m just having a hard time seeing this as a massive benefit to ordinary people.
In that regard you do have a valid point. Equitable access to models is going to be a huge, huge deal. Things aren't getting off to a great start in that regard.
To those looking at becoming politically active in this area, it will be much better to work to democratize AI than it will be to try to stop it. The former will be difficult, the latter impossible.
Well, you live by copyright law minutia and you die by copyright law minutia. If this was glaze's intended purpose, and it actually works, then hats off to them for the brilliant hack.
Interesting viewpoint, but what do you mean by "applying a filter"? There isn't anything inherent in this that is specifically aimed at protecting copyright, so removing it should not count as circumvention. Removing the cloak, if necessary, is more akin to processing the input data to remove racial slurs or misspelled words before training a language model. The cloak is not meant to prevent copying. This is just an adversarial technique against specific models, which will not work in general if you get to train a different model.
My understanding is that the actual circumvention of protections on a copyrighted work is itself a crime under the DMCA, unless your use case is covered by an exemption. It's not a specific set of prohibited uses that doesn't include ML.
In short: "Section 103 (17 U.S.C Sec. 1201(a)(1)) of the DMCA states: No person shall circumvent a technological measure that effectively controls access to a work protected under this title."
So if Glaze is viewed as a protection measure, circumventing it would itself be a breach.
If training LLMs on copyright protected art is itself Fair Use, then the motivating and only significant purpose of this is to adversely impact noninfringing uses. That’s the textbook reason for the Librarian of Congress to establish exemptions from the anti-circumvention rules, so even if as a legal hack that was effective initially, it wouldn’t be likely to last very long.
Why would it need to specifically circumvent this, instead of casually circumventing it simply by better mapping how the bits of an image correspond to how that image appears to the viewer?
If f(1011) and f(1010) both "look" the same, shouldn't a well-designed AI learn equally well to recreate that particular look? It doesn't need a "deglaze" circumvention step; it just needs a continually improving learning process that automatically accommodates both glazed and unglazed images.
Right, a tool explicitly designed and used to "deglaze" images would be different from a network that just happens to not be affected by glaze, I expect.
I think it would be hard to argue it isn't. It is the only value that such a system could theoretically provide, so there is no other reason someone would use it. What would be the argument that it is not one?
I looked at the definition in the law, and it was unclear to me whether this technique meets it. Among other things, there's no "authorized de-Glazing" to recover the original going on during human viewing. But there seemed to be enough leeway that maybe a judge could find a fit.
What surprises me most is that the paper does not consider the most obvious case: resizing of images before training. People usually train an SD model at resolutions of 512/768, so the noise is destroyed to a large extent without anyone even realizing it. Why resizing images is so effective is shown in this article about adversarial attacks: https://towardsdatascience.com/know-your-enemy-7f7c5038bdf3 (a model put through adversarial training independently learns to rescale images as an adversarial defence).
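A minimal sketch of that resize argument, assuming Pillow and a 512-pixel training resolution (filenames, resolution, and the JPEG re-encoding are illustrative choices):

```python
# Illustrative sketch of the resize argument: downscaling a "protected" image
# to the training resolution and re-encoding it is lossy, and tends to wash
# out small per-pixel perturbations. Filenames and sizes are illustrative.
from PIL import Image

img = Image.open("glazed_artwork.png").convert("RGB")

# Typical SD training resolutions are 512 or 768 on the shorter side.
w, h = img.size
scale = 512 / min(w, h)
resized = img.resize((round(w * scale), round(h * scale)), Image.LANCZOS)

# Saving as JPEG stacks lossy compression on top of the resampling.
resized.save("glazed_artwork_512.jpg", quality=90)
```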
They also do not consider, in the Countermeasures section, the fact that people can use diffusion models for what they are trained to do: denoising images. If you put adversarial noise on an image, you can use img2img to remove it (maybe even Canny ControlNet, just to guide it even more).
A final detail (maybe) is that the paper does not address the fact that this only works on models trained with a VAE, and not on diffusion-only models like DALL-E 2, Karlo, or Imagen (maybe MJ, who knows).
The software that applies the "protection" does not run on the GPU even though it runs SD and gradient descent, so it can take as long as 40 minutes to apply it to a single image; it also violated the GPL license of DiffusionBee: https://www.reddit.com/r/StableDiffusion/comments/11sqkh9/gl...
This is creation of derivative art and publishing it in violation of license.
A fair use defense fails given the scale of copy-paste involved, and, well, none of "criticism", "research", or "education" are the uses involved in the release of the derived art. (Writing a paper would fall under "research", IIRC.)
Is DiffusionBee AGPL? If not, they don’t have to release any backend code if they are not distributing it as binaries or obfuscated code.
The GPL says: if you distribute something that is based on GPL code, then you must be prepared to offer the full source code of the thing you distributed, on demand. You can even charge a reasonable fee to cover your costs.
Someone asked them for the code, they responded by publishing it, so there has been no GPL violation. There is only a violation if they refuse a valid request.
I am skeptical of this research. But, it amazes me the degree to which GPL proponents are willing to threaten the livelihoods of what are _clearly_ people with good intentions without even trying to understand the situation or inspect intentions.
These are grad students and artists. It was probably a mistake - or they found it was covered under fair use. But, nope! Stallman wouldn’t approve! Canceled!
So their entire use case is built around the idea that people shouldn't use others' work without permission. Clearly they have some familiarity with this issue.
What possible good intentions could there possibly be here?
Were they unaware of the idea that you shouldn't wholesale copy other people's work? That seems impossible by their own admission.
Were they unaware that they were using the software? That seems implausible.
Did they believe that rules only apply to other people and not themselves? If so that is my definition of a bad person.
Quite frankly, I don't see any possible way this could be dismissed as an innocent mistake. Arguably the hypocrisy makes it much worse than your average GPL violation.
Artists and grad students are people too. Being an artist doesn't alleviate moral culpability.
Edit: however, to clarify, since you said people wanted to "ruin their livelihoods": I want to be clear that I don't support anything like that. Negative consequences should fit the "crime", and a ruined livelihood would be way out of proportion.
I'm actually just summarizing what the sources say here.
"Plagiarism is the fraudulent representation of another person's language, thoughts, ideas, or expressions as one's own original work." [1]
Tool claims to prevent plagiarism, paper uses "plagiarism" language throughout. [2]
Writers of tool used substantial portions of a different tool without acknowledgement. Source explicitly calls this out as plagiarism in first paragraph [3].
Plagiarism and copyright/licensing violations are only tangentially related.
If you have the license to use some source code (e.g. it's public domain, or you purchased a license which allows royalty-free redistribution) then you have no copyright issue. If you submit that code to your school as your assignment, or part of your assignment without attribution, then that is plagiarism nonetheless. It's a problem between you and the school (and the world at large), not between you and the author of the code, who allowed your use.
The GNU license is not concerned with academic misconduct. Users are encouraged to use the code as the basis for making something that works for them. It has to be redistributed according to certain rules so that the freedoms originally granted to the users are preserved. The concern isn't that the correct author has been identified to the users, but that they can get the source code and build that same thing, and also redistribute it with their modifications, if they wish. If you tell the users that this is based on GPLed code from such and such, but don't give them the code, then you are innocent of plagiarism, but still in violation of the license.
Most programs use code that their author didn't write, often a lot of code; that is understood. When I see an announcement like "Glaze: Protecting artists from style mimicry" and read its synopsis, I'm assuming that 90% of it is cobbled together from libraries and adaptations of other people's code. We don't think of it as plagiarism unless the central, distinguishing idea of the work is something that was consciously taken from someone else and presented by the purported authors as their own.
And even then, if that plagiarized idea is effective in implementing something which prevents plagiarism, that just amounts to irony. Irony is a real philosophical construct, but not a software defect.
AFAIK Attribution is the minimal requirement for all the free/libre/open source licenses, as well as (almost?[1]) all of the open content licenses.
If you plagiarize, you will also tend to automatically violate both copyright law and most licenses, including BSD, MIT, and GPL. The GPLv3 is a bit more complicated than the other two, but section 4 and 5 state that you must include "appropriate copyright notices", which means full attribution.
Here's a more in depth discussion specific to the GPLv3:
[1] (edit) I was thinking that CC-0 (being close to PD) would not require attribution. But on looking it up, it's a bit more complicated.
(edit 2) In general I don't think "that is understood" holds. If you plagiarize copyrighted content, you might be breaking one or more laws. Depending on what you did exactly, how much you did, and how nasty the other party is feeling: you may be forced to take "your" work down, delete it outright, hand it over to the other party, pay damages, pay a large fine, or even go to jail.
(edit 3) I agree that plagiarism by itself need not be illegal in some contexts. But if you then distribute your plagiarized work (which is very easy to end up doing these days), you do end up infringing copyright law.
FOSS licenses typically require the preservation of the copyright clause. They are not mainly doing this for attribution rights, but to preserve the licensing terms: copied works (perhaps with modification) are licensed to subsequent users the same way. This has the side effect of discouraging plagiarism, since it's hard to present something as being exclusively your product if someone's copyright license is prominently displayed.
US copyright law doesn't mention the word "plagiarism", though there is a small section in it entitled "Rights of certain authors to attribution and integrity". I'm pretty sure FOSS licenses are not relying on this. They rely on the idea that the license lapses if any of its terms are not met. When the license lapses, the right to redistribute is revoked. A lawsuit revolving around someone removing copyright notices would most likely be based on unauthorized reproduction (the license to redistribute is not granted to someone who removes the notices), not on violating attribution rights (where the license to redistribute is unconditionally granted, but authors retain attribution rights, which are being stepped on).
Software licenses as a broader class not limited to FOSS do not necessarily discourage plagiarism. In proprietary software, it's common to be able to license a software component on such terms that its authors need not be mentioned anywhere; neither the binary build of the program, nor any accompanying documentation have to display any notices related to the licensed part. If you license parts of a program in this way, and say that you wrote all of it, that is plagiarism. In an academic setting, it could get you disciplined and kicked out of school.
Homework writing services probably license their solutions on similar terms (out of necessity): you may present the purchased solution as your own work.
Free software writers are not mainly concerned with plagiarism. You're encouraged to take a program and make it yours, if it doesn't work the way you want it. You may add your own name to the copyright notice if you make more than just trivial changes, and if you add new files that are not a derived work of the program, you can use your own license for them, provided it's compatible.
Someone who, in redistributing some code, neglected some requirement stated in its free license isn't ipso facto a plagiarist: i.e. isn't guilty of fraudulently presenting the entire program as their sole creation. Situations in which that is the case will be painfully obvious.
Therefore, in summary, it's basically just trolling to throw around the word "plagiarism" when someone made a mistake or oversight in some redistributing use of a program.
I would agree that these tools are almost inevitably going to fail. If anything, they'll only improve the models' abilities to handle edge cases accurately.
However, I'm surprised at how little empathy is being displayed towards artists by so many here. Having a model train itself on your work is not the same as having another human being inspired by or even copying your work, despite the fact that both involve some sort of learning.
I am not an artist but I build all my solo programming projects "in the open". My programs are mostly niche stats models, and frequently people message me with questions and ask for help integrating my work into their software. I'm happy to do so. I don't care if they credit me in their final product, but knowing that humans went through my work and took time to appreciate it is a big motivator for me. I would imagine that many artists feel similarly.
On the other hand, knowing that OpenAI, Github/Copilot, etc, train models on my work and turn it into some pay-to-play API, without a human ever seeing my work during the whole process is a nasty feeling. At that point, I've just been turned into faceless cog to generate profits for big tech shareholders. Luckily, these are just side projects for me and I can just make these repos private, but of course artists are forced to "build in the open" by the very nature of their work.
Because the main objective of commercial generative AI projects is the "Uberization" of the arts: massively dumping prices, becoming gatekeepers of the creation process, and ultimately forcing themselves between clients and artists, who then become disposable "content creators" for the training model.
my personal take is that it is okay (for code and other copyrightable works), and possibly for the greater common good if, and only if, the resulting models are permissively licensed.
Yeah, I think it would make sense to have a special condition like the following:
You get to train models on public data under fair use if and only if you release the resulting models (it is not clear whether the models themselves are copyrightable at this point) to the public and do not claim copyright on them.
Nobody will use this, it takes too much time and it ruins the art by adding a weird texture.
I understand the disappointment of real artists that the tech bros are stealing their lunch, but sadly it's the way all these things go. Pandora's box is opened, things aren't going to go back the way they were.
Even if an artist manages to completely guard themselves against exploitation by AI, within a few years the market expectation for art commissions and work will be 100x higher than it was before AI art. It's like the scene in There Will Be Blood: the milkshake of traditional artists has already been drunk.
Trying to use this to stop AI art is like trying to stop climate change, except 10,000 times harder:
1. Like climate change, it's a commons issue. AI learns from artists at large, so a single artist protecting themselves does nothing. Nobody copies specific artists except the very top tier, who have distinct and beautiful styles.
2. AIs will still have all the art produced before 2023 to train on, which is a gigantic amount, with huge room for optimization in training.
3. This can be trivially circumvented. Given the requirement that the glazed image look identical to the human eye, it must be informationally possible to reproduce the image without the glaze.
4. It should be trivial to train a quick GAN to revert this glaze, given that you can trivially create before/after datasets using this very tool (a rough sketch of the idea follows below).
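A rough sketch of what point 4 could look like, assuming PyTorch and synthetic placeholder tensors standing in for (glazed, original) pairs; the tiny residual network is purely illustrative and says nothing about how well such a restorer would actually work:

```python
# Rough sketch of the before/after "restorer" idea: pair glazed images with
# their originals (which the tool itself lets you generate) and fit a small
# network to undo the perturbation. The architecture and the placeholder
# tensors below are illustrative assumptions, not a claim that this works well.
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

class TinyRestorer(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 64, 3, padding=1), nn.ReLU(),
            nn.Conv2d(64, 64, 3, padding=1), nn.ReLU(),
            nn.Conv2d(64, 3, 3, padding=1),
        )

    def forward(self, x):
        # Predict a residual correction rather than the whole image.
        return x + self.net(x)

# Placeholders standing in for (glazed, original) image pairs scaled to [-1, 1].
glazed = torch.rand(64, 3, 256, 256) * 2 - 1
clean = torch.rand(64, 3, 256, 256) * 2 - 1
loader = DataLoader(TensorDataset(glazed, clean), batch_size=8, shuffle=True)

model = TinyRestorer()
opt = torch.optim.Adam(model.parameters(), lr=1e-4)
for epoch in range(3):
    for x, y in loader:
        loss = nn.functional.l1_loss(model(x), y)
        opt.zero_grad()
        loss.backward()
        opt.step()
```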
I don't really like this viewpoint, respectfully. It reads like "learn to code", except now that it is coming for programming as well, perhaps the new slogan should be "learn to plumb". Your point about climate change is apt: just because something is hard doesn't mean there should not be some sort of protections put in place for the intellectual property of artists. Overtrained AI produces something similar to plagiarism. There should be legal protections against that.
OK. Let's assume legal protections are put in place tomorrow, and today's living artists magically (because this skips all the details of "how do you make that work?") acquire all the rights and enforcement abilities they could wish for. What do we expect to change?
I think the uncomfortable answer is that for a lot of commercial art uses, nothing changes. In many cases people just want something to fulfill a need and aren't really all that picky about it. Maybe the style changes to be one of the Old Masters instead of something current. Getting it fast and cheap means they don't have to think about it much. A great many commercial art jobs will vanish just as they will outside this hypothetical.
Perhaps we should pause and identify what outcome we want before prescribing policy. Are rights our priority, or are we trying to secure the incomes of artists?
The real desired outcome seems to be "artists can make a living from doing art (without needing to be famous)". Unfortunately that goal has been dubious at the best of times, even before AI. The same questions are happening in fiction writing.
It's been dubious in fiction and music for a long time as well. At a remove, this feels less like a major shift and more like a collective agony as a dream that felt within reach recedes into the distance.
I don’t know, I truly don’t. I don’t have any answers, because you might be right, especially for commercial art. Just as with climate change, I don’t see any solution; it all gets worse and worse for everyone involved, except a select few. The outcome I would want, I suppose, is that the livelihoods of artists are protected, or at a minimum that if they do not want a program to mimic a clearly unique style, it doesn’t. That might just be pie-in-the-sky dreaming at this point, though. All desk work is going to disappear; everyone will be displaced.
What is your goal here? What outcome do you want to achieve? It sounds like your primary goal is financial, with rights being a means to an end. Is that accurate?
That might be accurate; however, it is not just financial, right? I would say that is the bulk of it, though. On the self-worth side, someone posted below, but humans tie self-worth to their profession as well, right? That is probably something wrong with society as a whole, but that is how people work today. In the case of an artist (I am not an artist, just to be clear), they've worked decades to hone this craft, correct? To have it scraped and used at a scale only a machine can manage: something about that feels wrong. I get that this is an emotional appeal, which is probably why I will never write policy.
Once upon a time, when thread and cloth was all made by hand, the equivalent of a t-shirt cost the equivalent of $5000 in labor alone. Today we consider that absurd. The spinners, fullers, and weavers of the time considered it their livelihoods, the way they fed their children, the way they achieved economic and social status, and thus just and right. Today we're far enough removed from those days to consider their perspective transparently self-serving for all that their distress must have been immensely real and sincere. We use mechanical looms now. We broadly agree that people have better things to do with their lives than spin rough thread, better ways to contribute to society.
So I get why it feels wrong to many, but I also have no trouble placing it and those objections in a context of a recurring historical pattern.
You are ultimately right. Progress cannot be stopped and won't be stopped. My only wish is to help mitigate the pain. You brought up t-shirt manufacturing. That was true, and while the Luddites ultimately lost, I thought maybe a lesson society could have gained from that was some level of compassion for those it is happening to. Because it's not just artists this time, and it's not just the factory; this time it is every profession that can be done at a computer. I am a programmer and I feel like I see the writing on the wall for us too. Soon we will be the Luddites who are scorned, but yes, society and economics don't owe me or any artist a damn thing. This is an emotional post, but reading these stories, the only answer I have come up with is to just allow it to upend all of our worth, security, and jobs. If you are a 50-year-old corporate artist, hopefully something helps you on your way to a new profession. It's the same reason I feel society doesn't do nearly enough for coal miners either. Coal is a nasty product, yes, and a coal miner should clearly not "just learn to code". I don't know what should be done, but ethically, something compassionate clearly needed to have been done.
Why don't you say what you want? You don't want compassion. Compassion is an internal emotional experience. I am having compassion right now for the hypothetical hordes of unemployed former commercial artists as I consign them to history's scrapheap. Compassion is a story we tell to tug at the heartstrings of ourselves and others.
Compassion is not a policy. Compassion is not a plan. Compassion isn't what coal miners or Luddites wanted. What they wanted was to freeze in amber a way of life that served them well and could be passed on. That's both a reasonable thing to want and a very unreasonable thing to expect.
Or maybe you don't know what you want as an outcome. That's OK. It might be worth thinking about that.
For my own part, I'm not too worried about programmers quite yet. After all, programming is the easy part. All you have to do is get the business to decide precisely what they want and communicate it clearly. For the future, well, adaptability and rapid learning have been the hallmarks of every good engineer I've ever worked with. We're a flexible lot.
I don't know what I want as an outcome. I think that is part of the discussion. You have come at me with a lot of really good questions. I think the discussion is the most important before we hit that hypothetical soon.
For the sake of clarity, I am an engineer as well. I hope you are right. As I see it now, it seems like every single job is on the chopping block before I hit retirement age. We already see a few experts arguing about this.
I think I am doing a poor job conveying what I want, or what I mean by compassion. As you said, it is true that the Luddites or coal miners in those examples wanted to pause time, which is clearly a very bad idea when it comes to something like coal. But when I say compassion, it is not that I want to tug at the heartstrings of ourselves and others. I mean it in a selfish way: when we can answer the question of what people do when their livelihoods are taken from them, we get a better-functioning society. Taking coal miners as an example, several studies have shown much higher drug use as despair has grown in those communities. I see that and extrapolate it out to the large hypothetical horde of unemployed white-collar commercial artists, and perhaps all desk workers (myself included).
It has legitimately kept me up at night, trying to think what sort of policy we need and what responsibility we all have now. Economically, nobody owes anyone anything, but is that an ethical answer?
I don't know what that looks like. Which goes back to my original answer, I don't know what I want as an outcome. I just think that perhaps, we will see less animosity and less anxiety when we have better answers to our modern Luddites.
Unlike the Luddites, information is so available today that it'd be easy for an artist to predict the demise of their profession (presumably).
Therefore, why is it not the artist's responsibility now to look for alternatives, rather than waiting until they truly become obsolete and personal resources run out? If you went back in time and told the Luddites that in five years' time their services would no longer be required by society, would they not retrain themselves instantly, instead of waiting out the five years and hoping that society has some sort of welfare program ready for them?
And yet watermarks are a thing. I'm skeptical that no one will use this, especially if you can put it into a pipeline of multiple image transformations that "protect art". It could be bundled into tools or onto platforms themselves as part of their value-add. YouTube already transcodes your videos; why wouldn't DeviantArt encode your art except to verified human accounts, or something like that?
Their efficacy is the more interesting question to me.
The full-resolution examples I’ve seen have shown this as much more destructive to the underlying art than usual watermarks, to the point where I’m not even convinced people would use it on samples when the underlying art wasn’t displayed online but sold through other channels.
> YouTube already transcodes your videos, why not DeviantArt encode your art except to verified human accounts or something?
So the idea is to be “protected” against automated scrapers building datasets for training, say, the core Stable Diffusion models, but not against the huge number of individuals training (and remixing) checkpoints/LoRAs/TIs/Aesthetic Gradients and exchanging them, which is what drives the ecosystem?
Maybe you have a different view of the ecosystem than I do? Everyone I know who has used this is not an AI expert but is still tech-savvy. None of them have trained their own models. The path of least resistance for them would be to work around the "protected" art if they were doing something more custom.
For someone with more effort and time to dedicate to cloning a specific artist, maybe it's not as useful. But I know less about that type of use case and the tools in that part of the ecosystem.
> Maybe you have a different viewpoint of the ecosystem than me? Everyone I know that’s used this is not an AI expert but still tech savvy. None of them have trained their own models.
Look at something like Civitai [0] and the number of users submitting things there: pretty much everything there (there are a few things, like poses and tools, that aren't) is one kind or another of trained model (though some are merges of existing models or extractions of one kind of model from another).
> *Nobody will use this, it takes too much time and it ruins the art by adding a weird texture.*
New Cloudflare offering:
ECA: Edge Cloaked Artwork.
Enabled at the edge with our blazing fast CDN, ECA is a brand new Cloudflare service that offers a groundbreaking solution for digital artists and creators! With our CDN network, we can now cloak artwork to protect it from unauthorized use, while still maintaining its visual appeal to the human eye. Our technology modifies the data of your artwork in a way that's invisible to the naked eye, making it nearly impossible for AI companies to use your artwork in their training samples. This not only ensures that your artwork is protected, but also helps prevent the proliferation of unethical AI practices. Trust us to safeguard your artwork and its value like never before. Try Cloudflare's artwork cloaking service today and experience the peace of mind that comes with true digital security.
The description (as well as the project as a whole) is incredibly disingenuous and capitalizes heavily on artists' fear of their work being used to train AI.
Even one pass of SD's img2img with low denoise is enough to bypass this data-poisoning attack. It's a useless attack that makes training only a tiny bit more inconvenient.
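A hedged sketch of that one-pass img2img wash, assuming the diffusers img2img pipeline and an SD 1.5 checkpoint; the strength and prompt are illustrative values, not a tested recipe:

```python
# Hedged sketch of the "one low-denoise img2img pass" idea described above,
# using the diffusers img2img pipeline. The checkpoint id, prompt, and
# strength are illustrative values, not a tested recipe.
import torch
from PIL import Image
from diffusers import StableDiffusionImg2ImgPipeline

pipe = StableDiffusionImg2ImgPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

src = Image.open("glazed_artwork.png").convert("RGB").resize((512, 512))
out = pipe(
    prompt="a painting",   # near-neutral prompt; we only want light re-denoising
    image=src,
    strength=0.15,         # low denoise: mostly preserves the original content
    guidance_scale=5.0,
).images[0]
out.save("washed_artwork.png")
```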
The project also gained infamy when it stole code from an open source project with GPL license without giving credits.
Additionally, SD now offers opt-outs for artists and MD likely does not train on these artists at all, and there are far more effective ways to protect art than this algorithm.
This project, and a lot of projects in the future, will be in a similar vein: something that appeals to the fear of being cheated or made irrelevant by AI, by promising to return primacy to the human.
It all seems well intentioned but also naïve. It's like when robots take over doing all surgery and massively reduce the risk of accidental death: surgeons will get together and insist on banning the robots, claiming that humans still do it better, even if they don't.
This is entirely false. My S.O. is relatively well known in comic art communities, and they (and many others) are using this right now. I don't think HN quite grasps how much artists are trying to ensure ownership of and governance over their art.
Yesterday, the SAND Lab at UChicago made Glaze available to download. It’s a tool to help artists protect against their work being used to train AI models. It got a bit of buzz last month, including a New York Times spot. However, it has some issues:
1. The authors plagiarized code from DiffusionBee, an AI art tool licensed under GPL.
2. The paper contains inflammatory and libelous language with no legal backing.
3. It doesn’t work and I was able to execute a proof-of-concept bypass in minutes!
Each of the sections below will go into further detail on these points.
> The paper contains inflammatory and libelous language with no legal backing.
I don't think that copying an artistic style is stealing legally or morally (I think the moral issue is "passing off"). But I think the reaction ("libelous language") is way over the top.
Crazy to me that the people making this plagiarised code to make it happen, from the very people they're accusing of "theft", and then, when called on it, said they'd... open-source their GUI, even though the GPL code is also present on their back end and they're continuing to publish it.
I think property rights are a sham; if we got rid of copyright altogether, for code and art and everything else, I'd be very happy. But if you're going to falsely accuse people of stealing when what you're actually accusing them of is copyright infringement, probably don't have your literal only product be a product of copyright infringement.
This is going to be an arms race. Inevitably Glaze will be thwarted, and a new version will need to be made.
Generative art models will always have the advantage in this fight. They just need to crack the protection and update their model. Artists who want to protect their work will have to continually update their whole portfolio of images and republish them whenever a new version of Glaze comes out (and attempt to ensure that their images with old versions of Glaze don’t end up in a training set, which will be difficult if not impossible).
Right, so off the bat: if you're going into a situation with your generative art model, and your first thought is "we're totally going to break through this attempt at someone trying to protect their artwork", aren't you automatically in the wrong?
Unless it’s specifically trained not to, I’m fairly certain AI will bypass this without even trying. Such is the way of the unsupervised hill climb on huge unstructured datasets. To put it another way, in order for an artist’s style to truly be invisible to AI, it would have to be invisible to humans.
Wrong? It's just a fact that generative AI can simply train against these "protected" images. I don't think the parent said anything about ethics, if this is even an ethical issue in the first place.
You wouldn't even need to try to break it. The goal of the model is to learn patterns and pass tests against the training data. Given that these images are expected to end up in training sets, eventually whatever technique is being used here will fail to thwart the training process.
It also turns out to be useful in often helping human beings coexist relatively peacefully and sometimes negotiating mutual interests productively, so like a number of things that don't exist until enough people agree to conduct themselves as if they do, it's probably a good idea in general.
Intellectual property has some distinct characteristics, but it's fundamentally the same.
And assuming entitlement to any available data for ingestion into a training set without a moment's regard to the labor and agency of the people who produced it has things in common with the ethic of a thief who, coming upon your stuff, decides that if they can take it, it's theirs, society's construction of property be damned.
Except that intellectual property can be copied while leaving the original behind, but this is a centuries-old debate and I doubt we'll break new ground going 'round the mulberry bush about it again here.
(... I'm literally wearing a "YOU WOULDN'T REIMPLEMENT AN API" t-shirt as I type this.)
You'll note on reviewing my comment that I didn't claim there are no differences. That's intentional, having tread the relevant ground plenty of times myself.
The similarities matter more than the differences here.
And the problem really isn't breaking new ground in general. People saw the similarities at least as far back as the printing press. The problem is getting people to care about the claims others have on the fruits of their labor, and why incentives might matter for everyone.
If technology causes the value of the fruits of that labor to crash, then... the value of the fruits of that labor crashes. No more and no less.
The invention of the printing press didn't stop writing, the invention of the camera didn't stop painting, and the invention of Stable Diffusion won't stop creation of novel art. But it definitely upends a known methodology for extracting value from non-fungibility of labor product.
In any case, the labor-to-value question is moot because Stable Diffusion will enable companies to generate unlimited visuals off the single-time-compensated manual labor of a handful of artists (for no other reason than some artists will take that deal). So whether any one artist is in the training set will become irrelevant as people choose nearly-free good-enough product over far-more-expensive handcrafted product most of the time.
Glaze is useless when an advertising firm is fairly-compensating consenting artists to toss their work into the meat grinder and that grinder then still churns out work that swamps the individuality of traditional-methods artists.
> If technology causes the value of the fruits of that labor to crash
Well then, if the fruits are worthless, then presumably there's no loss from excluding them from a training set, and creating norms or even laws which allow people the privilege of negotiating the basis on which their work can be so used, right?
Of course, nobody believes that. Much like no one really believed that unauthorized copies of 18th century print works were valueless just because the production could be industrialized. The enterprises producing unauthorized copies did so because they knew full well they could capture the value... without any of the pesky pro-social obligation to respect the labor and interests of those who created it.
The interest in the works as training data betrays the position. This work is valuable, and works derived from it are valuable.
> Glaze is useless when an advertising firm is fairly-compensating consenting artists
I imagine there will be artists who opt in after negotiating the basis on which their work can be used -- although who knows what kind of surprises there could be there, some might even demand equity or collectively bargain.
But deciding there's no need for a framework of negotiation and consent just because some people will get to yes sounds like a really bad precedent.
And if the argument is that this isn't sufficient to protect interests, well, sure. Necessary but insufficient points exist all the time, usually the thing to do is pair them with other axioms / initiatives.
The printing press didn't end copyright, but it obviated the need for hand-illuminated manuscripts. I expect automatic image generation will do something similar to hand-created from-scratch art. And yes, if there's a framework to be created to allow artists to indicate that they don't want to be involved and wish to lead the charge into irrelevance, it should be created. It's best to have a revolution be tidy not messy.
Unfortunately, as we have learned from open source software, it doesn't matter if it's right or wrong. The only way to enforce licensing is to litigate.
The decades of GPL violations have taught us the only way to get commercial interests to not abuse copyleft licenses is to force them via the courts, and in doing so dis-incentivize other businesses from violating the licenses.
But it isn't 100% effective by any means, and usually relies on a large commercially successful organization to already have aligned interests in enforcing their GPL/copyleft licenses.
Imagine a GUI-less version of Glaze designed to be plugged into other systems. Stuff it into a plugin for your portfolio CMS. Add it to the popular gallery site you run. Add it to the tool you use to post art to a half-dozen galleries with one click. When a new version of Glaze comes out, update it, and push the button that starts to re-glaze the original copies of the uploaded images, and replaces the publicly-displayed stuff.
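A minimal sketch of what that headless workflow could look like, assuming a hypothetical glaze_image() call (no such API is actually published; the function, its parameters, and the directory layout are all invented here for illustration):

    # Hypothetical batch "re-glaze" step for a portfolio CMS or gallery pipeline.
    # glaze_image() is a stand-in for a headless Glaze API that does not exist yet;
    # the stub below just loads the image so the sketch is runnable.
    from pathlib import Path
    from PIL import Image

    def glaze_image(path: Path, intensity: float = 0.5) -> Image.Image:
        """Placeholder for the hypothetical cloaking call."""
        return Image.open(path).convert("RGB")

    def reglaze_portfolio(originals_dir: str, public_dir: str) -> None:
        """Re-run cloaking over the pristine originals and overwrite the
        publicly displayed copies, e.g. after a new Glaze version ships."""
        out = Path(public_dir)
        out.mkdir(parents=True, exist_ok=True)
        for original in sorted(Path(originals_dir).glob("*.png")):
            cloaked = glaze_image(original)
            cloaked.save(out / original.name)  # replace only the public copy

    reglaze_portfolio("private/originals", "public/gallery")

The key design point is that the un-cloaked originals stay private; only freshly cloaked copies are ever published.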
Actually distributing the images would be blatant copyright infringement and, even if you don't care, is prohibitively massive.
So, yes, modifying the source images will be effective against new players who want to scrape the dataset... and yes, as the tech to 'build your own stable diffusion' becomes more commoditized, you can be 100% assured that multiple groups will be doing it in the future.
...this also has a FAQ that states even a limited number of poisoned images (e.g. the most recent art by Greg) can cause artifacting in outputs.
2) ...but more importantly, are you joking?
Artists complain and artstation does what, update their TOS?
That obviously does nothing.
So, companies start inventing this anti-AI watermarking tech that shops like deviantart and artstation can pick up, put into their pipelines, and make into a pro feature.
If you think it's not going to happen, you are 100% deluding yourself. The playbook is so obvious, I'd be amazed if it's not already a WIP for these places.
Artist friendly. Make money. Big players can work around it and small players are screwed out of any good images to use for training.
It's going to happen.
3) '40 minutes to return a single image...'
Oh come on. Are you objecting because you think the process has no technical merits, or because you don't like it?
Technically, this isn't a limitation. These places already have massive image processing pipelines for resizing, thumbnailing, etc.
That's because AI art was a mere research toy back then.
Now it has massive economic potential, so do you really think downloading, say, 100TB of images is a big deal to any company? Less than $100,000 to get an invaluable database that's safe and sound. It's also not prohibitive at all to just clone this data onto a NAS array and literally ship the hard drives to whoever needs it.
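Rough arithmetic for that claim; the per-unit prices here are my own ballpark assumptions, not figures from the thread:

    # Back-of-the-envelope cost of hoarding ~100 TB of scraped images.
    # Both unit prices below are assumptions (rough 2023 ballpark figures).
    dataset_tb = 100
    hdd_cost_per_tb = 20          # assumed ~$20/TB for bulk consumer hard drives
    bandwidth_cost_per_gb = 0.05  # assumed ~$0.05/GB if you pay for transfer at all

    storage = dataset_tb * hdd_cost_per_tb                # ~$2,000
    transfer = dataset_tb * 1000 * bandwidth_cost_per_gb  # ~$5,000

    print(f"storage ~${storage:,}, transfer ~${transfer:,}")
    # Even doubling or tripling these for redundancy and overhead leaves you
    # well under the $100,000 figure mentioned above.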
Google has already downloaded terabytes of images and they store resized thumbnails of those images on their own servers. If google can do it why can't everyone else?
>Once you add a cloak to an image, the same cloak can prevent different AI models (e.g., Midjourney, Stable Diffusion, etc.) from stealing the style of the cloaked image
But my understanding of what they're actually doing is a data poisoning attack on the fine-tuning process (e.g. someone tries to finetune StableDiffusion on a handful of a particular artist's images). Does Midjourney offer that sort of finetuning at all? And isn't it misleading to say that you're preventing Midjourney and Stable Diffusion from "stealing the style"? That seems to imply you're also poisoning the regular training process.
I'm also very unconvinced that this offers any meaningful protection in practice. The lesson from years of research on adversarial perturbations has been that it's very easy to make your method look successful in your own paper against a naive adversary, and way harder to make something that stands up to an intelligent counter-attack.
I'm not really convinced it's ethical to present this in the way they do to artists who probably don't understand the technical details. Even with their disclaimers and limitations section, this website gives a much rosier picture of the tool's effectiveness than I think is justified. If an artist is concerned enough about this sort of thing to want to use this tool, I think they'd be upset to learn about its true (in)effectiveness, and once they've chosen to post their Glazed images, it's too late.
The description (as well as the project as a whole) is incredibly disingenuous and capitalizes heavily on artists' fear of having their work trained on by AI. Even one pass of SD's img2img with low denoise is enough to bypass this data poisoning attack. It's a useless attack that makes training a tiny bit more inconvenient.
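For reference, the img2img pass being described is only a few lines with the Hugging Face diffusers library; a sketch below, where the checkpoint, prompt, and strength value are illustrative choices, and whether one pass really strips any given cloak is an empirical question this snippet doesn't settle:

    # Sketch of a single low-denoise img2img pass over a cloaked image,
    # using Hugging Face diffusers. Model ID, prompt, and strength are
    # illustrative; this does not prove the cloak is actually removed.
    import torch
    from PIL import Image
    from diffusers import StableDiffusionImg2ImgPipeline

    pipe = StableDiffusionImg2ImgPipeline.from_pretrained(
        "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
    ).to("cuda")

    cloaked = Image.open("glazed_artwork.png").convert("RGB").resize((512, 512))

    cleaned = pipe(
        prompt="a painting",   # generic prompt; the input image carries the content
        image=cloaked,
        strength=0.2,          # low denoise: re-synthesize only fine detail,
                               # which is where adversarial perturbations live
        guidance_scale=5.0,
    ).images[0]
    cleaned.save("cleaned_for_training.png")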
The project also gained infamy when it stole code from a GPL-licensed open source project without giving credit.
Additionally, SD now offers opt-outs for artists, and Midjourney likely does not train on these artists at all.
Disagree with that. We're talking about defense against a situation where a user downloads a fine-tuning colab notebook, uploads a handful of the artist's pictures, and "steals" their style. The user here is interested in the specific artist, not someone blindly ingesting billions of images. And the technical know-how to overcome the defense will be in the colab notebook, written by an expert, completely hidden from a user.
> But my understanding of what they’re actually doing is a data poisoning attack on the fine-tuning process (e.g. someone tries to finetune StableDiffusion on a handful of a particular artist’s images).
No, it's poisoning core model training as well, because the core models are trained on scraped art with artist and other metadata, which is why you can use “in the style of…” prompts for lots of artists with the core models. It also applies to fine-tuning, etc.
It is absolutely true that the Glazed images will be scooped up by a fresh web scrape for training data for a new model. But there isn't any evidence that this would provide any actual defense. Their paper only studies the fine-tuning scenario. It seems to me that if you train your text-to-image system from scratch on Glazed images, Glaze has lost its upper hand. You'd essentially be performing adversarial training, but with a fixed adversary!
At the very least, I'd want to see some actual experiments on training from scratch before telling artists that Glaze will protect them in that scenario. And I'm very skeptical that it would.
I hadn’t really thought about that. If it doesn’t work against people training the base models, and those are inevitably going to be trained on a wider and wider set of internet-available imagery, it seems like this is even more futile.
Wouldn't having a small portion of adversarially modified images in your training set improve the robustness of the model?
It's a known technique to do this kind of thing intentionally to train models that are more resistant to adversarial attacks. One reason people don't do this is that the cost of running PGD is so high, but in this case your adversaries are doing it for you free of charge.
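For anyone unfamiliar, PGD-based adversarial training usually looks like the sketch below (shown on a toy classifier for brevity; wiring the same idea into a diffusion model's training loop is more involved). The point above is that the expensive inner attack loop is exactly what pre-poisoned images give you for free:

    # Minimal PGD attack + adversarial-training step (toy classifier, PyTorch).
    import torch
    import torch.nn.functional as F

    def pgd_attack(model, x, y, eps=8/255, alpha=2/255, steps=10):
        """Craft L-infinity bounded adversarial examples for a batch."""
        x_adv = (x + torch.empty_like(x).uniform_(-eps, eps)).clamp(0, 1).detach()
        for _ in range(steps):
            x_adv.requires_grad_(True)
            loss = F.cross_entropy(model(x_adv), y)
            grad = torch.autograd.grad(loss, x_adv)[0]
            x_adv = (x_adv + alpha * grad.sign()).detach()
            x_adv = torch.max(torch.min(x_adv, x + eps), x - eps).clamp(0, 1)
        return x_adv

    def adversarial_training_step(model, optimizer, x, y):
        """Train on the perturbed batch instead of the clean one."""
        model.eval()
        x_adv = pgd_attack(model, x, y)  # normally the expensive part
        model.train()
        optimizer.zero_grad()
        loss = F.cross_entropy(model(x_adv), y)
        loss.backward()
        optimizer.step()
        return loss.item()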
IMO - this kind of tool cuts across a debate that currently involves a lot of people yelling past each other.
If you're an artist, of course you can make changes to your process or your content that modify how it's used. To argue otherwise is analogous to arguing we must always be mindful to stay in the surveillance cameras' view when walking about the streets.
Yes, it's a never-ending back-and-forth game that the obfuscator will probably lose in the long run (though abstractly, an obfuscation technique with >50% adoption could "win" long-term). And yes, it's important to stay apprised of how effective such tools are.
But in the short term, the existence of these tools provides a critical counter-measure to the current narrative, which is basically that everything that can be scraped will be scraped. Returning to the cameras & streets analogy, obfuscation tools are maps that tell us about routes out of view of the cameras (even though these routes may often be blocked off or inconvenient).
Whether you hate AI art or love it, I honestly believe both sides can get behind understanding obfuscation and poisoning and making tools available: those opposed will use the tools, those who want to improve generative AI can learn from the counter-measures, etc. This kind of thing can be part of a healthy deliberative process around these emerging technologies.
I agree. I think that a lot of discourse on this website is grounded in critiquing people's misconceptions about these AI models when, in reality, accepting the concerns of writers, artists, consumers of online content, etc., and offering them even superficial peace of mind is much more conducive to gaining popular support at a broader scale than teaching people how they work.
>> offering them even superficial peace of mind is much more conducive to gaining popular support at a broader scale than teaching people how they work.
While that has been the tried and true formula for tech companies' attitudes towards their users/customers, I'm not sure that will work in this case, since artists are on the supply side of the equation.
>> those who want to improve generative AI can learn from the counter-measures
The fact that people think generative AI will improve its output by learning from the obfuscating counter-measures just drives home the point that the AI is just copying the work of others without their consent.
Or it accelerates the endpoint: cutting the pre-made art out of the process entirely.
AI image generators don't need to see art, they just need to know what blobs of pixels "are" in relation to words. That sort of data can be extracted from just photography of the real world - it just turns out there's a lot less of that easily available and properly tagged than art collections right now.
There's more than enough public-domain examples of "style" to do the rest (and style transfer was one of the original AI image manipulation applications).
Last year's ICLR had a paper, "Data Poisoning Won't Save You From Facial Recognition" that included the Glaze team's previous project, Fawkes. This statement from that paper is quite damning.
"This paper shows that these systems (and, in fact, any poisoning strategy) cannot protect users’ privacy. Worse, we argue that these systems offer a false sense of security. There exists a class of privacy-conscious users who might have otherwise never uploaded their photos to the internet; however who now might do so, under the false belief that data poisoning will protect their privacy. These users are now less private than they were before."
The following contains hyperbole; please interpret words like "everybody" appropriately.
What's annoying about the AI art zeitgeist is how dramatic and frankly mean everybody is about it, on both sides. And then the sheer ignorance about how AI models work, again, from people on both sides of this stupid internet fight-of-the-month (year?).
It's always something on one side of "AI art is the future and artists should stop whining that they're obsolete!" or "AI tech-bros are literally stealing in a way fundamentally different than all prior art consumption!".
Everybody is talking past each other. Using (or not using) an ML model to do anything nowadays is politicized like mask wearing. Glaze is, at its core, a GAN. The key word is "adversarial", and it will be defeated in... already, looks like.
This was cathartic to read. The almost complete lack of calm, productive discussion about AI is honestly frightening.
No matter where you land on this issue, you have to admit that there is a credible chance of significant societal changes in the near future. But instead of discussing how to best manage these changes or how to support people who may be negatively affected, we waste our time with petty squabbles.
It's not exactly a both-sides thing. I don't have a strong opinion on the topic, but the side with the gun is the AI art generator side (not the artists).
The artists are saying "stop" and the AI side is saying "you can't make me". One side would love a pause to have a productive discussion, and the other side doesn't want a pause at all.
One side is horses, the other side is automobiles. Horses continue to exist, but automobiles were the inevitable future. The horse side would have loved the auto side to pause and discuss too, and say 'think of all the horse related jobs, culture, and infrastructure at risk'.
If you don't immediately and intuitively grasp the moral weight of drawing comparisons between animals and humans in this context, perhaps you can at least understand why you're hated for it. "Let's just screw over everybody who we can grind between the gears of our APIs because ha ha, they can't stop us" is not a recipe for a functioning society.
No, both sides are human beings. They are human beings using different tools in the same domain, and the side that wants to prevent the adoption of newer tools rather than adapt to them is…well, historically, that’s not a winning position, basically, ever. (In broad societal terms; if you want to create your own isolated subculture where certain newer tools are excluded, that can work.)
They don’t want to prevent the adoption of new tools, they want to stop their personal creative work being appropriated without compensation or permission to make those tools.
> They don’t want to prevent the adoption of new tools, they want to stop their personal creative work being appropriated without compensation or permission to make those tools.
I think the arguments in this thread for why this is important, by supporters of it, indicate this is wrong. Yes, directly, this aims at preventing style fine-tunes from the work of the artist who uses it (even though styles have never been proprietary). But if you look at the arguments about why this is significant, it’s not anything connected to the copying of individual styles, but to the way the explosion of capacity of AI art generators is transforming the market. That’s the target. Even a magic shield that protected every work the creator wanted to protect, from this day forward, against style fine-tunes (or AI training more generally) wouldn’t deflect that trend even a tiny bit, in my opinion; yet that trend is what this is sociologically aimed at.
I don't think that's fair. The artist community seem to have a genuine and legitimate grievance. Arguing that they're only saying that because their jobs are being affected is, well, not the best looking argument I've ever seen. Surely if they have a legitimate argument and they are suffering economically, that strengthens their argument. How could it possibly weaken it?
I'm not an artist and I'm excited to use AI art generation to realise visuals I can obtain affordably. I just would like to be able to do that ethically.
The problem with the current discourse is that we're hung up on things that are largely irrelevant to providing an actual solution.
Arguments like "my IP is being violated" completely fall apart under any scrutiny and are readily abandoned in favor of some other useful proxy-concerns like "It's just copy and paste, not art!". I refuse to believe that the art community is stupid enough to make arguments like that without some existential dread clouding their judgment.
Do you really think anyone would complain if AI art didn't threaten their livelihood, in a space that usually celebrates inspiration and recognizes that IP rights often only benefit the biggest players? Do you really think a significant portion of that group would advocate for the strictest possible IP laws, which will wipe out a significant amount of human art as well, without some deeper motivation?
The ultimate issue is that people are losing their livelihoods and the ability to engage in work that is meaningful to them. We need to develop social programs that ease their fall, not run circles around the emotionally satisfying arguments that would "really show the other side what's what!". Ironically, a bunch of people are reducing one side to money hungry thieves and the other to lazy luddites below a comment calling out this exact behavior.
> Do you really think anyone would complain if AI art didn't threaten their livelihood, in a space that usually celebrates inspiration and recognizes that IP rights often only benefit the biggest players? Do you really think a significant portion of that group would advocate for the strictest possible IP laws, which will wipe out a significant amount of human art as well, without some deeper motivation?
This is what gets me about this whole argument. When you're agreeing with The Mouse about IP law, you know something has gone awry. In reality, we should remove arcane IP restrictions, not continue to add them further into society. Copyright is already, what, 100+ years on average?
> The ultimate issue is that people are losing their livelihoods and the ability to engage in work that is meaningful to them.
Indeed, people are using a moral argument to mask their deeper intent, their fear of the economic damage done to their livelihood. In this case the solution is not to further calcify their work, but to...solve the actual issue, via UBI and the like.
> In this case the solution is not to further calcify their work, but to...solve the actual issue, via UBI and the like.
Are you going to buy the necessary number of congressmen to make this happen? Because the people being aggrieved certainly can't afford to--and so "well just pass UBI" is the "just draw the rest of the owl" flavor of not-helping.
It'll happen as more and more people start protesting, such that politicians will have to support UBI, and corporations too, once no one has the money to buy their products. The need is simply not as strong currently as it will be in the future. It's the central tenet in Marxist accelerationism theory.
> I don’t think that’s fair. The artist community seem to have a genuine and legitimate grievance.
Insofar as it’s about treating “style” as a proprietary entitlement against the rest of the world, despite the fact that it has never been proprietary, a kind of entitlement to an extension of copyright (which itself is not even notionally a matter of fundamental right but a privilege granted in the expectation of externalized utility), I am not convinced that it is a legitimate grievance, even to the extent it is clearly a genuine one.
> Arguing that they’re only saying that because their jobs are being affected is, well, not the best looking argument I’ve ever seen.
I didn’t argue that. I argued that the motivation is not the threat to their jobs from the thing this would correct even if it worked, but the threat from something which it does not correct, even in the most optimistic view. That is, yes, it’s because “AI art” threatens their existing mode of work, but not because “style transfer based on their current and future artwork” moves the needle, in any meaningful way, on the threat. It is fundamentally (whether they are entitled to their existing mode of work or not) misdirected for the motivating concern.
Firstly, I don’t see that; secondly, so what? If they have a legitimate grievance, that grievance doesn’t go away if they have other concerns as well.
I don’t think style is even the main issue here, although it’s a significant one. You may be right. Without scraping vast troves of copyrighted art, and using it in a way that is not at all clearly fair use, these models wouldn’t be nearly as good at almost anything. But if that’s a legitimate concern, the fact that it might severely hamper these models is the right outcome. If these companies want to use copyrighted works to train them, maybe they should need to license them.
I want you to know that I think you are one of the best posters here. That said:
> historically, that’s not a winning position, basically, ever
In what universe do you think people who are staring down the gunbarrel believe they have any chance to win at all, in anything? When “join up”, if one is even allowed (and let’s be real, they’re not being invited) is also losing?
We are talking about telos here, and tech’s incessant drive to smash it. And this industry offers no alternatives or even the barest minimum of safety to anyone.
It really sucks to be compared to a horse. It goes to the mentality of this that gives me such anxiety. I understand to my boss I am just a cog (horse), but damn… a bit of compassion would be nice before I’m turned to glue.
And the proliferation of cars led to ruining many cities for people like pedestrians (and the destruction of neighbourhoods). Not all technology leads to great outcomes. Cars are useful but we've gone way too far in many cases, and we run the risk of doing the same with AI too.
I think there is a substantial burden of proof necessary for the implicit claim that AI people, in the main, care in the slightest about supporting those who will be negatively affected. (Wringing one's hands about "well we should have UBI" is not caring when one knows it will not happen. It is moral dress-up.)
As the sibling comment notes, only one side here is saying "stop" and only one side is saying "you can't make me."
Most people don't care in the slightest about supporting people that are suffering. Do you have sleepless nights because you know that the device you use right now required slave labor to be constructed? That's the main issue at play here and needs to be remedied by fair and empathic argument - not by railing against a group of people who you paint as unequivocally evil.
Your comment was the exact kind of argument I was writing about. You took the emotionally easy path of thinking about a large group of people as some evil cabal that has the power, unity and desire to destroy the livelihoods of others, instead of actually assessing their arguments.
People who are saying that AI can't be stopped are referring to the fact that even if they personally stopped using it - there would be millions of others who would not have such qualms. If OpenAI shut down tomorrow, new companies would open up the next day. If the government banned all AI, other countries would welcome it and start eating said government's lunch.
Striving towards better social programs is the best way forward, and UBI will be thought of as impossible until a large enough group of people campaign for it. If you disagree with that, then argue against it, but don't construct some boogeyman version of the other side in order to dismiss them.
> Do you have sleepless nights because you know that the device you use right now required slave labor to be constructed?
Yes. Literally yes. It sucks. I try to minimize purchases, I research the most ethical sellers I can within the bounds of available information, and I try to reuse as much as is possible.
> That's the main issue at play here and needs to be remedied by fair and empathic argument - not by railing against a group of people who you paint as unequivocally evil.
When they stop acting like their mission in life is to drive everyone else into precarity and then to ruin, or even at least stop acting like it’s so horrible that people in their gunsights are upset about it, I’ll stop drawing from their actions the obvious conclusions.
UBI is a great way forward. Never said it wasn’t! But. It is clearly not on the radar of the people bankrolling this. If it was? They would expend the money and the political capital to do it. They would be on that goddamned stump. But Sam Altman isn’t out there buying congressmen for UBI. Could! But isn’t. So why would I believe that it’s any kind of priority?
But okay, that’s uncharitable, I’m sure. Altman is busy. So show me who should convince me that there is an argument at all beyond “you can’t stop us”, could you? Show me who is making fat stacks off of the AI craze and is also doing the political work, won’t you? Show me who is going “I am now making bank and I want us to give back structurally.” I have looked. I don’t see them. I want to push those messages. But I am reasonably confident they don’t exist. I’d like to be wrong.
I never said that there is any expectation of putting the genie back in the bottle, either. We do need political action, we do need to be emphatic and we do need to push for solutions (and UBI is a good one), and in so doing we might even fix the problems they create…but that doesn’t change their souls. It just means maybe we can work around them.
My point was that you are strongly judging a group of people whose behavior is not that unusual.
If you genuinely have a bad conscience about the things I outlined, then you are in the vast minority. 99% of people in developed nations don't care about the sacrifices made for their lifestyle, they feel no compulsion to give back or prevent the abuse of others in the name of profit. They may pay lip service, but they won't sacrifice anything to practice what they preach. Reading about modern slavery in cobalt mines will make them feel bad temporarily, but within days they'll be right back shopping for a new smartphone.
That is the unfortunate reality - our intuitive sense of injustice is strongly biased towards locality. We care when something bad happens to the people near and dear to us, but suffering that is far away or somewhat hidden doesn't really register.
We need to face this reality and work with it, instead of getting heated and self-righteous. Sam Altman's soul is neither worse nor better than the one of your average middle-class American.
Instead of assuming evil intent, assume ignorance. Make convincing arguments, highlight the suffering, offer possible solutions - these are the actions that likely lead to improvement.
And to just add a small reminder: AI isn't all bad. It's a disruptive technology, with great risks and great rewards as well.
> Instead of assuming evil intent, assume ignorance. Make convincing arguments, highlight the suffering, offer possible solutions - these are the actions that likely lead to improvement.
I'll be real: the actions that are likely to lead to improvement are buying Congressmen, and I don't have the money. The AI crowd does, and they are positioning themselves to collect ever more of it. If they are in fact Actually Worried about the people downrange of their guns, why is the onus on everybody else? Why are you implicitly holding that normal people are unreasonable for not wanting to be fed into the gears and that it's not the gear-owners' responsibility to not feed people into them?
Don't misunderstand me--I can do a lot of stuff, both tech and not. I'm reasonably confident I'll be fine regardless; if push came to shove, I can enter my disappear-into-the-woods arc. I'm not going to begrudge those who can't refusing to go quietly, and frankly, you shouldn't either. I think you know as well as I do that this crowd isn't looking to talk to the people they seek to grind to dust. They don't think they matter, and if you seriously thought otherwise you'd have answered with examples.
Discussion is a luxury afforded only when all parties are going to be okay at the end of the day. You are, for lack of a better term, tone-policing the scared and the harmed on behalf of the wealthy and the harming. And you are clearly more capable of insight than that.
Agreed, however I'd like to add it's not just art, it's a lot of other facets of the dawn of the LLM(s). Unfortunately I don't see a slowdown in talking past each other. It would be really nice to sit down and rationally talk about what we should do for translators, artists, even programmers, and what we should do as time goes on. Because the hyperbole just creates more anxiety. To be honest, I don't have any good solutions but it would be nice to talk about what can be done.
The ship came and went with translators: nobody cared. The people currently angry didn't care, they didn't even think about it.
The reality is there's a whole group of people right now whose real problem is that they didn't see this coming. AI, as always, was meant to destroy people in minimum wage jobs first. Because they're unskilled, right? That's why they're minimum wage? Right?
But art, creativity. Machines can't do that! It's a classic science fiction trope, how could it be wrong? The irreducible nature of the human spirit is represented in art, it's not paid well but it'll totally be safe from automation.
Bricklayers, electricians and plumbers are going to be making decent money putting up buildings while the white-collar specialists get downsized due to automation. Any job done from a sitting position in front of a computer should be keeping in mind just how replaceable you actually are.
I don’t disagree, but my question still stands. Besides every single desk worker learning a trade, what else can we do? Right? We are saying the same thing. You are right nobody cared when it upended a lot of translation. You are right people didn’t care because it was going to affect “low skill” jobs. Okay, now that we have that out of the way, what is a proposed solution? UBI? That’s fine and dandy, but what does every desk worker on a working visa do? Go home? Is that viable? I’ve said in this post, I don’t have any solutions; all I see is real hurt and anxiety. Should we have cared before? Yes, I get that.
But you gave the answer in the first sentence: that's exactly the solution.
Because let's flip the question: why do white collar desk workers exist? Because we're specialists - we provide a service which is vital and necessary in some way, which justifies the cost and being done from a desk.
The consumers of those services are everyone else, including blue collar workers who are doing physical labor, moving material in the real world, etc.
So why should any of those people subsidize us sitting on our asses when AI can provide the same service cheaper?
I don't think everyone learning a trade is viable for a number of reasons; there are plenty of jobs that can be done, should be done. Trades are great jobs. They are not for every single desk worker right now. So, some extremely fast napkin math and some quick searching: there were 63,644,000 people doing desk jobs of some variety in the US. There probably are not 63,644,000 people's worth of trade jobs. There are trade shortages, and people probably should and will go into them as other jobs disappear. Drivers, plumbers, electricians etc. are all amazing jobs that should be highly valued. That doesn't mean everyone *can* do those jobs, physically or otherwise. You are right that we exist because we provide a service that justifies our cost and being; however, that will be replaced by a single device. I don't know if all of society can be subsidized. While a shitty thing to do, I am just telling you that "learn a trade" is not the answer to this question either.
Ironically, the drama provides an incentive for people using AI art tools productively (which includes artists by trade) to just not talk about it and cause the drama to continue to perpetuate.
[EDIT: 2023-03-19@20:12PDT : s/esteem/respect/g, to match Rawls' usage.]
The anger you notice is not context-free. People are shouty because:
- There are livelihoods at stake
- Worse, there are systems of respect at stake, many of which underwrite people (artists, coders, etc) feeling good about themselves. Rawls is a great reference point for this; feeling like you can contribute meaningfully (in a way that is meaningful for you) is one of the prereqs for social stability (said Rawls: https://academic.oup.com/book/32571/chapter-abstract/2703665...)
- We tech bros (or bras, in my case) do have a notable pattern of disrupting-for-profit. I mean, even middlebrow fare like _Glass Onion_ now casts us as villains at worst, and stooges at best, and it would be good to slow down for a moment to consider whether they might be seeing something real about us. (Ask an NYC taxi medallion holder.)
- Creatives, especially, were already at a profound breaking point, one that is not easily communicated to those outside the disciplines in question; see https://www.goodreads.com/book/show/53935644-the-death-of-th... . It was created by, yes, our industry, and began in the nineties, before at least some of the readership here was born, and picked up steam every decade thereafter.
Much like the early days of climate change reportage in the oil/gas sector, the historic & intersubjectively verifiable externalities of our industry's (frankly incredible) progress feel ego-dystonic. It doesn't seem fair that something as fun as what we do could cause immiseration.
This --- to draw a tidy circle --- is because tech progress is part of our self-respect. And non-harming is part of most non-sociopathic self-respect, including our own. This is, in a word, a threat to our sense of self. A threat to our identity, and worse, a risk of moral injury.
'Moral injury' -- the harm to the self that comes from having done something wrong or harmful to someone else -- is one of the most profound roots of, e.g. PTSD in soldiers, and most humans intuitively avoid it at any cost.
One inefficient way of avoiding it is to avoid admitting that you did anything wrong, for example, by shouting back that you didn't do anything wrong.
In the long term, this works out about as well as you'd expect.
--
A quick closing word about the comments on the parent that advocate a sensible calm sit-down to work out solutions:
People are absolutely not going to have a quiet sit-down about how to fix this problem because we all know full well that very few liberal democracies remain functional enough to stickhandle everyone through such an enormous change. The USA, for example, can barely pass a budget; you think that you can get UBI through? Which would still do nothing about that Rawlsian self-respect, which, again, is just as significant to a healthy human as air, water, or food (and is therefore vital to social stability, in the same way that a steady supply of food is vital.)
Shouting is one of the few avenues left for self-expression, and thus, self-respect. No one wants to go quietly, and 'going' -- whether it be from a job, a career, a family, or even a home -- is quite clearly what is now in the cards for a great many.
And they are shouting at us because we do this shit on the regular. But never like this before. We went large with it. Did we ever.
This is not merely a deeply wise post, but a necessary one, and one that all of us, but especially advocates of AI technologies, need to deeply internalize.
We as an industry have brought immiseration as a primary product to the developed world for quite a while. Shoving human beings "below the API" (coined by Peter Reinhardt, but I heard it first from Venkatesh Rao) is what has gotten the richest in the tech space paid, and the idea that we can adjust our pince-nez and hmm-hmm at the people being actively harmed for not being sufficiently calm about the incipient transition from immiseration to active ruin is a silly idea when it isn't outright insulting.
Thank you <3 I've had to do a lot of my own reckoning recently, for reasons that are personal, but 'willingness to see even uncomfortable things about oneself' has been a big part of my recovery. Here's hoping it can be gentle medicine for others as well.
"The child who is not embraced by the village will burn it down to feel its warmth."
To expand: no side is without sin in this. The art community has inclusiveness problems of its own, and you'll find that many of the loudest anti-AI voices are those privileged and connected enough to have been able to develop their craft and to make a living off it, not the majority of individuals who are struggling to find a place and a manner of expression that is successful (however they define it). The former's derisive calls to "pick up a pencil and draw, if you want to make art" are not altogether different from the advice you would get on deviantArt or some long-since-forsaken forum 20 years ago.
This, of course, is incomplete advice, especially in a quickly-developing art movement, where not having the resources, or the time, or the connections that produce skill or economic mutual aid, or the psychological space and safety to be self-critical and creative, could leave a would-be artist stagnated and frustrated.
Some people are able to find that capability, while some are merely left with years of resentment from trying and failing to break through. And, make no mistake: much of where one ends up has less to do with actual artistic ability, and more to do with one's social standing.
I imagine that many of the people who are engaged with AI art and its development fall into that latter category. Much as with crypto art and NFTs in 2021 and 2022, the wailing and gnashing of teeth from the anti-AI art crowd comes from the end-run that people who were ignored or ostracized by those gatekeepers have been enabled to do around those "systems of respect" (which, as I've explained, were arbitrarily elitist and exclusionary to begin with). Their anger is self-serving, and in some regards cruel, because it's energy that a less egotistical group might have turned towards proving with action the morality of the process they espouse. But now, as before, the goal is not enabling the success of those aggrieved who turn to AI as a balm, but conserving an advantageous status quo. All the shouting does is cement this craven ethos as the true core of their discontent.
AI tools do redistribute the ability to produce art, which might otherwise tend to be limited to a lucky few who get the right education and training.
The problem, however, is that without a professional class, fields tend to drop in stature, and whole fields of human endeavour begin to seem less consequential. This is important to the self-respect argument I outlined above.
I recall an Onion article entitled something like "Area man asks friend if he knows how to get any doctoring work," about a man who (apparently) sees conducting surgery and doing odd-job home repairs as roughly equivalent.
Something like a "doctoring work" market is on its way for the art world, and in the process, it will dramatically change the social significance of artists, the social stature of artists, and, more generally, the possibility of making a defensible 'moat' around one's living as an artist.
The big beneficiaries of this process, aside from Susan, who will suddenly be able to write that novel, will be the guys selling Susan her novel, one token at a time.
Perhaps I'm old fashioned, but I genuinely think this will make Susan's novel less meaningful, both to her and to her readership, as well as making novels in general less meaningful.
All the while lining the pockets of, inter alia, my former employer.[1] I think this is.... not a good change?
[1] Disclosure: I benefitted financially from the advent of contemporary ML. It made my career, frankly. It stands poised to unmake the careers of many of my art friends.
Note that at some point, some people-identifying-as-artists tried to attack Creative Commons. They were making very similar arguments to the arguments we hear against AI today.
Since Creative Commons is commonly used by charities, museums and volunteer organizations, they had some trouble maintaining the moral high ground. The attacks eventually fizzled out. (Even just finding their old texts attacking CC again turns out to be pretty tricky.)
But I still remember something of those attempts.
Just because some people are saying bad things about you, doesn't mean that those things are necessarily true.
The social significance of art has endured in disparate cultures and through hundreds of years of economic transition, and has managed to recover its footing in the cases that it has slipped; it's not at risk. I could not give less of a damn about the social stature of artists, in and of itself; like most people, artists tend to abuse this status within their little fiefdoms, encouraged by the social and economic spoils of such abuse under Capitalism. Those with the most success are those most in need of humility.
What you're talking about is akin to the "professionalization" (or the maintenance of status) of a trade: artificially limiting the breadth of practice in order to direct energy toward quality. This is almost always portrayed as a good thing, when we've lived through an example of it (IT) being disastrous for workers. Professionalization puts up onerous barriers to entry; encourages cronyism; and strips a field of its base of novice workers, as well as a clear and germane path for advancement, replacing it with glad-handing and credentialism.
Collective action - unionization, foremost - would be a better solution in securing the livelihood of artists and the conditions under which art is made, both for those who participate and even for those who don't. But we haven't seen it, or even serious talk of it, and I imagine that it's for the reasons I've given: a central ethos rooted in ego and not principle. And faced with a choice between shoring up the barriers for people who are only out for themselves, and opening the floodgates for expression, I choose the latter.
Strongly recommend that you read the book I linked to above.
'Artist', as a thing that an adult can be and do, has only been around for a hundred or so years (said William Deresiewicz https://www.goodreads.com/book/show/53935644-the-death-of-th...) and there is no reason to think it will be around for a hundred more.
Yes, art has been around for thousands of years, but most of that time, it was beholden to: the church, the power structures of the day, etc.
The idea of 'art for art's sake', the idea of artists empowered to do and make things that their commission did not specifically tell them to do, is not an immortal category of human.
I'm not doing this argument justice, please read William Deresiewicz. <3 What a great book.
Sooner or later people are going to need to work together.
Current AI output looks impressive, but if you push hard you quickly start hitting limits, and you discover you're going to need to hire/cooperate with experts in several fields if you want to take things further.
In future, rather than single artworks, people are more often going to want LoRAs, checkpoints, models, etc. And it already seems like they are willing to pay for these things to be made. There are definitely opportunities for new business models to be had here.
And even if you're technically not actually doing anything wrong according to the law or your personal code of ethics, other people might still get hurt. And then they'll clutch at anything to try and stop you.
My own code of ethics never allowed for that. However, I thought that generating pictures locally and using them in my stories, or showing off cool technology, or getting AI assistance with improving my skill at English, were not things that directly hurt other people.
Now it turns out that this is not the case, as the last couple of months have shown, and I find myself in a situation where my work, my hobbies, and my intrinsic existence are all at odds with the rest of humanity.
Something will have to break. I'm not sure what, yet. I'm tired of being shouted at, but it's too late in my life to completely rethink my skillset.
I really appreciate the honesty, and I want to let you know that it's not just you.
I personally don't work on AI, but I do feel betrayed by the industry that I felt so positive about in 2010 when I decided to join it. There are no easy answers, just a hard part for us to walk and learn from.
There's no learning possible. I've spent years learning to use what is—let's admit it—some immensely cool technology. I use it when writing, when programming, for illustrations... I use tools like ChatGPT daily for learning, or simply to play around with.
If I admit that this was a mistake, and it should never have existed, then that would also be accepting that everything which provides some validation for my existence is a mistake, and I shouldn't be doing that. Yes, I accept the irony, but since I'm unwilling to cease existing, I cannot pick that path.
Funnily, however?
My interactions with the art community have been two-fold. First, people who agree that it's cool, and who enjoy the opportunity to add elements to their art that they couldn't have before; whether that be writers getting illustrations, or artists seeing their works in other styles, or... a whole range of options.
Then, second, there's the people who start shouting at me and calling me evil when I try to bring gifts. Who jump into the personal insults when I'm trying to correct misconceptions. Who make it incredibly difficult to stay polite.
This conversation is largely about the second group. And while I fully understand the worries brought up by GP, they've just about brought me to the point where I cease caring.
I'm Scandinavian. I would never have argued that it's okay to outcompete people. That isn't how our society works. But when a group of people spend this much effort telling me we're two different communities, and they don't care if I go off and hang myself? Well, they shouldn't be surprised if I accept their point of view, and decide they're a different community.
We spent so much time trying to put everyone in the same one, too.
Your complaint seems mostly to be about the tone of the people who are angry at you.
Again, I invite you to consider if that anger occurs in a context that might make it be or seem justified.
For example, many people are a little irate when they are on the cusp of losing nearly everything.
Many people now perceive they are on the cusp of losing everything. Are they, really? I'd say, yeah, probably.
Either way, your feelings about people who are angry at you are not as important as understanding those feelings, and understanding the feelings of the people who are angry at you.
Go for the understanding. "I'm fatigued by the anger" is not an appropriate response to a circumstance where your team may in fact have done something very wrong.
To add some context... Filligree's "sin" was the audacity to use AI-generated art in their fiction writing instead of paying for commissions, and being opposed to a proposal to ban AI art where they posted said writings, the argument for the ban being that "AI art is inherently theft, immoral and evil", with no possibility of ethical use accepted. Funnily enough, the proposal was from someone who earned a lot of money by violating technically the same aspects of copyright, except harder (fan art is much more of a derivative work than training a model, where each individual image might have added ~2 bits to the model).
Personally, I'll add that I do not remember anything other than neutrality or more or less concealed glee from the same corners when the discussion was about how outsourcing and the like would eradicate my source of income, which arguably makes it much harder to feel positive thoughts towards people advocating such proposals.
So, with regards to whether Filligree is with the baddies...
Everything we do is to provide some service faster, better, or cheaper than the existing providers do it, and if we succeed, those people are going to be hurt. Do people not understand this?
I'm going to have to think that one through very carefully. Is there a location where I could come back and discuss this further with you at a later date?
Is it unethical to apply for a job, preventing someone else from getting the position?
I want to know how far you take this 'permits hurting other people', because if me making a picture of Trump riding a unicycle counts then so should basically everything.
No, of course not. These are some of the edge cases I alluded to (did you read both paragraphs?) Political cartoons of erstwhile dictators and competitions for employment are exactly such edge cases.
What you are groping towards is an attempt to find absurd counterexamples that throw doubt on the principle being proposed. This is fine, provided that your interlocutor didn't specifically allow for those counterexamples (which she did).
If, as a general rule, your ethic permits you to ceteris-paribus harm others, you should probably at least examine your ethic.
That this sort of baseline humanism is now controversial speaks volumes about our cultural debasement.
Damn. Why the downvotes? This is probably one of the best HN posts I’ve seen on this topic. It cuts right to the point about the problem with SV and Sam Altman types.
The downvoters are SV and Sam Altman types, which means they share the same fundamental problem: their only desire is playing god unimpeded, anything that gets in their way is automatically in the wrong tautologically.
> People are absolutely not going to have a quiet sit-down about how to fix this problem because we all know full well that very few liberal democracies remain functional enough to stickhandle everyone through such an enormous change. The USA, for example, can barely pass a budget; you think that you can get UBI through?
I'm surprised that you hold Rawlsian self-respect in such high esteem but not Rawls's views on democratic institutions as key components to a fair society unless you think they're separable concepts.
That's not what Rawls is arguing. If you, GP, or anyone disagrees, I'd be happy to discuss philosophy in earnest.
Rawls developed a lot of his ideas as critiques of the political philosophy of his time. One such school was the liberal tradition, works by authors such as Locke, Mill, and Kant, which argued that ensuring individual rights and happiness was the most important aspect of political society. Another school was the dialectical materialist school of authors such as Marx, Althusser, and Gramsci, which argued that political society's goal should be to dissolve structural inequalities which arise from material conditions.
Rawls wanted to strike a middle ground between the two schools and acknowledge both materialist concerns and the need for individuals' rights. A key part of unifying these notions was the idea of self-respect: that the key role of political institutions should be to maximize the self-respect of the individual, not necessarily their rights or their autonomy. Self-respect is a lens that exists for social and political institutions to target, not an extant property of individuals themselves; they exist together [1]. The existence of state institutions is justified as tools to maximize self-respect and justice. As expected, philosophers from both camps criticized Rawls's work: libertarians asserting that self-respect led to instances of infringements of rights, and dialectical materialists claiming that Rawls's ideas would uphold bourgeois inequality the same way democratic societies do now.
It's a problem I find with these sorts of discussions online. People use quotes from political philosophers to lend credence to their ideas without considering the totality of the philosopher's ideas. You can't reduce a philosopher into a pithy quote or tweet.
[1]: If you recognize Lacanian discourse, then you can view equity institutions as the Other with the messy bits of individuality and self-respect shaking out as the Real.
I'm well aware of Rawls' broader position, and I in fact hold an MA in phil with a concentration on Rawls.
Nothing you have said seems to interact with what I've said, other than to suggest, erroneously, that understanding was lacking.
Rawls does indeed suggest that "a key role of political institutions should be to maximize the self-respect of the individual", as you perfectly put it.
We are about to lose the sociopolitical institution that undergirds a huge amount of individual self-respect, namely, economic merit. If we had good sense, we'd stop right here.
(Anticipating the obvious: yes, the market for talent is a sociopolitical institution; if you need more on this, I'm happy to keep talking.)
Speaking of philosophical principles, I'm a big fan of 'the Principle of Charity' -- that one make an effort to parse text in a way that attributes as much wisdom and thoughtfulness to the interlocutor as possible. These days, it's often called 'giving a steelman' of your opponent (a riff on straw man, I would suppose.) https://en.wikipedia.org/wiki/Principle_of_charity
Regardless of what you prefer to call it: Might I suggest that you re-invest in the Principle of Charity? It would make our conversation more interesting. (This is its usual function.)
You claim that shouting must happen because there's a conflict: the idea of individual self-respect (in one's economic value or merit) is eroding while liberal democratic institutions are incapable of dealing with change. This seems deeply contradictory. The Rawlsian reason why liberal democratic institutions exist is to safeguard the idea of individual self-respect. If we find ourselves shouting due to a lack of belief in liberal democratic institutions, then the idea that a system even exists to ensure individual self-respect is contradictory. Liberal democracy has either failed, and we are left with another form of our old friend the Leviathan, or it hasn't, and we must uphold its institutions and ensure individual rights and justice.
I want to emphasize that I feel that liberal democratic societies have deep obligations to ensure some economic stability for their citizens as well. (I draw a lot of inspiration from Rawls though I don't agree with him on everything.) I'm personally a fan of UBI here, but there are alternate proposals which ensure some form of economic dignity for members of liberal democratic societies. But shouting and raging and rejecting democratic institutions is not the way.
> We are about to lose the sociopolitical institution that undergirds a huge amount of individual self-respect, namely, economic merit. If we had good sense, we'd stop right here.
Forgive me for remarking that this sounds remarkably Burkean.
> Anticipating the obvious: yes, the market for talent is a sociopolitical institution; if you need more on this, I'm happy to keep talking.
Indeed, I'd love to see more on this. A Marxist might claim that the idea of economic merit and talent is but a facet of bourgeois society and a post-structuralist that the idea that one is owed economic merit at all is but a construction.
> Regardless of what you prefer to call it: Might I suggest that you re-invest in the Principle of Charity? It would make our conversation more interesting. (This is its usual function.)
Forgive me if I made assumptions; I believe my tone came across as more combative than it was meant to be. But when you gesture at popular culture's disdain for techies as some form of proof of our sins, you make it difficult to take the argument seriously. Moreover, my reply was meant more for the person I replied to than for you directly. You can accuse me of speaking "toward the audience," which I will admit is a bad habit of mine. But I've seen enough poorly understood "Paradox of Tolerance" and "Your fists' rights end where my face begins" shares on the internet that I don't always approach these conversations with the best of faith.
> Forgive me if I made assumptions. But when you gesture at popular culture's disdain for techies as some form of proof of our sins, you make it difficult to take the argument seriously.
They weren't assumptions; they were careless reading. Don't do that again. And yes, popular culture -- art -- matters, and depictions (representations) that have currency also matter. If this makes it hard for you to take me seriously, as you say, then you seem to have entered this conversation with a view that culture is irrelevant to begin with, so why are we arguing about culture?
> Indeed, I'd love to see more on this. A Marxist might claim that the idea of economic merit and talent is but a facet of bourgeois society and a post-structuralist that the idea that one is owed economic merit at all is but a construction.
I'm no Marxist and I'm no post-structuralist. I have no opinion on what constitutes 'merit' in the absolute sense, but I observe that rapidly converting our current order to a new one, where cognition is cheap and provided by LLMs produced by large, central organizations, is probably not going to go well, mostly because, as a practical matter, people's self-respect depends on their role in the economic order. Maybe that can change, but not easily, and not soon, and probably not in time.
> Forgive me for remarking that this sounds remarkably Burkean.
Again, loose reading. I am not claiming that actual merit (how much a human being is worth) has some connection to their economic output. I observe, as a matter of fact, that, in modern western democracies, we act as if this is the case; suddenly changing the rules of the game will have enormous consequences, many of them likely dangerous.
> You claim that shouting must happen because there's a conflict: the idea of individual self-respect (in one's economic value or merit) is eroding while liberal democratic institutions are incapable of dealing with change. This seems deeply contradictory.
I claim the shouting will happen because shouting is a natural human reaction to having one's sense of self-worth upended. And modern liberal democratic institutions are not likely to be able to handle this, yes, because they are sclerotic and polarized.
This is not contradictory; this is 2023.
And yes, a Leviathan moment is precisely what I fear might be coming: a liberal democratic capitalist order that fails over into some form of autocracy or another.
Ah, no, I don't really want to discuss philosophy in earnest. Many people I consider friends are struggling to find a place in the world where they're valued in a way that makes them a living, and I don't think this conversation will help them. Good day.
Thanks for the thought-provoking post. I don't think I share the perspective, but I'm curious to understand it more.
On a personal level, I'm not happy to see a medallion-owning taxi driver lose their livelihood, in the same sense that I'm not happy to see anyone lose their livelihood, e.g., due to economic conditions, due to a company shutting down, or any other reason.
At the same time, there will be winners and losers from any economic or technological shift, and it seems that in our capitalist society we value progress and competition, underpinned by some utilitarian calculus that the short-term thrash is worth it for the long-term prosperity.
Inventing the telephone certainly would have harmed mail couriers, inventing the automobile would have harmed horse breeders, etc., but we don't typically reflect on those inventions as being ethically problematic.
In short, what I'm wondering is whether you think there's something uniquely unethical about the "tech bro" style of disruption, or if your perspective is more broadly a critique of capitalism.
The distinguishing characteristic is scale & speed.
The switch to electricity happened over half a century. The switch to the automobile took a similarly long time. There was crossfade. There was time to adjust. There were places (for socioeconomic values of 'places') for skilled professionals to go.
The switch to Uber (from taxi medallions) seems well underway at just the one-decade mark.
Differences in speed & scale yield differences in qualitative impact, in the same way that a nuclear bomb -- which increases speed & scale of destruction, over and against TNT -- produced a whole new circumstance for world powers.
Comparing what is coming to telephony replacing a courier is like comparing Nagasaki to dynamiting an old tree stump on a farm.
Obviously the anger needs to come from somewhere, but I would argue that we have enough capacity for higher reasoning to temper emotions when necessary - and I don't think raging against some caricature of evil tech workers is helpful here. To be clear, I also think pro-AI people need to get their act together and respond with empathy instead of being dismissive assholes; both sides of this argument should do their best to enable calm, collected thinking in the other participant.
> "There are systems of respect at stake..."
I feel like you're overextending a particular sociological factor beyond all reason. Human beings draw meaning from a great many things, setting meaningful contribution as a necessary factor needs way more justification than just quoting one political philosopher.
You only need to look at our current society to cast doubt on this assertion. We live in a capitalist system that promotes individualism more than anything else. Anyone working in a job right now is acutely aware of the fact that they can be replaced - not with AI but with a different human worker - at any point. We don't live in small villages anymore. Every baker in a city has to cope with the reality that his customers would, without a care, switch to the bakery across the street if he were to close down. Not only that, but many people "contribute" by doing work that is neutral for society at best and actively harmful at worst, as you yourself pointed out.
Yet our society still stands, and has stood for quite some time. People can adjust, they can derive meaning from sources that have been neglected or ignored, and live life just as happily as they did before.
> "A quick closing word..."
I think you're being overly defeatist here. There is a reasonable chance that giant challenges are ahead, but the extent of these challenges is still unknown.
There are a great many possible scenarios where just a little constructive discussion and some social policies may avert most of the negative consequences. The more we do now, the more likely it is that we will persevere in the coming storm and emerge with minimal damage.
I think the industrial revolution is a good comparison, even though I believe that the AI revolution is going to be far more challenging. The industrial revolution caused quite a bit of suffering in the short term, but you and I probably agree that this suffering could have been strongly minimized with the right approach. We can learn from the lessons provided to us from history and sociology, and give it our best shot.
And as for political inertia, we coasted on the "good enough" setting for quite some time now. Looming threats, especially ones where consequences to the individual are easy to understand, can fuel quite a lot of political action.
Suggesting that there is nothing that can be done is usually easy - after all, it absolves us of any need to actually do something - but it is also the best and fastest path towards the worst-case scenario.
EDIT:
I'm not suggesting that it's possible to have a discussion that is completely detached from emotion or the existential dread. The issue that I have is that we are stuck having discussions that are completely orthogonal to the issue at hand. I see pages and pages of heated arguments about copyright, the nature of art and other such trivialities. Discussions about AI inevitably deteriorate into an emotional circlejerk, where showing up the other side is the main motivation. None of that is going to help us deal with the fallout of what's coming.
> and I don't think raging against some caricature of evil tech workers is helpful here.
The caricature here is entirely yours. Your interlocutors aren't raging against evil tech workers, they are railing against tech workers who are studiously blind to their own industry's externalities.
You do the thing for which you blame your adversary.
> I feel like you're overextending a particular sociological factor beyond all reason. Human beings draw meaning from a great many things, setting meaningful contribution as a necessary factor needs way more justification than just quoting one political philosopher.
Sir, this is not a textbook, this is the comments section on a popular Internet website. If you want 'way more justification', you will have to wait for my thesis.
When someone is aware of the damage they cause, and then they still continue, what would you call that behavior? Because the word "studiously" implies that they are aware of what they are doing. You yourself said that tech workers, or "tech bros" as you so nicely put it, are perceived and portrayed as uncaringly and knowingly destroying people's lives. If you think that that is a fair assessment of most tech workers' motivations, then you are constructing a caricature.
Tech workers - like most people - are unaware of the extent of suffering that they don't directly perceive. You and I know that there are people suffering in third-world countries - but knowing is not understanding or awareness, and if we had to spend some time in the company of these people, our perspective would shift enormously and we would feel a much stronger urge to actually remedy these injustices.
Tech workers know that they are displacing workers, but again, knowing is not understanding. Hurling insults and making sly accusations is not going to convince anyone that they may lack perspective on this issue. I mean, take a look around this thread, take a look around other discussions - I don't see many people trying to convey their experience without painting the tech crowd as money hungry assholes.
Not to forget that AI isn't evil either. You can make many moral arguments for developing AI systems as fast as possible. The most emotionally effective point would be medical research - after all, who are we to place our own lives above the ones that may be saved with the help of future AI developments?
And another note: many tech workers argue that it is irrelevant whether or not they personally continue to develop AI, because someone most definitely will.
Also, there is no adversary. I would love for the AI skeptic crowd to make compelling and convincing arguments. We need people that warn about the problems introduced by AI and people that campaign for policies that address them, but we need them to do it in a way that would actually be able to change someone's mind. The reality is that we're not having the right discussions in a productive way; instead we pontificate endlessly on irrelevant but emotionally satisfying details.
> Sir, this is not a textbook, this is the comments section on a popular Internet website. If you want 'way more justification', you will have to wait for my thesis.
Do you really think there is no midpoint between making a massive, fundamental assertion about the stability of every human society without any further elaboration or studies and writing a textbook or thesis?
> When someone is aware of the damage they cause, and then they still continue, what would you call that behavior?
Pretty standard human behaviour, I'd say. Reference points include Exxon, Shell, et al, at any time in the past forty years. The crucial part is that it's possible to be aware at one level and deny the problem thoroughly at another. Please do not tell me I have to prove the existence of the possibility of the psychology of denial? :)
> Because the word "studiously" implies that they are aware of what they are doing.
That is exactly what they are doing. They are constructing themselves as unaware. As Upton Sinclair put it, it's hard to get a man to understand something when his paycheque depends on not understanding it.
> You yourself said that tech workers, or "tech bros" as you so nicely put it, are perceived and portrayed as uncaringly and knowingly destroying people's lives.
I didn't say uncaringly, but yes, for certain values of 'unknowingly', this is the case. Contrived innocence is the order of the day.
One example (there are many): Tech bros in major west coast cities with large homeless populations routinely pass within centimetres of people whose suffering they -- through capitalist competition -- underwrite, whether they want to or not. We have priced whole communities out of their historic neighbourhoods. Evictions are commonplace, and despite the fact that we know they help manufacture unnecessary poverty, they are allowed to persist, in part because we -- the housed professionals -- are the quiet beneficiaries. (See recent HN-trended https://www.newyorker.com/magazine/2023/03/20/matthew-desmon...)
The main psychological tools I see deployed to deal with the so-called "unfortunate" (take a moment to unpack that word) are: aversion, bad faith (in the Sartrean sense,) angry denial (anger is such good anesthetic) and libertarian dogma that falsely localizes the problem either in inevitable facts-of-life or personal failure.
We are now extending that 'read' to include those who are about to be professionally (and perhaps literally) displaced by the GPT revolution.
> If you think that that is a fair assessment of most tech workers' motivations, then you are constructing a caricature.
See, again, you managed to construct a caricature of the position of the person telling you that you'd constructed a caricature.
I'll be honest here, I debated whether or not to reply to this comment. This is now the second time where you completely (and I think deliberately) ignored the main points of my argument and instead replied with a nicely formulated version of "LOL!! SELF-OWN XD" together with a sermon that fails to address anything I said.
You understand that by "you" I was referring to people who hold that perspective, right? As highlighted by the fact that I wrote "are perceived and portrayed" before, not "are perceived and portrayed by you". Should I now tell you that you too managed to construct a caricature of the position telling you that you constructed a caricature? Is that where this series of le epic owns leads? I'll make sure not to include any argument as to why I think it's a caricature either.
Maybe I should add some snarky remark as well, now that'd be a good use of both of our time, right?
Please, bring the temperature down. No one is snarking at anyone.
We are, however, talking through each other.
I'm directly addressing the stance that you have repeatedly taken. I quote:
> When someone is aware of the damage they cause, and then they still continue, what would you call that behavior? Because the word "studiously" implies that they are aware of what they are doing. You yourself said that tech workers, or "tech bros" as you so nicely put it, are perceived and portrayed as uncaringly and knowingly destroying people's lives. If you think that that is a fair assessment of most tech workers' motivations, then you are constructing a caricature.
I don't know how to read "If you think that that is a fair assessment of most tech workers' motivations" as anything other than a claim that you take me to think this is a fair assessment of most tech workers' motivations. You seem to be chasing a 'gotcha' moment, where I have to either admit that tech workers know full well that they are doing something wrong (and are evil, because they keep doing it), or that they did some bad things but didn't really know better (like those folks in developing countries that we could get to know better but don't). This seems to miss, on purpose, the possibility of knowing something at one level and denying it at another, a basic feature of human psychology that (I took it) we had all understood as a possibility when we started talking.
One can be judged for what one refuses to understand. For the understandings that one systemically avoids.
> Tech workers - like most people - are unaware of the extent of suffering that they don't directly perceive. You and I know that there are people suffering in third-world countries - but knowing is not understanding or awareness, and if we had to spend some time in the company of these people, our perspective would shift enormously and we would feel a much stronger urge to actually remedy these injustices.
First, 'third world' is not a phrase you should be using in 2023. It implies rank ordering, and no sociologist would have touched it in the past forty years. Please tweak your language.
Secondly:

> Tech workers know that they are displacing workers, but again, knowing is not understanding.
Avoiding understanding is precisely what we're talking about.
> Hurling insults and making sly accusations is not going to convince anyone that they may lack perspective on this issue. I mean, take a look around this thread, take a look around other discussions - I don't see many people trying to convey their experience without painting the tech crowd as money hungry assholes.
No one is hurling anything. I am calmly answering the points you raise as I take you to have raised them. If I have mischaracterized your position, I am genuinely sorry, and it was not intentional.
But it sure sounds to me like you want to absolve some folks that I don't think qualify for absolution. We may simply disagree as to what falls within one's personal moral ambit. This is a conversation we can have at length. I'm here for it. <3
No, I certainly do wish harm and loss of respect and livelihood upon most "artists". They have not been a force for anything but intellectual onanism and self-aggrandizement for some time now, if ever. They do not (quoting linked book blurb) "sustain our souls and societies".
> Glaze is, at its core, a GAN. The key word is "adversarial", and it will be defeated in... already, looks like.
As for the rant: the AI art horse is being beat to death, and it's boring to see the same arguments made past each other over and over. Sorry, sometimes it feels good to rant.
Can someone explain to me how this achieves the stated goal of "protecting from style mimicry"?
Because what this does is make it harder to train models on specific works. Okay. But if the model is sufficiently generalized, an arbitrary style could simply be described to the model (not by the artist's name, but by technically describing the style) and then reproduced, even if the model never saw any training data in that style.
Come to think of it: If a model is sufficiently generalized, it could reproduce any conceivable style described to it, not just those it didn't have in its training set, but even styles that didn't exist before.
So yeah, mid-to-long-term, how does this "protect from style mimicry"?
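To make that concern concrete, here is a minimal sketch of asking for a style by description rather than by artist name or reference image. It assumes the Hugging Face diffusers library and the public Stable Diffusion 1.5 checkpoint, and the prompt wording is purely illustrative:

    # Minimal sketch: describe a style in plain language instead of naming an
    # artist or supplying a reference image. Assumes the `diffusers` library,
    # a CUDA GPU, and the public "runwayml/stable-diffusion-v1-5" checkpoint.
    import torch
    from diffusers import StableDiffusionPipeline

    pipe = StableDiffusionPipeline.from_pretrained(
        "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
    ).to("cuda")

    # No artist name, no reference image -- just a technical description of a style.
    prompt = (
        "portrait of a lighthouse keeper, loose broad brush strokes, "
        "muted earth-tone palette, heavy impasto texture, moody low-key lighting"
    )
    image = pipe(prompt, num_inference_steps=30, guidance_scale=7.5).images[0]
    image.save("style_by_description.png")

If a prompt like that gets close enough, the perturbation applied to any particular image never enters the picture, which is the commenter's point.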
Most of the complaints about AI programs claim that the generated output has been plagiarized, yet they have struggled to show where any copying has actually been performed. If the uphill battle is this steep this early on, it's going to become next to impossible as more training data is provided and used.
Further, if we're going to accept that style mimicry is enough to categorize something as plagiarized, every artist on the planet will have to be labeled a thief as well.
I don't understand why people haven't thrown in the towel; it's obvious at this point that AI art, by its nature, satisfies an exponentially growing percentage of the demand for artwork. There's no denying that this is a loss for humanity as we cede yet another part of our reality to machines, but at the same time, trying to build elaborate tools like this is a clear waste of time from the get-go, since it doesn't change anything about the core issue.
Does this technology also poison the well when there's already existing training data? If an artist's well-established style that they've honed for years has already been scraped, this isn't particularly useful unless it also begins to disrupt how that artist's style is represented in models built on the existing training set. Better late than never, but if it can poison existing training data, that might be worthwhile.
Glaze has very good intent, but I feel that it's not a very good product. It really needs more of a product offering than just "fuck over the lazy AI training people". I really wish there were a way to do this without covering the image in a layer of oil.
There is: get lawmakers to decide on a standard for machine-readable opt-out of content from web scraping, specifically for images, and then get them to pass laws that make ignoring it illegal.
Some nations already feature legislation close to that, sans the standardization of the format.
I don't see the same chance for attempts to dissuade people by manipulating images, because the latter can lead to an arms race: training workflows get smarter and reverse or otherwise deal with the changed data points, and in response, the technologies changing the data to prevent training have to get better as well.
It’s The Selfish Meme. Your artistic style has as much “drive” to reproduce as your DNA. There seems to be a tug of war happening between the interests of the artist and the interests of the art.
>Losers: University of Chicago, all the students who worked on it, all the artists who believed in it. Protip for the Glaze team: Snake oil is not a panacea for anything and prescribing it does not make you a doctor.
I don't know why you are so aggressively insulting the team who worked on this project. If someone creates an art piece and specifically asks for it not to be included in a model, then they should also have the ability to take the extra precaution of preventing someone from including it in one. Art is by nature a social phenomenon, and so is the creation of the AI art-generating tools. In the end, technology will not "overcome" the social issue; already we are having public debates about the ethics of these technologies and how we can best use them for social benefit instead of social harm. No AI art-generating algorithm could ever have been created without the work of the countless artists the model learned from AND the work of the countless engineers who created those very models. There is an inextricable relationship between the machine output and the human labor involved, and this will not be resolved unless people are willing to work together to create something new and extraordinary. The combativeness is not helpful.
The losing had nothing to do with what they wanted the application to do. I would say being associated with plagiarizing code and breaking licenses is a losing result.
There’s no analog hole, because they are defacing the image.
OTOH, that’s also one of the problems with it for art that is intended to be displayed to humans (they minimize this descriptively, but the full-size examples I’ve seen are bad.)
Did you read the website at all? It discusses that taking a screenshot or adding any other amount of blur, noise, etc isn't a way to circumvent this method.
Of course, no solution prevents everything, but preventing a specific system from using the data in a way that is hard to get around is highly possible.
I also don't see how analog hole is relevant. This is not some novel DRM style, but a way to slightly modify the original image such that it's less useful for AI model training.
It makes sense because the protective mechanism depends on the AI ingesting the picture as-is, with the added noise.
If the ingestion workflow alters the picture sufficiently, the protection could be lost, same as DRM qualities are lost if someone alters the data-stream by recording what is shown on the screen of the display device.
If training AI models on these images is fair game, then artists publishing images that are adversarial to the model is also fair game.
Regardless of your stance on the AI art/IP debate, it seems enormously hypocritical for AI builders to say that they not only have the moral right to use images against the artists’ wishes, but that the artists also have a moral obligation to provide these AI builders with clear, denoised versions of their work.
Edit: if you use a tool to thwart internet data collection systems, thus making your data less useful for advertisers, are the data collectors justified in bitterly calling you a “malicious user”?
I'm not saying that artists shouldn't be allowed to perform this attack. I'm just asking how it should be defended against. E.g., in football, asking "how do I defend my team's quarterback from a blitz?" is not the same as saying "I think the rules of football should be changed to prohibit blitzing".
It is the word “malicious” that added venom to your first question.
If I asked how artists can thwart “malicious” tools like Midjourney, you could reasonably assume that I disapprove of Midjourney and think they ought to stop doing what they do.
That is the way artists often talk about tools like Midjourney. I chose that word exactly to turn the tables and call attention to that implicit venom on their part.
Burglary is bad whether a human or some kind of a robot does it. There's nothing wrong with an art school taking students to a museum to look at established artists' work. Isn't training AI on existing art the equivalent of that? If so, why does it being AI instead of human make it bad? If not, what's the difference?
No, they are not the same. An artist taking time and effort to study and learn a new art style is not the same as a company writing a program that gobbles up every image on the internet. There is a fundamental difference between when a human does it and when a machine does it. I feel like this is obvious and yet people keep making this argument for some reason.
It's not obvious. Like the parent asked, what's the difference when an AI does it versus when a human does? If you're a reductivist like me, you'd see that humans are simply biological computers who are doing the same as an AI. If you think humans are somehow special, then of course you'd think there is some magic in learning a new style of art, magic that an AI doesn't have.
You’re saying it’s not obvious that AI and humans are different, but that’s the default assumption. What’s the affirmative argument for how we’re the same?
Physicalism, or more accurately that we're both made of matter and that the structure of brains gives rise to intelligent behavior in a way analogous to computers. Note that I did not say that computers or neural networks are like the brain's neurons, merely that matter gives rise to computational abilities when organized in particular ways. This is the physicalist approach in philosophy: there is nothing beyond the physical. If this weren't true, then we wouldn't be able to identify parts of the brain that do certain things, and damage to the brain in those areas wouldn't affect the ability to do said things.
Of course, if you are a mind-body dualist or support idealism, you will think that humans are somehow different than other arrangements of matter that give rise to computation.
To be clear, I’m not asking for the argument that it’s possible for a computer to be somehow “the same” as humans. I’m asking for the argument that AI as it presently exists is “the same”. Because you’re not making this argument about dogs or parrots or Markov chains or Microsoft Clippy, but you are making it about generative neural networks.
As someone who does creative work (writing and animation mostly), posts it publicly for free on the Internet, and tries to actively help people learn (by, e.g, answering questions when somebody asked how I did something or what my thought process was), I'm very much in favor of humans learning from something I made, and I'm also very much against a model like ChatGPT or Stable Diffusion learning from something I made. Let me explain my stance:
There's a lot of reasons to help humans that don't exist for machines. The biggest one (and I suspect this is the case for a lot of artists) is that anybody that's trying to learn an artistic endeavor is automatically somebody that I relate to. Learning to do art well is hard. I think any good artist is in a constant state of struggling to improve, so they're going to be very much cognizant of how it feels to struggle to improve, and it's highly likely (or at least my perception is that it's highly likely) that anybody seriously studying somebody else's art and trying to learn from it is experiencing something similar, and has similar goals and motivations. In that situation, it's only human to want to help the other person out.
That doesn't apply to a machine. Stable Diffusion is not living with three roommates with no health insurance and only one pair of pants with no holes in them, earning rent and food off spotty freelance work for random people on Twitter. ChatGPT isn't burning the candle at both ends working a day job and spending long nights trying to piece together a short story good enough to get published in an open call fiction magazine with a 1.1% acceptance rate that pays ten cents a word. There is no toiling human making sacrifices because they were born with an irrepressible urge to create; instead, it's arguable that the primary reason these things exist is to provide some return on investment for the huge companies that own them. Microsoft's dumped billions of dollars into OpenAI. It must be expecting to make that money back somehow, so helping ChatGPT out feels like I'm just making some faceless suit even richer than they already are, which is sort of the exact opposite of helping out a fellow creator.
(There's an argument to be made that things like ChatGPT are providing a net positive to society, so I should feel happy to help train them on that count, but in their current state I don't think they are. That's a separate ramble though, and a tangent I don't really feel like getting into while I'm explaining my stance here).
The outcome I expect from helping a human is also different from the outcome I expect from helping a machine. Humans are different from machines. For a human to copy somebody's style, I think they have to fully understand the process that the other human took to arrive at that result. To do that takes solid fundamentals, and to develop those fundamentals you have to spend a lot of time practicing, which means exposing yourself to new influences. I really do think it's pretty much impossible for somebody to completely copy somebody else's style. When you create something, it's a deeply personal experience. What resonates with you? What image, or theme, or emotion, or experience, spoke to you on such a deep level that it drove you to spend forty minutes trying to animate a foot so it slaps against the ground in just the right way? Is it really likely that any other individual is going to see what you made, and get out exactly what you put into it, without putting any of their own spin on it? In my experience, it doesn't happen at all. So when somebody copies you (as long as the copy was effortful), it's a joyous experience to see what they made. It's interesting and illuminating and fun to see an earnest expression of how somebody else sees your art, to the point where people do stuff like this: https://static.wikia.nocookie.net/85799efa-2901-4cf7-a05d-a5... (I don't have any special connection to WoF, it just happened to pop up when I googled 'draw other artist style meme')
OK, but somebody with solid enough fundamentals could get /close enough/ that it doesn't matter, and then sell to less discerning members of your audience, right?
Well, I guess in principle that's possible, but it just really doesn't happen. Anybody with the drive to get to the point where they could copy somebody else and have it look good probably already has their own style they're proud of. I don't know a single artist that would be happy with a client coming to them and saying "hey, can you draw me something and make it look exactly like <other artist> drew it?". In the worst case they'd tell you to fuck off; in the best case they'd begrudgingly accept your money but probably be insulted and not put in as much effort as they would working on something personal and exciting to them. So another person learning from you isn't infringing on your audience and livelihood. Or if they are, the impact is so small that the positive feeling of helping out someone similar to you outweighs it by quite a bit.
Machines aren't like that. Machines will gladly slurp up your entire body of work, churn through it in training, and then shamelessly crap out a limitless supply of stuff that's rehashes of things you made for somebody that wrote 'by <your name>' in the prompt. You don't even get the warm fuzzies of knowing you helped somebody out, or the cool experience of seeing somebody else's interpretation of your art. It's just an uncanny Frankenstein of your work that's a sordid reminder that a good portion of people out there don't see artists as people, don't engage with art the way you do, and don't want to respect your wishes that the stuff you make not contribute to a cash-and-power shift that's decidedly not in your favor.
> I feel like this is obvious and yet people keep making this argument for some reason.
It's because scores of people coming from non-art domains are suddenly being exposed to the art domain without much real knowledge of working in that domain, find the diffusion tech convincing enough that it entrances them, and makes them believe that their original notions of art are the most important thing that matters.
This debate has made me think, if there's some inability to automate a process, only the people that have spent months/years working in those domains will understand certain things about that process. The rest will be operating on only the beliefs they form from limited exposure. For example, I imagine for a lot of people, "art" is something that magically appears in their Twitter timeline within 5 seconds of opening the app, and years of opening Twitter and seeing new art almost instantly would probably condition some to believe that's all art really is, a thing that's plentiful and can be found near-instantly. So it's not controversial to them when they come across a piece of software that can give them thousands of new pictures near-instantly.
But obviously, prior to around October of 2022, there was nearly always a human behind the art on those timelines. To a human artist, their own work is probably not as plentiful and takes way longer to produce than to consume. Those consumers never needed to interact with the artist or understand the process to appreciate the art privately. Up to now that was usually not a problem.
Maybe that idea will undergo changes in the following years, that the tech is so advanced that a level of respect for the specific process of human artists is necessary for cordial communication of art to succeed. Otherwise it will be disparaged with the equivalent of "opinion discarded" from the nature of the thing.
It makes me wonder, what are the implications of this for domains that haven't been automated to the extent of SD yet? SD has proven that many, many people wanted this level of "art" automation from the beginning. The singular event that made this fact clear was the release of SD. There could be millions of people out there with their own preconceived notions, correct or not, just waiting for the ability to cut out the middleman in their own domains. It's like a powder keg waiting for the tiniest spark of technological progress to be ignited.
For the "respect of artists" angle, I'm reminded of this article[1] about a modern art installation from 12 years ago where people's art contest entries were trampled upon by visitors. People were upset by this.
As Marshall McLuhan once stated, "art is anything you can get away with."
I see what you’re doing but the parallel doesn’t really work. This isn’t an equal struggle between AI builders and artists.
What happened is that AI builders took artists’ images, baked them in GPUs for a few months, and are now selling the output. Again, regardless of your opinion of the IP question, you have to admit that generative AI leverages artists’ own work against their economic interests. If someone’s livelihood is affected by this, of course they will feel venom. That is not parallel to your feeling of venom toward their efforts to stop this from happening to them.
I know the most about stable diffusion. Stable diffusion (based) models are typically open source, you can download them for free, and they easily fit on a cheap USB stick.
They're not actually supposed to have anyone else's pictures in them as such [1].
As far as I can tell, the training is more along the lines of "this is what a dog looks like", "this is what a cat looks like", "this is what a stick looks like" ... etc.
At this point in time, unaltered output is automatically public domain, because it is not made by a human.
There is a fundamental difference between a human taking time to learn a thing and a machine that gobbles up everything it can find in order to replicate it.
Of course there are loads of differences between how humans learn and how AI models learn, from substrate to architecture. I don't see why any of those differences should make a difference to the morality or legality of learning from copyrighted material.
> What are the best defenses for AI builders against malicious tools like this one?
Technically? Incorporate Glaze into the training process.
Glaze takes advantage of an ML model's fragility towards noise and uses that against the model.
The simplest way to incorporate it is to run Glaze on all of the training data whilst duplicating the originals, so the model trains on both the clean and the Glazed copy of each image.
Glaze, reductively speaking, is the adversarial part of a GAN, wherein MidJourney/SD is the generator part of the whole system. What's happening now is that the adversarial part of the GAN is purposefully producing noise to throw the generator off of its rocking horse, and now the generator needs to learn to "remove" said noise when training.
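To sketch what that "train on both copies" idea could look like, here is a rough PyTorch-style illustration. This is not the Glaze authors' method: apply_glaze is a hypothetical stand-in for whatever perturbation tool the trainer wants to become robust to, and the loss is a placeholder rather than a real diffusion objective.

    import torch
    from torch.utils.data import DataLoader

    def train_epoch(model, optimizer, loader, loss_fn, apply_glaze):
        # Adversarial-style augmentation: every batch is duplicated, once clean
        # and once perturbed, so the perturbation stops being a reliable signal.
        model.train()
        for images, captions in loader:                   # captions: token tensors
            perturbed = apply_glaze(images)               # hypothetical perturbation step
            batch = torch.cat([images, perturbed], dim=0)
            targets = torch.cat([captions, captions], dim=0)

            optimizer.zero_grad()
            loss = loss_fn(model(batch), targets)         # placeholder training objective
            loss.backward()
            optimizer.step()

A real diffusion trainer optimizes a noise-prediction objective rather than this placeholder, but the augmentation pattern is the same.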
Calling this "malicious" is a bit telling. It sounds like a thief calling a lock malicious. I know this isn't the typical response here (I hope not) but it does sound like you particularly do think that training on art inputs is theft at some level.
You will never be able to train an image model by asking permission from every artist, unless you are Disney and train on every frame of your video library.
In this case, you don't need to do anything special to work around this tool; just do what everyone is already doing and resize all your training images to a common size, thus defeating Glaze entirely.
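For what it's worth, that kind of preprocessing is already a stock step in most training pipelines. A minimal Pillow sketch follows; whether resizing and re-encoding actually strips the perturbation is the commenter's claim, not something this demonstrates, and the paths and sizes are illustrative:

    # Stock dataset preprocessing: re-encode and resize every image to a common
    # resolution before training. Directory names and target size are illustrative.
    from pathlib import Path
    from PIL import Image

    def preprocess(src_dir, dst_dir, size=512):
        dst = Path(dst_dir)
        dst.mkdir(parents=True, exist_ok=True)
        for path in Path(src_dir).glob("*.png"):
            img = Image.open(path).convert("RGB")
            img = img.resize((size, size), Image.LANCZOS)               # common-size resize
            img.save(dst / path.with_suffix(".jpg").name, quality=90)   # lossy re-encode

    preprocess("raw_images", "training_images")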
Call it piracy, then. Except instead of lone crackers breaking the protection on products of big corporations, it's corporations hoovering up the works of zillions of individuals, running it through their copyright-washing algorithm, and selling the results.
The AI learned about copyrighted characters largely from marketing materials.
So why should corpos be able to force me to watch the ad for, say, the latest Iron Man movie, but at the same time say that I'm stealing from them when I use that ad to train the AI?
Ah, so, the tragedy of the commons: you'll happily pirate every digital good you want, relying on others to make it worth its creators' time and resources to make it. Awesome.
You guessed... poorly. I buy shit - piracy most typically represents a service problem, and in many cases does not in any way represent a lost sale.
The obvious conclusion is that I'm not tricked into thinking that someone is somehow deprived just because you bought into the 'piracy' propaganda. Side note to consider: why is it called piracy? Would many of the arguments offered by people trying to maintain their dragon's hoard of IP and the monies from that IP be weakened if we called it what it was - downloading a copy?
Most people are like this - they will gladly buy the things they enjoy when they can afford to. The scale gets tipped when the cracked version works better than the official version, or when someone lacks the disposable income. It's just really hard to demonstrate harm here because there is no harm.
Provide a cogent argument for why you are entitled to art that a given artist does not want in training sets and is clearly communicating that intent with this tool, in a similar manner to software licensing or copyrighting.
Or just... downvote, lol. best argument ever, HN intellectual debate on full display.