Hahahahaha you sweet summer child. Training code? For an art generator?!
Yeah, no. Nobody in the AI community actually provides training code. If you want to train from scratch you'll need to understand what their model architecture is, collect your own dataset, and write your own training loop.
The closest I've come across is code for training an unconditional U-Net; those just take an image and denoise/draw it. CLIP also has its own training code, though everyone just seems to use OpenAI CLIP[0]. You'll need to figure out how to write a Diffusers pipeline that combines CLIP and a U-Net, and then alter the U-Net training code to feed CLIP vectors into the model, etc. Stable Diffusion also puts a Variational Autoencoder in front of the U-Net, so the diffusion runs in a compressed latent space rather than on raw pixels; that's what buys the higher resolution and training performance, and it's the part I've yet to figure out how to train.
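To make that concrete, here is a rough sketch of what that glue looks like with the diffusers and transformers libraries. The pretrained checkpoint names, the toy U-Net config, and training_step itself are my own illustrative assumptions, not anyone's published training code:

    import torch
    import torch.nn.functional as F
    from diffusers import AutoencoderKL, DDPMScheduler, UNet2DConditionModel
    from transformers import CLIPTextModel, CLIPTokenizer

    # Frozen components: the VAE and CLIP text encoder are trained separately
    # and just loaded here (illustrative checkpoint names).
    vae = AutoencoderKL.from_pretrained("stabilityai/sd-vae-ft-mse")
    tokenizer = CLIPTokenizer.from_pretrained("openai/clip-vit-large-patch14")
    text_encoder = CLIPTextModel.from_pretrained("openai/clip-vit-large-patch14")
    noise_scheduler = DDPMScheduler(num_train_timesteps=1000)

    # The one network actually being trained; cross_attention_dim must match
    # the text encoder's hidden size (768 for CLIP ViT-L/14).
    unet = UNet2DConditionModel(sample_size=32, in_channels=4, out_channels=4,
                                cross_attention_dim=768)

    def training_step(pixel_values, captions):
        # 1. The VAE compresses 256x256 images into 32x32x4 latents; 0.18215
        #    is the latent scaling constant Stable Diffusion uses.
        with torch.no_grad():
            latents = vae.encode(pixel_values).latent_dist.sample() * 0.18215
        # 2. Corrupt the latents with noise at a random diffusion timestep.
        noise = torch.randn_like(latents)
        timesteps = torch.randint(0, noise_scheduler.config.num_train_timesteps,
                                  (latents.shape[0],), device=latents.device)
        noisy_latents = noise_scheduler.add_noise(latents, noise, timesteps)
        # 3. CLIP embeddings of the captions are fed into the U-Net's
        #    cross-attention layers via encoder_hidden_states.
        tokens = tokenizer(captions, padding="max_length", truncation=True,
                           max_length=77, return_tensors="pt").input_ids
        with torch.no_grad():
            text_embeds = text_encoder(tokens).last_hidden_state
        # 4. The U-Net predicts the noise; plain MSE is the training loss.
        pred = unet(noisy_latents, timesteps,
                    encoder_hidden_states=text_embeds).sample
        return F.mse_loss(pred, noise)

Note that only the U-Net gets gradients here; the VAE and text encoder have to come from somewhere already trained, which is exactly the problem.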
The blob you are looking at is the actual model weights. For you see, AI is proprietary software's final form. Software so proprietary that not even the creators are allowed to see the source code. Because there is no source code. Just piles and piles of linear algebra, nonlinear activation functions, and calculus.
For the record, I am trying to train an image generator from scratch using public domain data sources[1]. It is not going well: after adding more images it seems to have gotten significantly dumber, with or without a from-scratch-trained CLIP.
[0] Though I think Google Imagen actually uses a T5 text encoder rather than CLIP.
[1] Specifically, the PD-Art-old-100 category on Wikimedia Commons; the sketch below shows how to enumerate it.
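Since footnote [1] names the Commons category, here is a minimal sketch of listing its files through the public MediaWiki API; requests and the 500-per-page limit are the only assumptions beyond what the footnote says:

    import requests

    API = "https://commons.wikimedia.org/w/api.php"
    params = {
        "action": "query", "list": "categorymembers",
        "cmtitle": "Category:PD-Art-old-100",  # category title per footnote [1]
        "cmtype": "file", "cmlimit": "500", "format": "json",
    }
    titles = []
    while True:
        resp = requests.get(API, params=params, timeout=30).json()
        titles += [m["title"] for m in resp["query"]["categorymembers"]]
        if "continue" not in resp:  # pagination token absent on the last page
            break
        params.update(resp["continue"])
    print(len(titles), "files in category")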
The SD training set is available, and the exact settings are described in reasonable detail:
> The model is trained from scratch 550k steps at resolution 256x256 on a subset of LAION-5B filtered for explicit pornographic material, using the LAION-NSFW classifier with punsafe=0.1 and an aesthetic score >= 4.5. Then it is further trained for 850k steps at resolution 512x512 on the same dataset on images with resolution >= 512x512.
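Those thresholds are mechanical enough to sketch. Assuming LAION-style metadata shards with punsafe and aesthetic columns (the actual column names in the LAION parquet releases may be spelled differently), the first-stage selection is just:

    import pandas as pd

    # Illustrative shard path and column names, not the real LAION schema.
    meta = pd.read_parquet("laion5b-metadata-shard-0000.parquet")
    # Keep rows the NSFW classifier scores below punsafe=0.1 and the
    # aesthetic model scores at 4.5 or above, per the quoted settings.
    kept = meta[(meta["punsafe"] < 0.1) & (meta["aesthetic"] >= 4.5)]
    kept.to_parquet("laion5b-filtered-shard-0000.parquet")

The 512x512 stage described above adds an image-resolution >= 512x512 condition on the same data.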
To my eye, kmeisthax's comment appears to be entirely accurate.
Well, that is to say: assuming the facts listed are accurate, I agree with the conclusion that it's not "open source" at all. (And certainly not Libre.)
The things you said do not describe an open source project.
The point here is that the title of this thing is incorrect. If the ML community doesn't agree, it's because they are (apparently) walking around with incorrect definitions of "open source" and "Free Software" and "Libre Software".