
Some cool stuff from the paper:

• Earlier SD versions would often generate images where the head or feet of the subject were cropped out of frame. This was because random cropping was applied to its training data as data augmentation, so it learned to make images that looked randomly cropped — not ideal! To fix this issue, they still used random cropping during training, but also gave the crop coordinates to the model so that it would know it was training on a cropped image. Then they set those crop coordinates to 0 at test time, and the model keeps the subject centered! They also did a similar thing with the pixel dimensions of the image, so that the model can learn to operate at different "DPI" ranges. (Rough sketch of what that conditioning looks like after this list.)

• They're using a two-stage model instead of a single monolithic model. They have one model trained to get the image "most of the way there" and a second model to take the output of the first and refine it, fixing textures and small details. Sort of mixture-of-experts-y. It makes sense that different "skillsets" would be required for the different stages of the denoising process, so it's reasonable to train separate models for each of the stages. Raises the question of whether unrolling the process further might yield more improvements — maybe a 3- or 4-stage model next?

• Maybe I missed it, but I don't see in the paper whether the images they're showing come from the raw model or the RLHF-tuned variant. SDXL has been available for DreamStudio users to play with since April, and Emad indicated that the reason for this was to collect tons of human preference data. He also said that when the full SDXL 1.0 release happens later this month, both RLHF'd and non-RLHF'd variants of the weights will be available for download. I look forward to seeing detailed comparisons between the two.

• They removed the lowest-resolution level of the U-Net — the 8x downsample block — which makes sense to me. I don't think there's really that much benefit from wasting flops on a tiny 4x4 or 8x8 latent tbh. Also thought it was interesting that they got rid of the cross-attention on the highest-resolution level of the U-Net.
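
For anyone wondering what that crop/size conditioning looks like mechanically, here's a rough Python/PyTorch sketch. The function names and embedding details are my own guesses at the idea the paper describes (sinusoidal/Fourier embeddings of the crop coordinates and original image size, injected alongside the timestep embedding), not SDXL's actual code:

    import math
    import torch

    def sincos_embed(x: torch.Tensor, dim: int = 256) -> torch.Tensor:
        # Standard sinusoidal embedding, same flavor as diffusion timestep embeddings.
        half = dim // 2
        freqs = torch.exp(-math.log(10000.0) * torch.arange(half, dtype=torch.float32) / half)
        args = x.float()[:, None] * freqs[None, :]
        return torch.cat([torch.sin(args), torch.cos(args)], dim=-1)

    def embed_microconditioning(crop_top: int, crop_left: int, orig_h: int, orig_w: int) -> torch.Tensor:
        # Hypothetical helper: embed each conditioning scalar and concatenate,
        # to be fed into the U-Net next to the timestep embedding.
        scalars = torch.tensor([crop_top, crop_left, orig_h, orig_w])
        return torch.cat([sincos_embed(s[None]) for s in scalars], dim=-1)  # shape (1, 4 * 256)

    # Training: tell the model the truth about the random crop it is seeing.
    train_cond = embed_microconditioning(crop_top=57, crop_left=12, orig_h=768, orig_w=512)

    # Inference: claim "no crop, full-size image", so the model keeps subjects framed.
    test_cond = embed_microconditioning(crop_top=0, crop_left=0, orig_h=1024, orig_w=1024)

The nice part is that the inference-time "lie" (crop = (0, 0)) is free: the model never had to unlearn cropping, it just learned to condition on it.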




The two-stage process using different skill sets reminds me of the old painting masters. Often the master would come up with the overall composition, which requires a lot of creativity and judgement of forms; apprentice painters would then come in and paint clothes, trees or whatever, and then the master would finish off the critical details such as eyes, mouth and hands.

It makes sense to have different criteria for what "good" means at each stage.


Not questioning the veracity here (I'm under the same impression), but curious if you have any sources on this; it's so hard to find information on that kind of thing...


Yes, sources on this are notoriously hard to come by, especially since the use of apprentices was an open secret. I did find this source, which talks about Rubens and Rembrandt:

"The structure of the work was usually as following: Rubens sketched and corrected the painting at the very end, and his "staff" was given the whole main stage. To increase the speed and efficiency of work, Rubens shared the duties: some of the pupils painted the background, others were focused on details — they worked on foliage or clothing, the master himself corrected the whole work and execute the most "important" parts — hands and faces."

Link: https://arthive.com/publications/2854~Rembrandt_the_teacher_...


Not the same thing, but this is exactly how manga is made, and you can probably find a lot more about that.


I'd recommend talking to any museum's curator of the old masters during a guided tour; they will be able to tell you a lot about it. Sometimes which parts were "workshop" and which were the master's are also visually identifiable by a schooled eye.


Hardly something anyone will do out of curiosity from reading a forum thread.


The two-stage model sounds a lot like the "Hires fix" checkbox in the Automatic1111 interface. If you enable that, it will generate an image, and then generate a scaled up image based on the first image. You can do the same thing yourself by sending an image to the "Image to Image" tab and then upscaling. If you do it that way, you also have the option of swapping the image model or adding LoRAs.

Presumably the two parts of the SDXL model are complementary: a first pass that's an expert on overall composition, and a second pass that's an expert on details.
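
If you want to play with that split explicitly, the 0.9 research release ships the two stages as separate checkpoints. Here's roughly how they chain together with diffusers; treat the model ids and arguments as a best-guess sketch from the docs, not the definitive recipe:

    import torch
    from diffusers import DiffusionPipeline

    # Base model: does most of the denoising; we keep its output in latent space.
    base = DiffusionPipeline.from_pretrained(
        "stabilityai/stable-diffusion-xl-base-0.9", torch_dtype=torch.float16
    ).to("cuda")

    # Refiner: an img2img-style pipeline specialized for the final low-noise steps.
    refiner = DiffusionPipeline.from_pretrained(
        "stabilityai/stable-diffusion-xl-refiner-0.9", torch_dtype=torch.float16
    ).to("cuda")

    prompt = "a portrait photo of an astronaut, 85mm, shallow depth of field"

    # Stage 1: overall composition, handed off as latents instead of pixels.
    latents = base(prompt=prompt, output_type="latent").images

    # Stage 2: the refiner polishes textures and small details on those latents.
    image = refiner(prompt=prompt, image=latents).images[0]
    image.save("astronaut.png")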


It's not quite the same: the Hires fix in Auto1111 uses the same model twice, while SDXL uses a different model for each stage.


cool



