Neural LMs used to be based on recurrent architectures until the Transformer cam...

WithinReason · 2024-10-14T09:18:08.000000Z

I meant sequential generation, I didn't mean using an RNN.

Diffusion doesn't work on pixels directly either, it works on a latent representation.

kleiba · 2024-10-14T10:18:04.000000Z

All NNs work on latent representations.

barrkel · 2024-10-14T10:49:02.000000Z

The contrast here is real: there are pixel space diffusion models and latent space diffusion models. Pixel space diffusion is slower because there's more redundant information.