If I understand what they did correctly it won’t have the same failure mode of hallucinations that a diffusion model has - it’s not a model that has an understanding of the world, it’s a model that’s really good at turning async per pixel light event data plus blurry rgb into sharp rgb.
That said I don’t understand it very well, for instance there’s a voxel step in the pipeline and I have no idea why.
That said I don’t understand it very well, for instance there’s a voxel step in the pipeline and I have no idea why.