You can't discount all the data they Waymo has collected over nearly a decade or the scenarios they've manually created. They also have the world's most complete map and spatial dataset, which could easily be extended to create a model that creates tricky roadways. Stimulating obstructions or hardware failures doesn't require very much data at all.
If you are modeling scenarios like a game engine, a "discriminator" model isn't necessary: you just check whether a simulation doesn't result in a crash.
I'm not discounting their data, I just think Tesla has so much more. If you were looking at just those opted into the FSD Beta you have a larger fleet actively running the model with feedback loops capturing every failure. But cars without FSD are still running the model and capturing data as well.
If you are modeling scenarios like a game engine, a "discriminator" model isn't necessary: you just check whether a simulation doesn't result in a crash.