Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

To be fair, multiview-consistent diffusion is extremely hard - it's an accomplishment of it's own right to get right, and still very useful. "World model" is probably a misnomer though (what even is a world model?). Their recent work on frame gen models is probably a bit closer to an actual world model in the traditional sense (https://www.worldlabs.ai/blog/rtfm).




They have $230m in funding and some of the best CS/AI researchers in the world. People like Skybox labs have already released stuff that is effectively the same as this with far less capital and resources. This is THE premiere world model company, and the fact their first release is a far cry from the promise here feels like a bit of a bellweather.

I agree RTFM is in more of the "right" direction here, and what is presented here is a bit of a derivative of that. Which makes this release so much more crass, as it seems like a ploy to get platform buy in from users more so than a release of a "world model".

https://www.skyboxai.net/ https://worldgen.github.io/




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: