When manus.ai came out I wondered the same. The "use computer" mode seemed really interesting to me, although I had already seen JAYU [1], which implemented computer use with Gemini.
I also saw a project somewhere (I really don't remember where) that navigated the web browser through layout hints, akin to a Vimium-like experience.
Initially, my impression of computer use was that it only opened Chrome and did things inside Chrome, a Chrome-as-OS experience. I think maybe Cloudflare could do something better here with their Workers?
A per-user instance seems really costly; I really do wonder how they architected it.
[1] https://ai.google.dev/competition/projects/jayu