Hacker Newsnew | past | comments | ask | show | jobs | submit | kodefreeze's commentslogin

great question! we have logic to look for things like when certain network requests are completed, dom loaded, etc as well as a timeout so we are not waiting for ever. The LLM based on the screenshot can also decide to wait longer if the page hasn't fully loaded despite the checks we do.


They have browser automation, and a bunch of other agent tools to manage tasks, do things like PowerPoint slides, etc. I find chatgpt agent mode better for most tasks though.


Interesting that Meta is acquiring a Chinese company.

I was a fan of their initial product but I find it slower than chatpgpt agent mode. And the pricing is not great for individual users.


No, they are no longer; they are now Singaporean companies.


They raised at a ~6B round recently. I could have invested but missed the deadline :(


Do you have support for whatsapp group management?


soon!


I just did this test with our web QA agent - kodefreeze.com, it was able to test creating an account until it reached the screen that requires email confirmation.

Support for being able to receive email/custom actions is on our roadmap, but would love to see if getting this far would be valuable to you. The test was with the email=test@kodefreeze.com.


Thanks for letting us know, we'll fix it.

Please let us know if you have any feedback!


Sorry that you are running into this error, are you seeing this on the marketing website? or somewhere in the app?


How do you build trust in a system like that? The flowchart style have the advantage that you can decide when you want a human to review/approve as well as ensuring actions that need to happen at certain conditions do happen.


Yeah, flowchart style does have that advantage since you can add in approvals and conditions. The tradeoff is you end up limited to simpler logical flows that are easier to verify.

Our take is that trust in agent systems has to be empirical. You start with manual testing and then layer on AI based simulations (we’re adding this in Rowboat soon) to test more scenarios at scale. Splitting work into multiple agents also makes it easier to isolate and test parts separately.


They've being doing some research on this: https://machinelearning.apple.com/research/ferretui-mobile


I didn't see this, thank you! They have a follow-up as well:

https://machinelearning.apple.com/research/ferret-ui-2


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: