Hacker News new | past | comments | ask | show | jobs | submit | MagMueller's comments login

Applying to jobs & getting interviews. Go to WhatsApp and respond to my newest messages. Find leads on LinkedIn and add them to my Salesforce CRM.

You can just try with browser-use. Its open-source and connects to your real browser. So you can just decide for your own safety system.

You can use browser-use as open-source alternative for Operator

possible to use it with R1 for the reasoning part ?

Yeah I saw someone do this on Twitter recently... Can't seem to find it, as you do with twitter

Really interesting! I talked to loop11 (QA testing company) and they think a lot about how companies should change their UI for AI agents. They will soon launch a feature to test your website not only with humans - but also with browser-use. Then you can see where browser-use fails and adopt your UI to make it easier for browser-use, by e.g. having ALT texts and tool-tips ect.

I am the creator of browser-use and build it with the vision that most websites will take a long time to adopt.

One vision behind us is to predict from a website directly higher abstract functions which the agent only needs to call and we execute code to process that.

One interesting thing could be to write into the html directly API descriptions which the agent can you simultaneously.


Yes, this is the report of browser-use with 89%: https://browser-use.com/posts/sota-technical-report

We definitely need a new dataset with more complex tasks, like uploading files, handling multiple tabs, and handling many more steps.


Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: