Hacker News new | past | comments | ask | show | jobs | submit | soham123's comments login

I have built something similar at https://github.com/ComposioHQ/composio/tree/master/python/co...

Compatible with any LLMs and agentic framework


Looks nice. I find the cleaning HTML step in our cleaning pipeline extremely important, otherwise there is no real benefit from just using a general vision model and clicking coordinates (and whole HTML is just way too many tokens). How do you guys handle that?


They showed a really cool literal example of what's coming. it's almost a chatgpt like movement.


Which one? The article has four examples, none of which are particularly "cool" or impressive.

If anything, the examples involving moving the mouse to the address bar or getting csv's of results are very poor examples, because we can already do that much better without "computer use".


Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: