soham123's comments

soham123 · 2024-11-05T18:13:32 1730830412

I have built something similar at https://github.com/ComposioHQ/composio/tree/master/python/co...

Compatible with any LLMs and agentic framework

gregpr07 · 2024-11-05T18:24:06 1730831046

Looks nice. I find the cleaning HTML step in our cleaning pipeline extremely important, otherwise there is no real benefit from just using a general vision model and clicking coordinates (and whole HTML is just way too many tokens). How do you guys handle that?

soham123 · 2024-10-25T13:34:31 1729863271

They showed a really cool literal example of what's coming. it's almost a chatgpt like movement.

diffeomorphism · 2024-10-25T14:23:04 1729866184

Which one? The article has four examples, none of which are particularly "cool" or impressive.

If anything, the examples involving moving the mouse to the address bar or getting csv's of results are very poor examples, because we can already do that much better without "computer use".