Ask HN: Best way to extract web element coordinates from an image?
2 points by MarcelOlsz 5 months ago
I'm building a QA testing agent and everything works, except that its accuracy in locating web elements from an image is subpar. For example, I overlaid various types of grids onto the image, which helped the AI locate elements more accurately, but the results were still always off the mark.
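
To make the grid idea concrete, here is a minimal sketch of how a grid-cell answer could be mapped back to pixel coordinates. It assumes a uniform grid with zero-indexed cells; the function name `cell_center` and the 16x9 grid in the usage note are hypothetical and are not taken from the agent described in this post.

```python
from typing import Tuple


def cell_center(col: int, row: int, width: int, height: int,
                cols: int, rows: int) -> Tuple[float, float]:
    """Return the pixel center of a zero-indexed grid cell.

    Assumes a uniform grid of cols x rows cells laid over a
    screenshot that is width x height pixels.
    """
    if cols <= 0 or rows <= 0:
        raise ValueError("grid must have at least one column and one row")
    if not (0 <= col < cols and 0 <= row < rows):
        raise ValueError("cell is outside the grid")
    cell_w = width / cols   # width of one cell in pixels
    cell_h = height / rows  # height of one cell in pixels
    # The center sits half a cell past the cell's top-left corner.
    return ((col + 0.5) * cell_w, (row + 0.5) * cell_h)
```

For a 1280x720 screenshot with a 16x9 grid, `cell_center(0, 0, 1280, 720, 16, 9)` gives `(40.0, 40.0)`. With this kind of mapping the returned point can be up to half a cell away from the actual element in each direction, so a coarse grid puts a floor on accuracy even when the model picks the right cell.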

Is there some service I can use that will give me accurate coordinates of a web element on a viewport from an image?

I built a small DOM transformation tool that feeds a minimal DOM to my AI as context, and it works very well, but it gets unreliable with iframes and a few other cases. I then pivoted to base64-encoded screenshots, which have a much lower token count, but I can't solve the accuracy problem.



