Surprises me to see I'm the first comment here to say: I just use GPT4 for this. Works perfectly, even for getting the Latex source of a formula you only have a screenshot of.
Probably quite the overkill in terms of energy efficiency for just image to text, but I only need this like once every two weeks or so.
Probably quite the overkill in terms of energy efficiency for just image to text, but I only need this like once every two weeks or so.