Hacker News new | past | comments | ask | show | jobs | submit login
Structured OCR with GPT Vision (binal.pub)
1 point by binalpatel 5 months ago | hide | past | favorite | 1 comment



Here's a direct link to the code: https://github.com/caesarnine/llm-experiments/blob/main/3_st...

Just a Pydantic model to define an extraction schema + a simple prompt works reasonably well.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: