I Built an Open Source LLM-Based Receipt Generator – Here's Why
4 points by maxime_wellapp 28 days ago | 2 comments
If you’ve ever worked with AI models to parse real-world documents like invoices or receipts, you know one truth: good test data is painfully hard to find.

Real receipts are noisy, diverse, and often private. PDF templates are brittle and too clean. OCR outputs are inconsistent. And once you move beyond English or simple formats, it gets even messier.

That’s why I built this:

GitHub: WellApp-ai/ai-receipt-generator

Example output: imgur.com/a/YtFSodj

What’s Next?

Right now it supports:

- OpenAI models (via API)
- Local generation via Faker
- YAML-configured generation flows
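For a rough idea of what the local, non-LLM path can look like, here is a minimal sketch. It is not code from the repo: it uses Python's standard `random` module as a stand-in for Faker so it runs without dependencies, and `make_receipt`, `ITEMS`, `MERCHANTS`, and the field names are all made up for illustration. Amounts are kept in integer cents to avoid float rounding issues.

```python
import random
from datetime import date, timedelta

# Hypothetical catalog and merchant names, purely illustrative.
ITEMS = [("Espresso", 250), ("Croissant", 180), ("Sandwich", 640), ("Bottled water", 120)]
MERCHANTS = ["Cafe Lumiere", "Corner Market", "Station Kiosk"]


def make_receipt(seed, tax_rate=0.20, max_items=3):
    """Build one synthetic receipt as a dict; the same seed gives the same receipt."""
    rng = random.Random(seed)
    # Never ask for more distinct items than the catalog holds.
    count = rng.randint(1, min(max_items, len(ITEMS)))
    lines = []
    for name, unit_cents in rng.sample(ITEMS, k=count):
        qty = rng.randint(1, 3)
        lines.append({
            "name": name,
            "qty": qty,
            "unit_cents": unit_cents,
            "total_cents": qty * unit_cents,
        })
    subtotal = sum(line["total_cents"] for line in lines)
    tax = round(subtotal * tax_rate)
    return {
        "merchant": rng.choice(MERCHANTS),
        "date": (date(2025, 1, 1) + timedelta(days=rng.randrange(365))).isoformat(),
        "currency": "EUR",
        "lines": lines,
        "subtotal_cents": subtotal,
        "tax_cents": tax,
        "total_cents": subtotal + tax,
    }


if __name__ == "__main__":
    print(make_receipt(seed=1))
```

Seeding the generator keeps each receipt reproducible, which helps when a parsing failure needs to be replayed against the exact same input.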

Coming soon:

- Support for Claude, Gemini, Mistral, etc.
- More built-in schema presets
- Predefined prompt templates (by region, industry, language)

We’re also planning to dogfood this internally for auto-evaluations of our own parsing engine.
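One way such an auto-evaluation could be scored is per-field accuracy: compare what the parser extracted against the ground truth that produced each synthetic receipt. The sketch below is an assumption about the approach, not our engine's actual code; `field_accuracy` and the field names are hypothetical.

```python
def field_accuracy(expected, parsed, fields):
    """Return the fraction of receipts where each field was parsed exactly right.

    expected: list of ground-truth dicts (what the generator emitted)
    parsed:   list of dicts returned by the parser, in the same order
    fields:   field names to score
    """
    if len(expected) != len(parsed):
        raise ValueError("expected and parsed must have the same length")
    if not expected:
        return {}
    hits = {field: 0 for field in fields}
    for truth, got in zip(expected, parsed):
        for field in fields:
            # A field missing from the parser output counts as a miss.
            if field in got and got[field] == truth.get(field):
                hits[field] += 1
    return {field: hits[field] / len(expected) for field in fields}


if __name__ == "__main__":
    truth = [{"merchant": "A", "total_cents": 1000}, {"merchant": "B", "total_cents": 2000}]
    output = [{"merchant": "A", "total_cents": 1000}, {"merchant": "B", "total_cents": 2500}]
    print(field_accuracy(truth, output, ["merchant", "total_cents"]))
```

Exact matching is the strictest option; fields like merchant names may call for a fuzzier comparison in practice.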




not available at link as of 21:24 UTC


This link works: https://imgur.com/a/YtFSodj

I think the GitHub link was supposed to be: https://github.com/WellApp-ai/Well/tree/main/ai-receipt-gene...

I'm interested in this if the OP wants to update - my company has a PDF generation API, so I'm interested in seeing LLMs do this directly.



