Nice project for looking for pitchdeck references. Thanks for building and sharing it. I am curious about the tech behind it - are you doing OCR on images? The search is very responsive - it's definitely not elastic search, curious what index/search system are you using?
Glad it helps! There are 4 key steps that I took:
- Upscaling (using Upscayl[0])
- OCR (using tesseract[1])
- Indexing (using Algolia[2])
- Scaling the processing and running on AWS (Klotho[3] - our startup)