Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Show HN: Open-source, citations-first RAG search for Epstein Files (github.com/benbaessler)
1 point by benbaessler 74 days ago | hide | past | favorite
I built an open-source RAG app for searching the Epstein files (36,000+ documents).

Every answer links to the exact source chunk. Click to verify. No black-box AI.

Try it at https://epfiles.ai (10 free messages, then use your own xAI key)

Stack: Next.js + FastAPI + ChromaDB + xAI (Grok).

What's included: - Pre-built ChromaDB (auto-downloads on first run) - Document chunks (~190MB, separate download) - Scripts to regenerate embeddings with your own model - One-command Docker setup

Limitations: - LLM can still hallucinate, citations let you verify fast - Fixed corpus (House Oversight release only) - Requires API keys: OpenAI (embeddings), xAI (generation)

GitHub: https://github.com/benbaessler/epfiles



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: