I have this issue with all the ePubs I have downloaded from archive.org; the converter they use (pandoc?) seems to give bad results for PDF-to-ePub conversion.

If you have a low-quality source, e.g. OCRed scans, then you’ll never get a high-quality ePub. It needs manual intervention and proofing to keep quality up. That's not to say archive.org isn’t useful - I’ve produced a collection of Keats poetry[1] as an ePub from their OCRed source in the past - but the OCRed text isn't really consumable as is.

[1] https://standardebooks.org/ebooks/john-keats/poetry
