The majority of scientific and technical papers are delivered by the PDF file for portable presentation. However, it difficult to read the PDF paper on small screen devices, such as tablets and eBook readers.
So I wrote PDFlower to reflow PDF papers for small-screen reading.
Compared with plain-text extraction based tools, PDFlower reconstructs the object level - columns, formulas, tables and figures - layout using some smart heuristics, and then renders them for new page size.
Great idea and actual pain point. Showing a live or even a video demo will increase sign up significantly IMHO.
Also the "get started now" should lead to the sign up page, not sign in page.
I like minimalist design, but this is a little too minimalist. More pictures, how it works, examples, pricing, will be great. Also no clue if this is an IOS app, android app, mobile web app, desktop app.
So congrats on the building and shipping attitude, but to take advantage of the flow of beta users from HN, I'd say a video / demo and more "faq" is going to increase your conversion tenfold.
Well done on the launch in any case, better launch something than wait till it's perfect.
Example pictures are kind of small, hard to see the quality of the result. Also, I'd use this for a Kindle device, not an iPad, so it'd be also good to see how it looks in a Kindle.
Seconded. I have bad vision; I would love to use this to resize the fonts in single-column PDFs that I read.
Equivalently, I would love to reduce the column width to make zooming easier.
I prefer two-column format specifically because the columns are typically much narrower (less words per line), so I can zoom them to fill the screen and read much more comfortably than a single column document with 50 words per line.
Reading the page, it looks like your document gets put into a queue for processing, and then you can download it after. I imagine they wanted to avoid all the hassles of emails, as well as having accounts to prime the system if they decide to enhance it as a for-pay service.
It probably also stops assholes from using the service on PDFlower's dime for their own needs.
There are plenty of services that put my job in a queue and simply update the page via JavaScript to show progress and deliver the end result. The better implementations identify me via session cookie and recognize me when I check back half an hour later to get my results. That's a much nicer experience than creating another account at some random service that I might never visit again.
Does anyone know an open-source solution for this? What I read is no one's business and I would never used a web-hosted service for something like this.
One of the problems with doing this in pdf is that for most documents, you need to infer reading order. A decade ago I wrote the code in poppler to do that (yet again, based off papers by Breuel) in order to get multi column select working. At the time I wanted pdfs to be readable on my iRex Iliad... anyhoo, most of the pieces are there in poppler. It can figure out reading order, render piece by piece, and already differentiates between images and text under the hood. Still a lot of work.
Yes! All versions can now be easily jailbroken. The main features (to me) are installing KOReader to view epubs and pdf files, and hacking the screen saver display. The process only takes about an hour, mostly because you are reading, doublechecking, and downloading the right files. This guide[0] is a very good overview. I find mobileread's site to be too wordy and difficult to follow.
Which version would you buy if you were buying a Kindle today? I only owned the one before the DX... managed to crack the screen in an unfortunate elbow-related incident.
Paperwhite has been really good to me. Both Oasis and Voyage look great as well - they have higher resolution than the Paperwhite. And they have real buttons to move between pages, so if I were buying a Kindle today I'd buy one of them depending on my budget (Oasis is more expensive, but slimmer).
Current (and previous) generation Paperwhites have the same resolution as the Voyage and Oasis. I have a Paperwhite and the screen is fantastic. I'd love hardware buttons for page turning, though.
While the Oasis is expensive, if you add a nice cover to the Voyage you're close in pricing. Either go solidly midline with the Paperwhite or go all out with the Oasis, depending on your budget.
Thanks! just tried this out, and it seems to work pretty well.
I'm having some trouble using the margin option -m to strip the vertical sidebar text from an arXiv paper, would you happen to have a solution to that problem that doesn't involve a lot of manual intervention? Using '-m 0.5in' handles the vertical text, but then the body text isn't coming out quite right...
Sorry, not at the moment. I figured out those parameters after tweaking things a bit. You might want to start the conversion with just the name of the file, and use the interactive prompt to try out various parameters.
Agreed. Nothing against the described service or its users, but personally when I'm searching for something like this, "upload" comes right before Ctrl-W for me.
The last time I tried to build k2pdfopt, the source zip was missing a bunch of files. You had to download another zip, which wasn't complete either, and combine the results. And then the build failed because cmake was having a bad hair day. I wanted to like it since the windows exe did seem very handy, but had to give up at that point. (Oh, wait, there was a bit more. I did get it building, but the source assumed fprintf(NULL, "log msg") was a perfectly fine thing to do. That's when I gave up.)
Exactly the same here. I won't go through the [maybe] lengthy process of creating a new account to get a paywall then. Same goes with the privacy policy, I don't want to use it with my own PDFs and lose their rights or with other people's and screw things legally.
Very cool. How hard would it be to go in the opposite direction? When I'm using a desktop or a laptop computer it's most annoying that almost all news sites, blogs, eg. optimize for a portrait orientation. I can only see a relatively small amount of text so I have to scroll constantly, despite the acres of available whitespace to either side. It's really annoying when reading anything complex where you'd want to refer back to previous sentences or paragraphs frequently to make sure you've understood the substance properly. I would love to be able to reflow into 3 or 4 columns though...
The PDF format will ignore some structures of paper and `draw` contents on a fixed-layout flat document. For a char, e-readers cannot know it belongs a paragraph, a figure caption or a formula. Plus, publishers render original paper with the specific style. It is a problem. Some research concentrate to improve PDF format, extract infos for scholarly papers, like articles on DocEng - a compute science conference: http://dl.acm.org/event.cfm?id=RE135. Then I read the spec of PDF format...
Unfortunately the kindle is optimizing for users buying Amazon content, not for being the best general purpose eReader. This incentive mismatch between Amazon and the end users is probably why we will never see such a feature built in.
If you jailbreak your Kindle and install Koreader (mentioned upstream) you can have high quality PDF reflow, with support for two-column layouts as well as math and displayed equations, on your Kindle today.
Wow, that's quite the (both) useable and useful software - thank you for the link. I'm not normally part of the "x should buy this out" group, but in this case, Amazon really should.
No, they shouldn't - because if it goes against their business interest (having Amazon-bought books look better than manually uploaded content), they'll shut the project down.
The only reason this software is any good is specifically because it isn't controlled by Amazon. This functionality is very deliberately missing from the stock Kindle.
Interesting and I might have a use for this professionally. But there is no indication on the main page about how it works (web service? code library?). And it forced me to login on the second page, so I closed the tab.
This is something I haven't found an ideal solution for yet. There are existing iOS PDF readers with reflow, but each seems to have its own issues.
To echo what others have said, this is an actual pain point and I'm very willing to spend money on a solid solution. I've bought a few different PDF readers just on the chance their reflow would be better than what I've found elsewhere.
So I clicked on your link pretty ready to throw money at you, but I'm not sure what I'm looking at. Is this an iOS app? Android app? It seems like some kind of service, since it wants me to login, but I can't tell.
Consider working on the messaging and then reposting this. There's definitely a market.
Adobe Reader has a little-known function similar to this. Go to Display > Zoom > distribute (the last entry).
It works reasonably well with simple, clean documents. You can check wether the document is compatible in its property panel.
If you're interested in doing this for sheet music (sort of a similar-ish problem), check out my product soundslice.com. Auto-reflowed sheet music depending on your device width.
So I wrote PDFlower to reflow PDF papers for small-screen reading.
Compared with plain-text extraction based tools, PDFlower reconstructs the object level - columns, formulas, tables and figures - layout using some smart heuristics, and then renders them for new page size.