Hacker News new | past | comments | ask | show | jobs | submit login

Great site, congrats!

One comment is that the slowest page to load was the Gallery [0] as it loads an ungodly amount of PNG files from what appears to be a single IP (a GCP Compute instance?)

I see 421 requests and 150 Mb loaded. As it seems to be mostly thumbnails, have you considered using jpegs instead of pngs, potentially use lazy loading (i.e. not load images outside of the viewport) and potentially use GCP's (or another provider) CDN offering?

Once I clicked a thumbnail, loading the article itself (for example [1]) was quite breezy.

The gallery is a great showcase of what your site does -- I think that it'd be worth making it snappier :-)

Cheers and congrats again

P.S. Also, the paper linked below [1] seems to have a few conversion problems -- I see "EQUATION (1): Not extracted; please refer to original document", and also some (formula? Greek?) characters that seem out of place after the words "and the next token is generated by sampling"

[0] https://papertohtml.org/gallery

[1] https://papertohtml.org/paper?id=02f033482b8045c687316ef81ba...




> One comment is that the slowest page to load was the Gallery [0] as it loads an ungodly amount of PNG files from what appears to be a single IP (a GCP Compute instance?)

Yup. There's no CDN or anything like that right now. We kept things simple to get this out the door. But we definitely intend to make improvements like this as we improve the tool.

The more adoption we see, the more it motivates these types of fixes!

> P.S. Also, the paper linked below [1] seems to have a few conversion problems -- I see "EQUATION (1): Not extracted; please refer to original document", and also some (formula? Greek?) characters that seem out of place after the words "and the next token is generated by sampling"

Thanks for the catch. As you noted there's still a fair number of extraction errors for us to correct!


Another sample paper that caused some trouble with figure extraction: https://www.cs.utexas.edu/~hovav/dist/vera.pdf

Very cool project, looking forward to seeing how it develops!


Thanks, I'll pass this example along!


> have you considered using jpegs instead of pngs

For thumbs of text papers, perhaps a GIF or PNG would be smaller than a JPEG while retaining pixel accurate crispness?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: