Hacker News

What is the algorithm behind this? Does it attempt to find two similar pages on the same website and generate Markdown from the diff of their DOM trees?



No, it's simply a service/tool that converts an HTML page into Markdown. So let's say you have something like this:

  <h1>Title</h1>
  <p>Some wordy stuff <a href="example.com">with a link</a></p>

It would convert it into:

  # Title

  Some wordy stuff [with a link](example.com)


Looks like it runs the page through a readability algorithm. From there, it probably has a pretty straightforward 1:1 HTML-to-Markdown mapping.
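
To make the "1:1 mapping" idea concrete, here is a rough sketch in Python using only the standard library's `html.parser`. This is a guess at the general approach, not the tool's actual code: the class name `MarkdownSketch` and the function `html_to_markdown` are made up for illustration, it only handles headings, paragraphs, and links, and it skips the readability step entirely.

```python
import re
from html.parser import HTMLParser

HEADINGS = ("h1", "h2", "h3", "h4", "h5", "h6")


class MarkdownSketch(HTMLParser):
    """Toy converter (hypothetical): handles only <h1>-<h6>, <p>, and <a href>."""

    def __init__(self):
        super().__init__()
        self.out = []          # pieces of Markdown output
        self.link_text = None  # list of text chunks while inside <a>, else None
        self.href = ""

    def handle_starttag(self, tag, attrs):
        if tag in HEADINGS:
            # <h2> becomes "## ", and so on
            self.out.append("#" * int(tag[1]) + " ")
        elif tag == "a":
            self.href = dict(attrs).get("href") or ""
            self.link_text = []

    def handle_endtag(self, tag):
        if tag == "a" and self.link_text is not None:
            # Markdown links are [text](url)
            text = "".join(self.link_text)
            self.out.append("[{}]({})".format(text, self.href))
            self.link_text = None
        elif tag == "p" or tag in HEADINGS:
            # Block elements end with a blank line
            self.out.append("\n\n")

    def handle_data(self, data):
        if self.link_text is not None:
            self.link_text.append(data)
        else:
            self.out.append(data)


def html_to_markdown(html):
    parser = MarkdownSketch()
    parser.feed(html)
    parser.close()
    markdown = "".join(parser.out)
    # Collapse runs of 3+ newlines left over from whitespace between tags
    return re.sub(r"\n{3,}", "\n\n", markdown).strip()


if __name__ == "__main__":
    page = '<h1>Title</h1>\n<p>Some wordy stuff <a href="example.com">with a link</a></p>'
    print(html_to_markdown(page))
```

Run on the snippet from the comment above, this prints `# Title`, a blank line, then `Some wordy stuff [with a link](example.com)`. A real converter would also need lists, emphasis, code blocks, escaping of Markdown special characters, and so on.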




