From a quick test, it seems to treat almost every bit of content on a page equal...

MojoJolo · on Oct 12, 2013

You are right, I'm not taking account of HTML tags. It is because I extract the text beforehand using Pythoon Goose. In that sense, only the text will be feed in the algorithm without any HTML tags.

nubela · on Oct 12, 2013

Try https://github.com/visualrevenue/reporter :) I'm looking at your service now and it is really massively awesome. Can I ask, if you are considering monetizing it, or going the venture-path (boo)? I ask this because I'm curious on the viability of using your service/library on a long-term project.

ismaelc · on Oct 12, 2013

He's monetizing it as an API here https://www.mashape.com/mojojolo/textteaser