I would like to classify these html documents into categories.
which tool would you recommend to use to classify - NLTK, Solr or something else?