Disclaimer: all puns intended
our algorithm strips out dashes and catches any
occurrence of the query in the title, for example,
'blow' catches 'blowing', 'blowjobs'
2. The "slow,fast" and "love,hardcore" trends illustrate an interesting trend. Perhaps towards women or mainstream viewers.
I don't think so 
Based on the fact that you had to spend your first two weeks doing data entry, it sounds like his experience wasn't unique.
Just talking about it makes me unhappy :)
I sense a business opportunity there.
Teaching and prostitution clearly dominate.
Time to change positions, then!
Next: provide the porn industry a simple markov chain script to generate probabilistic porn movie titles, and save them all those incredibly tiresome brainstrom sessions they must have to create new titles :)
The first thing we found was a copy of the bible and the second thing we found was someones collection of porn stories.
The start of the output was "He slipped his tongue into the lord..."
Not the same dataset though...
I used python, sometimes with Beautifulsoup, sometimes with lxml, both are very good for crawling. I would say BS is easier, and LXML cleaner.
Is the title a British thing? Like maths vs. math?
Also Obama's numbers have really dropped compared to Bush: