We've been testing an app that detects news the second it appears online.
Since our new version went live, one major US news event has happened: the hostage situation at the Discovery Channel.
We broke this story at 1:07PM EST on September 1st, 2010. We need to determine when other sources first published the same story.
Any ideas? How do I go about this?
This is probably a stupid idea, but I've only given it a few minutes of thought, so that's all you get tonight :)
Take automatic snapshots of news pages, like Drudge.com, cnn.com or whatever you want to compare to. Maybe once a minute.
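A rough sketch of the snapshot part in Python, assuming plain HTTP fetches of the front page are enough (no JavaScript rendering). The names `take_snapshot`, `snapshot_forever` and the folder layout are made up for illustration, and the URLs you pass in are up to you.

```python
import time
import urllib.request
from datetime import datetime, timezone
from pathlib import Path


def fetch(url):
    """Download the raw bytes of a page."""
    with urllib.request.urlopen(url, timeout=30) as resp:
        return resp.read()


def take_snapshot(url, out_dir, fetcher=fetch, now=None):
    """Save one copy of the page under out_dir/<host>/<UTC timestamp>.html."""
    now = now or datetime.now(timezone.utc)
    host = url.split("//", 1)[-1].split("/", 1)[0]
    path = Path(out_dir) / host / (now.strftime("%Y%m%dT%H%M%SZ") + ".html")
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_bytes(fetcher(url))
    return path


def snapshot_forever(urls, out_dir, interval_seconds=60):
    """Snapshot every URL roughly once per interval (hypothetical driver loop)."""
    while True:
        for url in urls:
            try:
                take_snapshot(url, out_dir)
            except OSError as exc:  # network errors (URLError is an OSError)
                print(f"snapshot of {url} failed: {exc}")
        time.sleep(interval_seconds)
```

Putting the capture time in the file name means each snapshot carries its own timestamp, which is the only thing you need later to date a story's first appearance.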
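Now hook it up to Mechanical Turk and pose the question as a HIT (human intelligence task, in Turker language): "Does story X appear on this page as captured at 10:00pm?" Workers answer true or false; you keep moving the capture time forward and keep asking the question until someone answers true. That capture's timestamp is when the source picked up the story, to within your snapshot interval.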
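The scan itself is simple enough to sketch. This assumes you already have the snapshots as (capture time, page) pairs; `story_appears` is a hypothetical stand-in for "post a HIT and get back the worker's true/false answer", not a real Mechanical Turk call.

```python
from datetime import datetime


def first_appearance(snapshots, story_appears):
    """Return the capture time of the earliest snapshot showing the story.

    snapshots: iterable of (captured_at, page) pairs.
    story_appears: callable taking a page and returning True/False
                   (placeholder for a human answering the HIT).
    Returns None if no snapshot shows the story.
    """
    for captured_at, page in sorted(snapshots, key=lambda s: s[0]):
        if story_appears(page):
            return captured_at
    return None


# Toy data: the "human" here is just a substring check.
snapshots = [
    (datetime(2010, 9, 1, 13, 6), "Weather: sunny"),
    (datetime(2010, 9, 1, 13, 7), "Weather: sunny"),
    (datetime(2010, 9, 1, 13, 8), "Gunman at Discovery Channel HQ"),
    (datetime(2010, 9, 1, 13, 9), "Gunman at Discovery Channel HQ, update"),
]
found = first_appearance(snapshots, lambda page: "Discovery Channel" in page)
print(found)
```

Every snapshot you ask about costs one HIT, so in practice you would probably start the scan shortly before your own publish time rather than from the start of the day.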