HN,
I would like to share some insight on a stealth mode project that a company founded by college kids have started named Memsparx and the revolutionary implications that are entailed.
Memsparx is creating a machine named WINSTON that can understand human language, reason, and create summaries from abstraction. They are applying this technology to the Media/News industry. This team is doing the impossible - many features from IBM's WATSON machine are in this machine including natural language processing, reasoning, info retrieval and so forth. And they are doing it on top of creating a news summarizing company. The description goes as follows.
The problem: Text summarization is a super hard problem. IBM and only a hand full of companies provide this software (at a high price to financial/medical/etc firms to shorten reports and index them). This current technology takes words, retrieves the definitions of each word, and then can create summaries through statistics and other methods. This software gets the job done but its not good enough. The summaries are not quite proprietary and the machines will typically form awkward sentences out of context. The problem is making a computer (software) understand the CONTEXT that the word is used in (i.e. fly can mean a ton of different things such as | fly through the air | fly through a book | the insect "Fly" etc).
Solution: Memsparx is creating WINSTON. Winston is a super smart software that can not only read (parse) an article but it can truly understand what it is/has read (as well as a comp can!). It can understand what the human language in all of its complexities and nuances are trying to communicate. This is rooted in a number of text mining applications but specifically NATURAL LANGUAGE PROCESSING, INFORMATION RETRIEVAL, REASONING ALGORITHMS, and so forth. Kind of like the popular movie Terminator's Skynet...
Also, the registration system doesn't support the foo+bar@foobar.com format of emails. An extremely minor nitpick, of course.