I worked on Google's statistical machine translation system during my internship. There, I learned that data really is king. The Google Translate team spends as much effort collecting data as it does improving its algorithms.
The 2008 NIST results [1] show that Google's translator swept every category with unconstrained training sets. That is, when Google was allowed to use all of the data it had collected, it smoked the competition. When the training sets were constrained to a common set for all competitors, better algorithms prevailed. You can be sure that the very talented team at Google will be improving their algorithms to ensure that never happens again. But you can also be sure that competitors will be collecting even more data to counter Google's victories.
But who remembers how many webpages Google indexed in 1998, and how many AltaVista indexed? Data is important for spam filtering, translation, and so on, but it is far from "king". We only fancy that data matters more than algorithms because we now have some really good statistical learning methods.
[1] http://www.nist.gov/speech/tests/mt/2008/doc/mt08_official_r...