Hacker News new | past | comments | ask | show | jobs | submit login

Yes, that's true.

I was more thinking of his weird pre- and post-processing like >>> When new mail arrives, it is scanned into tokens, and the most interesting fifteen tokens, where interesting is measured by how far their spam probability is from a neutral .5, are used to calculate the probability that the mail is spam. <<<




Yeah, I agree he has some interesting pragmatic tweaks on it. I suppose he was proposing "[naive Bayesian] filtering" but not necessarily "naive [Bayesian filtering]".




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: