Hacker News new | comments | show | ask | jobs | submit login

You don't, yet when you go to explain how it could be used you include two other signals (came from Twitter, posts many items more than once) in order to make it work. I had originally said "I would say this method would be too sloppy to use by itself. It would need to be a small part of a larger set of signals."

So alright, we aren't on the same page, but you're saying exactly what I just said.

No single heuristic is perfect. How about you tell us your 1 flawless method?

Just because I used 2 different signals to prove to you that the method doesn't generate a lot of false positives, doesn't mean that looking at amazon links is a bad method.

If I was at Pinterest, I would slap a captcha on all Amazon links with affiliates until I had time for a fancier solution, and it would probably get rid of 99% of the spam.

Breaking captcha is easier. Deathbycaptcha, decaptcher - Two of the services that provide clean APIs for the bots and breaks the captchas for you. And on the other end there are even human sitting and cracking the captchas. It doesn't cost much and for the $2k he earns per day, these captcha breaking service is just drops in the ocean.

Guidelines | FAQ | Support | API | Security | Lists | Bookmarklet | Legal | Apply to YC | Contact