Hacker News new | past | comments | ask | show | jobs | submit login

The problem is that the selves bleed into each-other. For instance, making up a completely different city to claim you live in while still talking to other people from your own city is hard.

If the NSA spots half a dozen people on Crypto-Reddit's /r/boston who claim to be from San Francisco, New York, Seattle, Chicago, rural Tennessee, and Miami respectively, then it knows it has probably found six actual Bostonians.

Individuals are in many ways an expression of our context. To remove all information leakage that would tell what our context is or, more simply, who we are, you basically have to remove the person entirely. Even 4chan-grade anonymity won't cut it, since writing style and narrative threading can be used to extract some sense of pseudonymity from even a /b/ thread.




I thought about this problem, but it was more to do with soliciting anonymous feedback from a group of close nit friends. It's a hard problem to fix, but I think maybe running the text through something that reduces it to all lower case and all txtspk, with some sort of grammer-fucker-upper might remove some of the personality attached to the original text input.


Do you think that's NLP-complete, or something we can actually do?




Guidelines | FAQ | Support | API | Security | Lists | Bookmarklet | Legal | Apply to YC | Contact

Search: