Hacker News new | comments | show | ask | jobs | submit login

    DOB: 2012-09-09
    Age: 5
    Language: Russian
    Religion: Muslim 10-15%
    Income: $18466 USD
Hmmm



I'm sure this type of data point exists in real life in terms of identity theft schemes...

edit: from browsing more random "agents", there are a lot of 5 y/o's making tens of thousands of $ per year:

Location Country: United States GPS: (36.073868, -103.923638) Demographics Sex: Male DOB: 2012-04-11 Age: 5 Language: English Religion: Protestant Income: $57834 USD Internet: True

Ahh, to be young.


It works by taking independent distributions and sampling them independently.

So any time you lack cross-correlation data (and age vs income isn't a widely available public data source) it will assume the data is uncorrelated, and you'll get this kind of error.

Whether it reduces the utility of the data depends on the use case. I suspect it often will.

I did think that referencing social networks is a bit off, since this isn't a social network model. We have those, this isn't one.


Looks like there's an off-by-one error with ages




Guidelines | FAQ | Support | API | Security | Lists | Bookmarklet | DMCA | Apply to YC | Contact

Search: