Hacker News new | comments | show | ask | jobs | submit login

It works by taking independent distributions and sampling them independently.

So any time you lack cross-correlation data (and age vs income isn't a widely available public data source) it will assume the data is uncorrelated, and you'll get this kind of error.

Whether it reduces the utility of the data depends on the use case. I suspect it often will.

I did think that referencing social networks is a bit off, since this isn't a social network model. We have those, this isn't one.

Guidelines | FAQ | Support | API | Security | Lists | Bookmarklet | Legal | Apply to YC | Contact