Inverting the question, do you think it would be possible to do accurate geographic clustering based solely on HN voting patterns (ignoring time of vote)? I'm doubtful. I think the problem with calling California the "single largest voting block" is that it's so far from homogeneous. While California probably has a slightly different mix of clusters than other states/countries, I suspect the clusters themselves are essentially non-geographic. It would likely be the biggest fiasco in the history of HN, but it would be wonderful to see what patterns could be pulled out of the private voting data if you were to make it available to researchers.

