How can you beat a linguistic analysis? If you publish elsewhere and someone guesses to compare your work, are you screwed? Are there any programs that scan writing to determine if the writer's english is Canadian or American or British etc? Or maybe your gender? Could you use that to weed out any regional phrases, or use regional phrases from other places to confuse the text? How do you make sure you don't sound the same in your real life, using similar phrases (For example, if Scott Alexander from Slate Star Codex had another blog that was not anonymous, would it be nessecary to not use expressions like 'Steelman' or refer to effective altruism?
Should you look in the academic literature about language, and try to make it so your style can't be detected by theoretical methods of linguistic analysis that haven't yet been implimented computationally?
How do you deal with private communication? Does it make sense to simply have no possible way of privately emailing you, making all communication public (thus giving you plausible deniability if you click any links phishing for your identity). Should you not even interact with public comments?
What about any information you might giveaway even when you are being a VPN or something (browser info? Some kind of computer associated seriel number? internet cookies?). Is it overkill to simply have one device dedicated to researching/blogging, and restricting yourself from doing normal day to day work on that computer? What about a virtual machine?
Can you buy and pay for a domain anonymously?
Should you make a list of things you are willing to reveal about yourself, and stick to it? For example, A/S/L and then make sure never to reveal other details (former locations, trips with dates, schooling, etc) Should you change details of anecdotes if you share them?
If you trust someone, perhaps a girlfriend, or wife, or really good friend, is it too risky to share with them your identity, even if you agree to never discuss any of it digitally? Assuming they also keep a wall between themselves and that identity (not sharing posts, not telling friends, etc) is that safe? If you do break up, should you create a new blog, and if so, is it worth it to make the writing style clearly different from the old blog? Are there any high profile, clearly psuedonymous people who have remained so for long periods of time?
By making this post, should I now do none of these things?
I'm not any kind of expert on this. I don't have personal experience (obviously I wouldn't admit it if I did, but still); I'm just making educated guesses based on my knowledge technology, people, and the way that previous doxxing campaigns have succeeded. I hope someone else can answer your questions better; I don't want to mislead by sounding more authoritative than I should.
I can't think offhand of someone in the Internet era who became famous (for writing/posting) under a pseudonym, where people had incentive to doxx them. That I can't think of such cases is very weak evidence they don't exist; the majority of cases would occur in non-English media which I don't read anyway.
Scott Alexander is a good example of someone who has succeeded in having large and diverse online following behind a pseudonym. But he's not an example example of what you're looking for. He is very weakly pseudonymous, trivial to doxx. He often refers to his personal life on his blog (more so on his Tumblr), his pseudonym is linked to his real name, and many people know him in the flesh as the author of SSC.
There is good information online on correctly, securely using Tor and VPNs to remain anonymous and on securing (and ideally segregating) the computer you're using for this. This information is often targeted to whistleblowers and to people whose thread model is their governments, but it works equally for anyone.
Interacting with public comments on your own blog should be fine if your connection to it is securely anonymized.
You can register a domain anonymously using bitcoin or prepaid CC without providing your contact info, but then the registrar is the owner of record of the domain and you have to trust them. The biggest danger may be that in the event of a dispute or wishing to transfer to a different registrar, you won't be able to assert control over the domain name without revealing your identity. See also: https://en.wikipedia.org/wiki/Domain_privacy
Making a list of things you're willing to reveal about yourself and sticking to it sounds like a good idea: precommit to clear rules, use checklists.
The linguistic analysis questions depend heavily on both your threat model and on future technological development and I don't dare to try answer them.
Linguistic analysis is sometimes called stylometry, and (although I've never tried it) there's a tool to analyze your anon-posts against a corpus of your non-anon language to see how to unique it is and how to anonymize it: https://github.com/psal/anonymouth
My impression is that the average Joe doesn't have enough of a public corpus for this to make a difference. But if you're an academic who blogs both publicly and privately? You might want to check it out.
There have been a several recent cases of political doxing of pseudonymous users. Blogger "Delicious Tacos", alt-right Youtuber "Millenial Woes", /r/the_donald user "HanAssholeSolo", and several members of the alt-right blog "The Right Stuff" have all been exposed at varying levels by various organizations. , As far as I understand, they were all exposed through OPSEC violations in their content, rather than technical violations. In the US, calling out the SWAT team when the target forgets to VPN before logging on to IRC is reserved for black-hats and child pornographers, at least so far.
If you're behind a VPN, any cookies along with your browser fingerprint will still be bright red: https://panopticlick.eff.org/ If what you're doing is really sensitive, do it in a clean VM + Tor. If you're posting, you also need to consider keystroke fingerprinting although with language profiling: https://en.wikipedia.org/wiki/Keystroke_dynamics
Most threat models don't require considering these things.
How can you beat a linguistic analysis? If you publish elsewhere and someone guesses to compare your work, are you screwed? Are there any programs that scan writing to determine if the writer's english is Canadian or American or British etc? Or maybe your gender? Could you use that to weed out any regional phrases, or use regional phrases from other places to confuse the text? How do you make sure you don't sound the same in your real life, using similar phrases (For example, if Scott Alexander from Slate Star Codex had another blog that was not anonymous, would it be nessecary to not use expressions like 'Steelman' or refer to effective altruism?
Should you look in the academic literature about language, and try to make it so your style can't be detected by theoretical methods of linguistic analysis that haven't yet been implimented computationally?
How do you deal with private communication? Does it make sense to simply have no possible way of privately emailing you, making all communication public (thus giving you plausible deniability if you click any links phishing for your identity). Should you not even interact with public comments?
What about any information you might giveaway even when you are being a VPN or something (browser info? Some kind of computer associated seriel number? internet cookies?). Is it overkill to simply have one device dedicated to researching/blogging, and restricting yourself from doing normal day to day work on that computer? What about a virtual machine?
Can you buy and pay for a domain anonymously?
Should you make a list of things you are willing to reveal about yourself, and stick to it? For example, A/S/L and then make sure never to reveal other details (former locations, trips with dates, schooling, etc) Should you change details of anecdotes if you share them?
If you trust someone, perhaps a girlfriend, or wife, or really good friend, is it too risky to share with them your identity, even if you agree to never discuss any of it digitally? Assuming they also keep a wall between themselves and that identity (not sharing posts, not telling friends, etc) is that safe? If you do break up, should you create a new blog, and if so, is it worth it to make the writing style clearly different from the old blog? Are there any high profile, clearly psuedonymous people who have remained so for long periods of time?
By making this post, should I now do none of these things?