Is there an easy way to make this work for a non-programmer, i.e. without installing a whole python environment? I'm less interested in cloning specific voices than in getting a high-quality text-to-speech program that'll read me arbitrary Mandarin input, which it seems like this should be able to do.