I saw that too, but also I thought I read he was a broadcast trained professional voice, so he might have had some decent equipment at home.
I was wondering if the format might also be a factor. The comment above talking about sample rates is the direction I was thinking. Also I remember coming across formats other than the ubiquitous 16-bit LPCM, like 8 bit formats or mulaw and alaw, I don't know enough about those to say this is the difference I hear, but am aware that different encodings exist.
https://arstechnica.com/gadgets/2024/11/the-voice-of-america...