Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Some.

I know that not every letter has been Han-unified, but I don't know the specifics.

Is it possible, that this problem of yours has actually been taken care of?

If not, is it possible that it's just a mistake instead of a big evil conspiracy?



AFAIK it was a historical attempt to save on encoding space back when Unicode had a 16bit fixed width and could only support up to 65K characters.

Now that Unicode has expanded out, I am confused why anyone still defends this practice.


That's just one of the reasons people had (and you can't really expect everyone involved to have totally congruent reasons).

But how about "g"?

Do you really believe the two common variants (one storey/two stories) should have separate code points?

What about German vs. French vs. Danish vs. etc.? All different "g"?

Why? And if not, what is the core difference?


The Cyrillic R (looks like P) and and Greek Roh (also looks like P) and the Latin P are not unified either. Although I think unifying some Greek and Cyrillic letters would have made sense.




Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: