Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
chrismorgan
on Aug 1, 2022
|
parent
|
context
|
favorite
| on:
Character Encoding and UTF-8
… and Han unification means that you’ll often get one code point representing several different “characters”, and you
must
convey the language out-of-band (e.g. via an XML or HTML lang attribute) for the text to be correctly understood, sometimes.
https://en.wikipedia.org/wiki/Han_unification
Consider applying for YC's Spring batch! Applications are open till Feb 11.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
https://en.wikipedia.org/wiki/Han_unification