utf-8 has an elegance that utf-16 lacks: utf-8 decode switch (std::countl_one(un... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

SaddledBounding on Dec 1, 2022 | parent | context | favorite | on: The sad history of Unicode printf-style format spe...

utf-8 has an elegance that utf-16 lacks:

    utf-8 decode
      switch (std::countl_one(unit)) {
        case 0:
          /* one unit */
          break;
        case 2:
          /* two units */
          break;
        case 3:
          /* three units */
          break;
        case 4:
          /* four units */
          break;
        default:
          /* not code point boundary */
          break;
      }

Mikhail_Edoshin on Dec 2, 2022 | [–]

Oh, thanks, this one is good. Must be rather fast on major platforms. But with this optimization UTF-8 still remains harder computationally than UTF-16.

bergesenha on Dec 2, 2022 | [–]

I’m happy every time I see c++20 in the wild

Consider applying for YC's Summer 2026 batch! Applications are open till May 4
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact