Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

In my own experience, the biggest unicode implementation failure was MySQL's choice of using 3-byte BMP Only UTF-8 instead of a full 4-byte Unicode support.[1] In retrospect, their decision to go with a broken subset implementation had caused more trouble, confusions and incompatibilities than their claimed benefits of speed/performance/simplification. Do a simple google search will see almost everyone recommends avoid `utf8` and use `utf8mb4` instead. [2][3][4]

As a result, if your MySQL databases/tables/columns are using `utf8` instead of `utf8mb4` (a MySQL invention) charset, you cannot store / retrieve emoji characters properly.

[1] https://dev.mysql.com/doc/refman/5.5/en/charset-unicode-utf8...

[2] https://mzsanford.wordpress.com/2010/12/28/mysql-and-unicode...

[3] https://mathiasbynens.be/notes/mysql-utf8mb4

[4] https://www.drupal.org/node/1314214



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: