Show HN: A simple hack to highlight questionable unicode (unicode-highlight.herokuapp.com)
5 points by audiodude on Aug 18, 2015 | 3 comments



I'm not sure if "questionable" is the right word.

Try "hello" in Polish: cześć


Yeah, I wasn't sure at first, but on second read, it seems that "questionable" in this context means "may break when piped through a system that thinks in latin-1."

This could be useful, then, for finding curly quotes that make their way into templates or HTML files (often by way of someone pasting text from e.g. Microsoft Word) and then break old, non-Unicode template-parsing utilities.
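
To make the "thinks in latin-1" reading concrete, here is a rough Python sketch (not the tool's actual code; the function name `find_non_latin1` is made up for illustration). It assumes "may break" means "has a code point above U+00FF", since latin-1 only covers U+0000 through U+00FF, and it reports where each such character sits.

```python
def find_non_latin1(text):
    """Return (line, column, char, codepoint) for characters latin-1 cannot encode.

    Hypothetical helper for illustration; latin-1 covers U+0000..U+00FF only.
    """
    hits = []
    # Split on "\n" only, so exotic Unicode line separators are still reported.
    for line_no, line in enumerate(text.split("\n"), start=1):
        for col, ch in enumerate(line, start=1):
            if ord(ch) > 0xFF:
                hits.append((line_no, col, ch, f"U+{ord(ch):04X}"))
    return hits


if __name__ == "__main__":
    # Curly quotes pasted into a template, plus an accented letter that latin-1 can hold.
    sample = "<p>\u201cHi\u201d</p>\ncaf\u00e9"
    for hit in find_non_latin1(sample):
        print(hit)
```

With this check the curly quotes get flagged but the é in "café" does not, because é (U+00E9) fits in latin-1. The ś and ć in "cześć" would be flagged, since both sit above U+00FF.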


Questionable is probably not the right word. The idea is to find characters that you might be surprised are in your document. It's specifically helpful for zero-width or invisible characters, which it replaces with their names.

Future revisions might say, "Most of your characters are European, but you have this one Kanji character. Let's highlight that."

But for now it just highlights all non-ASCII.
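
For anyone curious what that behavior might look like in code, here is a minimal Python sketch of the idea as described above. It is not the site's implementation. The function name `highlight_non_ascii`, the square-bracket "highlight", and the choice of Unicode categories treated as invisible (Cf, Zs, Zl, Zp) are all assumptions made for the example.

```python
import unicodedata

# Assumption: treat format characters and non-ASCII separators as "invisible".
INVISIBLE_CATEGORIES = {"Cf", "Zs", "Zl", "Zp"}


def highlight_non_ascii(text):
    """Wrap every non-ASCII character in brackets; show invisible ones by name."""
    out = []
    for ch in text:
        if ord(ch) < 128:
            out.append(ch)  # plain ASCII passes through untouched
        elif unicodedata.category(ch) in INVISIBLE_CATEGORIES:
            # Fall back to the code point if the character has no name.
            name = unicodedata.name(ch, f"U+{ord(ch):04X}")
            out.append(f"[{name}]")
        else:
            out.append(f"[{ch}]")
    return "".join(out)


if __name__ == "__main__":
    print(highlight_non_ascii("a\u200bb"))          # zero width space between a and b
    print(highlight_non_ascii("cze\u015b\u0107"))   # "hello" in Polish
```

A real page would presumably emit HTML markup instead of brackets, and the "one Kanji among European text" idea would need some per-script counting on top of this.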



