Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The title says ASCII Delimited Text not ASCII Delimited Binary Data.

For the purposes of CSV, I consider text to be anything that satisfies the regex ^\P{Cc}+$ (https://www.compart.com/en/unicode/category/Cc) and I normally strip chars in that category before saving some text (for single-line text). ^[\p{Cc}&&[^\n]]+$ is a regex that can be used to strip all control chars except for the newline.



Thanks for that. Handy. I think I’ll have use for that myself.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: