Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Yes, but it describes the Unicode situation in 2015. Unicode has probably already gained a few more dark corners since then, what with the proliferation of emoji, but that is not reflected in the article.


Oh God yes. Skin tone combining characters, anyone?

http://www.unicode.org/L2/L2014/14173-emoji-skin-tone.pdf

(I was fine with unrealistic, inhuman, Simpsons-style yellow...) I imagine fine gradations of locale-dependent zero-width gender identity modifiers will be added at some point. Unicode is a horror-show that will be producing bugs for decades to come. Every time you see a bug caused by "\r\n" vs. "\n", double-encoded HTML entities, or "smart" quotes, remember that Unicode is orders of magnitude more complex.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: