
One of my favourite dark corners of Unicode is IDS, or Ideographic Description Sequence. It allows you to describe characters that are not encoded in Unicode but can be described by a combination of existing ones.

For example, the Chinese character for the word Biang[1] can be described with:

⿺辶⿳穴⿲月⿱⿲幺言幺⿲長馬長刂心

[1] https://en.wikipedia.org/wiki/Biangbiang_noodles#Chinese_cha...
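To make the structure of that sequence easier to see, here is a minimal sketch of a parser for it. It assumes only the twelve description characters in U+2FF0..U+2FFB (operators added in later Unicode versions are not handled), and the names `IDC_ARITY` and `parse_ids` are made up for this illustration, not taken from any library.

```python
# Arity of each Ideographic Description Character in U+2FF0..U+2FFB.
# U+2FF2 (⿲) and U+2FF3 (⿳) take three operands; the others take two.
IDC_ARITY = {chr(cp): 2 for cp in range(0x2FF0, 0x2FFC)}
IDC_ARITY["\u2FF2"] = 3
IDC_ARITY["\u2FF3"] = 3


def parse_ids(seq):
    """Parse an IDS string into nested tuples: (operator, operand, ...)."""

    def parse(pos):
        if pos >= len(seq):
            raise ValueError("sequence ended while an operand was expected")
        ch = seq[pos]
        pos += 1
        if ch not in IDC_ARITY:
            # Any other character is treated as a leaf component.
            return ch, pos
        operands = []
        for _ in range(IDC_ARITY[ch]):
            node, pos = parse(pos)
            operands.append(node)
        return (ch, *operands), pos

    tree, end = parse(0)
    if end != len(seq):
        raise ValueError("trailing characters after a complete sequence")
    return tree


biang = parse_ids("⿺辶⿳穴⿲月⿱⿲幺言幺⿲長馬長刂心")
```

Read as a tree, the sequence says: 辶 wraps around a three-tier stack of 穴, a middle row, and 心, where the middle row is 月, a two-tier block (幺言幺 over 長馬長), and 刂 side by side.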



> "Made up of 58 strokes in its traditional form (43 in simplified Chinese)..."

I'm glad I don't have to try to type that with 8 fingers and 2 thumbs. Writing Chinese with a keyboard sounds insane: http://www.slate.com/articles/news_and_politics/explainer/20...


The very first input scheme the article talks about is Pinyin, with which you'd input biang by typing... "b" "i" "a" "n" "g" then selecting from a list (most likely a list of length 1). Not insane or even difficult.

(Except that biang is not encoded in Unicode yet so you can't type it anyway.)
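The type-then-select flow can be sketched in a few lines. This is a toy illustration only: the `CANDIDATES` table and both function names are hypothetical, and a real input method ranks candidates by frequency and context rather than using a fixed list.

```python
# Hypothetical toy table: toneless pinyin syllable -> candidate characters.
CANDIDATES = {
    "ma": ["妈", "马", "吗", "麻", "骂"],
    "shi": ["是", "十", "时", "事"],
}


def candidates_for(typed):
    """Return the candidate list for the Latin letters typed so far."""
    return CANDIDATES.get(typed.lower(), [])


def commit(typed, choice):
    """Return the chosen candidate, or the raw letters if there is none."""
    options = candidates_for(typed)
    if not options:
        return typed  # nothing to convert, so the Latin letters stay as typed
    return options[choice]
```

With this table, `commit("ma", 1)` picks 马, while a syllable with no entry simply stays as typed.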


Think about it, though. You type phonetically in one script, then select a character in another script that might be pronounced in a similar way. That's like entering Hangul syllables that sound similar to what you want, then choosing the right English character sequences.


> Think about it, though

What's there to think about? How else would you input a script with more than 50k characters?

> You type phonetically in one script, then select a character in another script that might be pronounced in a similar way.

Sure. Japanese works the same way, you input in kana or romaji, then select the suitable kanji (or kanji sequence).

Of course it only works when you have a regular phonology; that would be completely impossible for English, since by and large orthography and pronunciation have no relation.


I've thought about it. I input Japanese every day, Chinese frequently, and Korean on occasion. The Latin alphabet is a first-class citizen in these languages; I don't see the issue.

In fact English is becoming the same way: when you input "apple" and choose the [U+1F34E RED APPLE; stripped from input on HN] suggestion you've done exactly the same thing.


Is there any software which actually renders that? Will browsers do it?

The Wikipedia page just uses images for the characters; I can't seem to find any actual example.


As far as I know, there's nothing that will render them. There used to be a web-based renderer, but I can't seem to find it now.


Wikimedia is testing an IDS extension (https://www.mediawiki.org/wiki/Extension:Ids) for Wikisource at https://ids-testing.wmflabs.org. It is still preparing for production (https://phabricator.wikimedia.org/T148693).



