[linux-dvb] Re: DVB character coding...



"Robert Schlabbach" <robert_s@gmx.net> writes:

> But what is coding 0x14? It is described as using the "Big5 subset"
> of Unicode. What does that mean? Is it encoded as bytes with escape
> sequences, 16-bit words or something else?

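Big5 is a common encoding for traditional Chinese, used mainly in
Taiwan and Hong Kong.  IIRC it supports us-ascii and Chinese
characters: bytes with the high bit clear are us-ascii, and a byte
with the high bit set starts a two-byte sequence that encodes one
Chinese character.  So it works a bit like utf-8, except that the
second byte of a pair can fall into the 7-bit range.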

It has nothing to do with unicode (and is older than unicode); the
"subset" probably refers to the fact that the characters you can
encode with big5 are a subset of what you can do with unicode.  Many
of the 8-bit characters of the latin* encodings (e.g. german umlauts)
can't be encoded in big5, for example.
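
The subset relation can be checked directly.  The sketch below again
relies on Python's built-in "big5" codec (an assumption about which
Big5 mapping table is in use; vendor variants differ), and the helper
name fits_in_big5 is hypothetical.

```python
def fits_in_big5(text):
    """Return True if every character of text can be encoded in Big5."""
    try:
        text.encode("big5")
        return True
    except UnicodeEncodeError:
        return False


print(fits_in_big5("中文"))  # Chinese characters
print(fits_in_big5("ü"))     # a german umlaut
```

Going the other way always works: anything that decodes from Big5 ends
up as ordinary unicode text.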

iconv knows big5, so converting should be easy.
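
On the command line that would be something like
"iconv -f BIG5 -t UTF-8".  As a rough sketch of the same conversion in
code, the snippet below uses Python's built-in "big5" codec instead of
iconv.  It assumes that the leading 0x14 byte from the question only
selects the coding and that the rest of the string is plain Big5 bytes;
that is a guess, not something checked against the DVB spec.  The names
BIG5_SELECTOR and decode_dvb_big5 are made up for illustration.

```python
BIG5_SELECTOR = 0x14  # coding byte from the question (assumed meaning)


def decode_dvb_big5(raw: bytes) -> str:
    """Decode a string that starts with the 0x14 coding byte.

    Assumption: everything after the first byte is plain Big5.
    """
    if not raw or raw[0] != BIG5_SELECTOR:
        raise ValueError("string does not start with the 0x14 coding byte")
    return raw[1:].decode("big5")


# Build a sample string: coding byte followed by Big5-encoded text.
sample = bytes([BIG5_SELECTOR]) + "中文".encode("big5")
print(decode_dvb_big5(sample))
```

The result is a normal unicode string, which can then be written out as
utf-8 or whatever the application needs.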

  Gerd

-- 
#define printk(args...) fprintf(stderr, ## args)



