HL7 - Version 2.8


Table 0211: Alternate Character Sets

Interpretation: Character set
OID Table: 2.16.840.1.113883.12.211
Codesystem OID: 2.16.840.1.113883.18.116 Version 5
Value Set OID: 2.16.840.1.113883.21.128
Binding: universal
Expansion: all codes from codesystem
Vocabulary Domain: tbd
Table Type: HL7
Case insensitive: False
Steward: InM
Status: Active
Section 2.C.2.148

Value Description German Interpretation Comment Chapter
ASCII The printable 7-bit ASCII character set.
(This is the default if this field is omitted) 2.C.2.148
8859/1 The printable characters from the ISO 8859/1 Character set 2.C.2.148
8859/2 The printable characters from the ISO 8859/2 Character set 2.C.2.148
8859/3 The printable characters from the ISO 8859/3 Character set 2.C.2.148
8859/4 The printable characters from the ISO 8859/4 Character set 2.C.2.148
8859/5 The printable characters from the ISO 8859/5 Character set 2.C.2.148
8859/6 The printable characters from the ISO 8859/6 Character set 2.C.2.148
8859/7 The printable characters from the ISO 8859/7 Character set 2.C.2.148
8859/8 The printable characters from the ISO 8859/8 Character set 2.C.2.148
8859/9 The printable characters from the ISO 8859/9 Character set 2.C.2.148
8859/15 The printable characters from the ISO 8859/15 (Latin-9) 2.C.2.148
ISO IR6 ASCII graphic character set consisting of 94 characters.
http://www.itscj.ipsj.or.jp/ISO-IR/006.pdf 2.C.2.148
ISO IR14 Code for Information Exchange (one byte)(JIS X 0201-1976).
Note that the code contains a space, i.e., "ISO IR14". 2.C.2.148
ISO IR87 Code for the Japanese Graphic Character set for information interchange (JIS X 0208-1990).
Note that the code contains a space, i.e., "ISO IR87".

The JIS X 0208 needs an escape sequence. In Japan, the escape technique is ISO 2022. From basic ASCII, escape sequence "escape" $ B (in HEX, 1B 24 42) lets the parser know that following bytes shoul

2.C.2.148
ISO IR159 Code of the supplementary Japanese Graphic Character set for information interchange (JIS X 0212-1990).
Note that the code contains a space, i.e., "ISO IR159". 2.C.2.148
GB 18030-2000 Code for Chinese Character Set (GB 18030-2000)
Does not need an escape sequence. 2.C.2.148
KS X 1001 Code for Korean Character Set (KS X 1001) 2.C.2.148
CNS 11643-1992 Code for Taiwanese Character Set (CNS 11643-1992)
Does not need an escape sequence. 2.C.2.148
BIG-5 Code for Taiwanese Character Set (BIG-5)
Does not need an escape sequence.

BIG-5 does not need an escape sequence. ASCII is a 7 bit character set, which means that the top bit of the byte is "0". The parser knows that when the top bit of the byte is "0", the character set is ASCII. When it is

2.C.2.148
UNICODE The world wide character standard from ISO/IEC 10646-1-1993
Deprecated. Retained for backward compatibility only as of v2.5. Replaced by specific Unicode encoding codes. 2.C.2.148
UNICODE UTF-8 UCS Transformation Format, 8-bit form
UTF-8 is a variable-length encoding: each code value is represented by 1 to 4 bytes, depending on the code value. 7-bit ASCII is a proper subset of UTF-8. Note that the code contains a space before UTF but not before and after the hyphen. Since UTF-8 re 2.C.2.148
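
The byte-level remarks in the comments above (the ESC $ B switch into JIS X 0208, the top-bit test that separates 7-bit ASCII from multi-byte data, and the variable length of UTF-8) can be checked with a short Python sketch. It assumes that Python's built-in `iso2022_jp`, `big5`, and `utf-8` codecs are reasonable stand-ins for the table entries ISO IR87, BIG-5, and UNICODE UTF-8. The helper names are hypothetical and not part of HL7.

```python
# Sketch only: helper names are illustrative, not defined by HL7.

# ESC $ B (hex 1B 24 42), the switch into JIS X 0208 named in the table.
ESC_JIS_X_0208 = b"\x1b$B"


def is_seven_bit(data: bytes) -> bool:
    """Return True if every byte has its top bit clear (7-bit ASCII range)."""
    return all(byte & 0x80 == 0 for byte in data)


def utf8_lengths(text: str) -> list:
    """Return the number of UTF-8 bytes used by each character of text."""
    return [len(ch.encode("utf-8")) for ch in text]


# Assumption: Python's iso2022_jp codec stands in for ISO IR87 under ISO 2022.
jis = "日本".encode("iso2022_jp")
print(jis.startswith(ESC_JIS_X_0208))  # does the stream open with ESC $ B?

# Assumption: Python's big5 codec stands in for the BIG-5 table entry.
big5 = "中".encode("big5")
print(is_seven_bit(b"HL7"), is_seven_bit(big5))

# One ASCII letter, one accented Latin letter, one CJK character.
print(utf8_lengths("Aé日"))

# ASCII text has the same bytes in ASCII and in UTF-8.
print("HL7".encode("ascii") == "HL7".encode("utf-8"))
```

The ISO 2022 stream stays within 7-bit bytes and relies on the escape sequence, whereas the BIG-5 bytes are recognizable by their set top bit, which matches the table's note that BIG-5 does not need an escape sequence.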

Comments made by the German HL7 User Group:

Character set

Used Within the following Data Elements

00692 Character Set (2.14.9.18)
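
One way a receiver might use this data element is to pick a decoder for the rest of the message. The sketch below is a minimal Python illustration. It assumes that item 00692 is carried in the 18th field of the MSH segment, that the field may repeat, and that the listed Python codec names are acceptable stand-ins for the table values. The mapping, the function name, and the sample segment are all hypothetical.

```python
# Hypothetical mapping from Table 0211 values to Python codec names.
# The pairing is an assumption of this sketch, not part of the HL7 table,
# and it covers only some of the table's values.
TABLE_0211_TO_CODEC = {
    "ASCII": "ascii",
    "8859/1": "iso8859_1",
    "8859/2": "iso8859_2",
    "8859/3": "iso8859_3",
    "8859/4": "iso8859_4",
    "8859/5": "iso8859_5",
    "8859/6": "iso8859_6",
    "8859/7": "iso8859_7",
    "8859/8": "iso8859_8",
    "8859/9": "iso8859_9",
    "8859/15": "iso8859_15",
    "ISO IR6": "ascii",
    "GB 18030-2000": "gb18030",
    "BIG-5": "big5",
    "UNICODE UTF-8": "utf_8",
}


def codec_from_msh(msh_segment: str) -> str:
    """Return a Python codec name for the character set named in an MSH segment.

    Falls back to ASCII when the field is absent or empty, following the
    table's note that ASCII is the default. Raises KeyError for values
    that this sketch does not map.
    """
    field_sep = msh_segment[3]            # MSH-1 is the character after "MSH"
    fields = msh_segment.split(field_sep)
    repetition_sep = fields[1][1]         # second encoding character in MSH-2
    # After the split, fields[n - 1] holds MSH-n, so MSH-18 is fields[17].
    raw = fields[17] if len(fields) > 17 else ""
    first = raw.split(repetition_sep)[0].strip()
    return TABLE_0211_TO_CODEC[first or "ASCII"]


# Invented sample segment for illustration only.
SAMPLE_MSH = r"MSH|^~\&|SENDER|FAC_A|RECEIVER|FAC_B|20240101120000||ADT^A01|MSG00001|P|2.8|||||DEU|8859/1"

codec = codec_from_msh(SAMPLE_MSH)
print(codec)
print("Grüße".encode(codec).decode(codec))
```

Only the first repetition is used here. A fuller implementation would also need to handle the remaining repetitions and the escape-sequence based character sets.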