visit the hl7 website The Demo site for our new HL7 Version 2+ (plus) Standard

5.2.0.148 0211 - Alternate Character Sets (2.C.2.148)

Concept Domain Information

Display Name: Alternate Character Sets
Description:
Interpretation: Zeichensatz

Value Set Information

OID: 2.16.840.1.113883.21.128
Description: Value Set of codes that identify one of a number of possible standard alternate character sets for a message, either single-byte or double-byte.
Symbolic Name: alternateCharacterSets

Binding Information

Binding: universal

Table Metadata

Table: 211
Steward: InM

Code System Identification Information

CS-OID: 2.16.840.1.113883.18.116
Description: HL7-defined code system of concepts used to specify the character set(s) in use. Includes both single-byte and double-byte characters sets, and used in Version 2.x messaging in the MSH segment.
CS Symbolic Name: alternateCharacterSets
case insensitive: Falsch
contains 'unknown': Falsch
contains 'other': Falsch

Code System Version Information

OID Version:
5

Table Values

Value Display Name Interpretation Comment Usage Note Modification Date Active
ASCII The printable 7-bit ASCII character set. (This is the default if this field is omitted) Jul 13 2015  
8859/1 The printable characters from the ISO 8859/1 Character set Jul 13 2015  
8859/2 The printable characters from the ISO 8859/2 Character set Jul 13 2015  
8859/3 The printable characters from the ISO 8859/3 Character set Jul 13 2015  
8859/4 The printable characters from the ISO 8859/4 Character set Jul 13 2015  
8859/5 The printable characters from the ISO 8859/5 Character set Jul 13 2015  
8859/6 The printable characters from the ISO 8859/6 Character set Jul 13 2015  
8859/7 The printable characters from the ISO 8859/7 Character set Jul 13 2015  
8859/8 The printable characters from the ISO 8859/8 Character set Jul 13 2015  
8859/9 The printable characters from the ISO 8859/9 Character set Jul 13 2015  
8859/15 The printable characters from the ISO 8859/15 (Latin-15) Jul 13 2015  
ISO IR6 ASCII graphic character set consisting of 94 characters. http://www.itscj.ipsj.or.jp/ISO-IR/006.pdf Jul 13 2015  
ISO IR14 Code for Information Exchange (one byte)(JIS X 0201-1976). Note that the code contains a space, i.e., "ISO IR14". Jul 13 2015  
ISO IR87 Code for the Japanese Graphic Character set for information interchange (JIS X 0208-1990), Note that the code contains a space, i.e., “ISO IR87”. The JIS X 0208 needs an escape sequence. In Japan, the escape technique is ISO 2022. From basic ASCII, escape sequence “escape” $ B (in HEX, 1B 24 42) lets the parser know that following bytes should be handled 2-byte wise. Back to ASCII is 1B 28 42. Jul 13 2015  
ISO IR159 Code of the supplementary Japanese Graphic Character set for information interchange (JIS X 0212-1990). Note that the code contains a space, i.e., "ISO IR159". Jul 13 2015  
GB 18030-2000 Code for Chinese Character Set (GB 18030-2000) Does not need an escape sequence. Jul 13 2015  
KS X 1001 Code for Korean Character Set (KS X 1001) Jul 13 2015  
CNS 11643-1992 Code for Taiwanese Character Set (CNS 11643-1992) Does not need an escape sequence. Jul 13 2015  
BIG-5 Code for Taiwanese Character Set (BIG-5) Does not need an escape sequence. BIG-5 does not need an escape sequence. ASCII is a 7 bit character set, which means that the top bit of the byte is “0”. The parser knows that when the top bit of the byte is “0”, the character set is ASCII. When it is “1”, the following bytes should be handled as 2 bytes (or more). No escape technique is needed. However, since some servers do not correctly interpret when they receive a top bit “1”, it is advised, in internet RFC, to not use these kind of non-safe non-escape extension. Jul 13 2015  
UNICODE The world wide character standard from ISO/IEC 10646-1-1993 Deprecated. Retained for backward compatibility only as v 2.5. Replaced by specific Unicode encoding codes. Available from The Unicode Consortium, P.O. Box 700519, San Jose, CA 95170-0519. See http://www.unicode.org/unicode/consortium/consort.html Jul 13 2015  
UNICODE UTF-8 UCS Transformation Format, 8-bit form UTF-8 is a variable-length encoding, each code value is represented by 1,2 or 3 bytes, depending on the code value. 7 bit ASCII is a proper subset of UTF-8. Note that the code contains a space before UTF but not before and after the hyphen. Since UTF-8 represents the full UNICODE character set, the following restriction apply to its use: 1. UTF-8 must be the default encoding of the message, UTF-8 cannot be specified as an additional character set in MSH-18 2. There are no other character sets allowed in a message where UTF-8 is the default encoding in the message. In other words, UNICODE UTF-8 can only be specified as a single value in MSH-18 3. A message encoded in UTF-8 must not use a Byte Order Mark (BOM). Jul 13 2015  
UNICODE UTF-16 UCS Transformation Format, 16-bit form UTF-16 is identical to ISO/IEC 10646 UCS-2. Note that the code contains a space before UTF but not before and after the hyphen. Aug 10 2023  
UNICODE UTF-32 UCS Transformation Format, 32-bit form UTF-32 is defined by Unicode Technical Report #19, and is an officially recognized encoding as of Unicode Version 3.1. UTF-32 is a proper subset of ISO/IEC 10646 UCS-4. Note that the code contains a space before UTF but not before and after the hyphen. Aug 10 2023