HL7 - REFACTORED

Concept Domain
Value Set
Code System

Concept Domain Information

Display Name:	Alternate Character Sets
Description:
Interpretation:	Zeichensatz

Value Set Information

OID:	2.16.840.1.113883.21.128
Description:	Value Set of codes that identify one of a number of possible standard alternate character sets for a message, either single-byte or double-byte.
Symbolic Name:	alternateCharacterSets

Binding Information

Binding:	universal

Table Metadata

Table:	211
Steward:	InM

Code System Identification Information

CS-OID:	2.16.840.1.113883.18.116
Description:	HL7-defined code system of concepts used to specify the character set(s) in use. Includes both single-byte and double-byte characters sets, and used in Version 2.x messaging in the MSH segment.
CS Symbolic Name:	alternateCharacterSets
case insensitive:	Falsch
contains 'unknown':	Falsch
contains 'other':	Falsch

Code System Version Information

OID Version:
5

Table Values

Value	Display Name	Comment	Usage Note	Modification Date
ASCII	The printable 7-bit ASCII character set.	(This is the default if this field is omitted)		Jul 13 2015
8859/1	The printable characters from the ISO 8859/1 Character set			Jul 13 2015
8859/2	The printable characters from the ISO 8859/2 Character set			Jul 13 2015
8859/3	The printable characters from the ISO 8859/3 Character set			Jul 13 2015
8859/4	The printable characters from the ISO 8859/4 Character set			Jul 13 2015
8859/5	The printable characters from the ISO 8859/5 Character set			Jul 13 2015
8859/6	The printable characters from the ISO 8859/6 Character set			Jul 13 2015
8859/7	The printable characters from the ISO 8859/7 Character set			Jul 13 2015
8859/8	The printable characters from the ISO 8859/8 Character set			Jul 13 2015
8859/9	The printable characters from the ISO 8859/9 Character set			Jul 13 2015
8859/15	The printable characters from the ISO 8859/15 (Latin-15)			Jul 13 2015
ISO IR6	ASCII graphic character set consisting of 94 characters.	http://www.itscj.ipsj.or.jp/ISO-IR/006.pdf		Jul 13 2015
ISO IR14	Code for Information Exchange (one byte)(JIS X 0201-1976).	Note that the code contains a space, i.e., "ISO IR14".		Jul 13 2015
ISO IR87	Code for the Japanese Graphic Character set for information interchange (JIS X 0208-1990),	Note that the code contains a space, i.e., �ISO IR87�. The JIS X 0208 needs an escape sequence. In Japan, the escape technique is ISO 2022. From basic ASCII, escape sequence �escape� $ B (in HEX, 1B 24 42) lets the parser know that following bytes should be handled 2-byte wise. Back to ASCII is 1B 28 42.		Jul 13 2015
ISO IR159	Code of the supplementary Japanese Graphic Character set for information interchange (JIS X 0212-1990).	Note that the code contains a space, i.e., "ISO IR159".		Jul 13 2015
GB 18030-2000	Code for Chinese Character Set (GB 18030-2000)	Does not need an escape sequence.		Jul 13 2015
KS X 1001	Code for Korean Character Set (KS X 1001)			Jul 13 2015
CNS 11643-1992	Code for Taiwanese Character Set (CNS 11643-1992)	Does not need an escape sequence.		Jul 13 2015
BIG-5	Code for Taiwanese Character Set (BIG-5)	Does not need an escape sequence. BIG-5 does not need an escape sequence. ASCII is a 7 bit character set, which means that the top bit of the byte is �0�. The parser knows that when the top bit of the byte is �0�, the character set is ASCII. When it is �1�, the following bytes should be handled as 2 bytes (or more). No escape technique is needed. However, since some servers do not correctly interpret when they receive a top bit �1�, it is advised, in internet RFC, to not use these kind of non-safe non-escape extension.		Jul 13 2015
UNICODE	The world wide character standard from ISO/IEC 10646-1-1993	Deprecated. Retained for backward compatibility only as v 2.5. Replaced by specific Unicode encoding codes.	Available from The Unicode Consortium, P.O. Box 700519, San Jose, CA 95170-0519. See http://www.unicode.org/unicode/consortium/consort.html	Jul 13 2015
UNICODE UTF-8	UCS Transformation Format, 8-bit form	UTF-8 is a variable-length encoding, each code value is represented by 1,2 or 3 bytes, depending on the code value. 7 bit ASCII is a proper subset of UTF-8. Note that the code contains a space before UTF but not before and after the hyphen. Since UTF-8 represents the full UNICODE character set, the following restriction apply to its use: 1. UTF-8 must be the default encoding of the message, UTF-8 cannot be specified as an additional character set in MSH-18 2. There are no other character sets allowed in a message where UTF-8 is the default encoding in the message. In other words, UNICODE UTF-8 can only be specified as a single value in MSH-18 3. A message encoded in UTF-8 must not use a Byte Order Mark (BOM).		Jul 13 2015
UNICODE UTF-16	UCS Transformation Format, 16-bit form		UTF-16 is identical to ISO/IEC 10646 UCS-2. Note that the code contains a space before UTF but not before and after the hyphen.	Aug 10 2023
UNICODE UTF-32	UCS Transformation Format, 32-bit form		UTF-32 is defined by Unicode Technical Report #19, and is an officially recognized encoding as of Unicode Version 3.1. UTF-32 is a proper subset of ISO/IEC 10646 UCS-4. Note that the code contains a space before UTF but not before and after the hyphen.	Aug 10 2023

5.2.0.148 0211 - Alternate Character Sets (2.C.2.148)

Concept Domain Information

Value Set Information

Binding Information

Table Metadata

Code System Identification Information

Code System Version Information

Table Values