![]() The IAB model, as defined in Section 3.2 of ,ĭistinguishes three levels: Coded Character Set ( CCS), CharacterĮncoding Scheme ( CES), and Transfer Encoding Syntax ( TES).ĭefined to adequately cover the distinctions required for the UnicodeĬharacter encoding model. Repertoire to serialized sequences of bytesīridging all four levels in a single operation TES: Transfer Encoding Syntax a reversible transform of encoded data, which may or may not Useful concepts: CM: Character Map a mapping from sequences of members of an abstract character In addition to the four individual levels, there are two other related Sequences of particular code units of some specified width, such as 32-bit integers CES: Character Encoding Scheme a reversible transformation from a set of sequences of code units (from one or more CEFs) The four levels of the Unicode Character Encoding ModelĬan be summarized as: ACR: Abstract Character Repertoire the set of characters to be encoded, for example, some alphabet or symbol set CCS: Coded Character Set a specific mapping from an abstract character repertoire to a set of nonnegative integers, which need not be contiguous CEF: Character Encoding Form a specific mapping from a set of nonnegative integers that are elements of a CCS to a set of (Common acronyms used in this text are highlighted. The Unicode Character Encoding Model extends these models to cover all the aspects of the Unicode Standard and ISO/ IEC 10646 Internet, or the Character Data Representation Architecture definedīy IBM for organizing and cataloging its own proprietary array of character Other character encoding models such as the character architecture promoted by the Internet Standard in the context of other character encodings of all types, as well as The Unicode Character Encoding Model places the Unicode This report describes a model for the structure
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |