ISO-8859-1
ISO-8859-1 is the IANA charset name for the character encoding ISO/IEC 8859-1
Related Topics:
IANA - ISO/IEC 8859-1
~ ~ ~ ~ ~ ~ ~ ~ ~ ~
used together with the control codes from ISO/IEC 6429 for the C0 (0x00-0x1F) and C1
~ ~ ~ ~ ~ ~ ~ ~ ~ ~
(0x80-0x9F) parts. Escape sequences (from ISO/IEC 6429 or ISO/IEC 2022) are not
~ ~ ~ ~ ~ ~ ~ ~ ~ ~
to be interpreted. Most applications only interpret the control codes for LF, CR, and HT.
~ ~ ~ ~ ~ ~ ~ ~ ~ ~
A few applications also interpret VT, FF, and NEL (in C1). Very few applications
~ ~ ~ ~ ~ ~ ~ ~ ~ ~
interpret the other C0 and C1 control codes.
~ ~ ~ ~ ~ ~ ~ ~ ~ ~
It is fairly common to mislabel text data with the charset label ISO-8859-1, even though the data is
~ ~ ~ ~ ~ ~ ~ ~ ~ ~
really in Windows-1252. The difference is that in Windows-1252 codes between
~ ~ ~ ~ ~ ~ ~ ~ ~ ~
0x80 and 0x9F are used for letters and punctuation, not for control codes as in
~ ~ ~ ~ ~ ~ ~ ~ ~ ~
ISO-8859-1.
~ ~ ~ ~ ~ ~ ~ ~ ~ ~
The IANA allows all of the following aliases for ISO-8859-1 to be used case-insensitively:
~ ~ ~ ~ ~ ~ ~ ~ ~ ~
- ISO_8859-1:1987
- ISO_8859-1
- ISO-8859-1
- iso-ir-100
- csISOLatin1
- latin1
- l1
- IBM819
- CP819
The name Latin-1 is not registered with IANA, but is perhaps meaningful in some computer software. The Unicode standard uses Latin-1 to refer to characters in the U+0000 to U+00FF range.
~ ~ ~ ~ ~ ~ ~ ~ ~ ~
ISO-8859-1 is the standard encoding used by the X Window System on most Unix machines. In IBM's code page system, it is known as code page number 819.
Related Topics:
X Window System - Unix - Code page
~ ~ ~ ~ ~ ~ ~ ~ ~ ~
The following table shows the ISO-8859-1 character map. Control characters, the space character, the no-break space character, and the soft hyphen character are represented by two-, three-, or four-letter abbreviations for their names. All other characters are represented literally.
~ ~ ~ ~ ~ ~ ~ ~ ~ ~
There are additional parts to the ISO/IEC 8859 standard that have corresponding IANA-approved character maps. For example, ISO/IEC 8859-10 (Latin alphabet no. 6) is very similar to character map ISO-8859-10. Each of the ISO/IEC 8859-x parts encodes characters in the same way: they cover the ASCII range (hex 20-7E) plus 96 additional characters in the A0-FF range, for a total of 191 characters. The ISO-8859-x maps each add the ISO 646 C0 "control" characters from 00-1F, a control character at 7F, and control characters in the 80-9F range, thus encompassing a total of 256 characters. ISO-8859-1 is unique among these maps in that its coded characters are equivalent to the first 256 code points of Unicode.
Related Topics:
ASCII - Unicode
~ ~ ~ ~ ~ ~ ~ ~ ~ ~
~ Table of Content ~
| ► | Introduction |
~ What's Hot ~
~ Community ~
| ► | History Forum Come and discuss about History, Civilizations, Historical Events and Figures |
| ► | History Web-Ring A community of sites, blogs and forums dedicated to History. Do not hesitate to submit your site. |
and are licensed under the GNU Free Documentation License.
Lexicon - Privacy Policy - Spiritus-Temporis.com ©2005.
