Microsoft Store
 

ISO-8859-1


 

ISO-8859-1 is the IANA charset name for the character encoding ISO/IEC 8859-1

Related Topics:
IANA - ISO/IEC 8859-1

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

used together with the control codes from ISO/IEC 6429 for the C0 (0x00-0x1F) and C1

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

(0x80-0x9F) parts. Escape sequences (from ISO/IEC 6429 or ISO/IEC 2022) are not

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

to be interpreted. Most applications only interpret the control codes for LF, CR, and HT.

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

A few applications also interpret VT, FF, and NEL (in C1). Very few applications

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

interpret the other C0 and C1 control codes.

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

It is fairly common to mislabel text data with the charset label ISO-8859-1, even though the data is

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

really in Windows-1252. The difference is that in Windows-1252 codes between

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

0x80 and 0x9F are used for letters and punctuation, not for control codes as in

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

ISO-8859-1.

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

The IANA allows all of the following aliases for ISO-8859-1 to be used case-insensitively:

~ ~ ~ ~ ~ ~ ~ ~ ~ ~

  • ISO_8859-1:1987
  • ISO_8859-1
  • ISO-8859-1
  • iso-ir-100
  • csISOLatin1
  • latin1
  • l1
  • IBM819
  • CP819
  • The name Latin-1 is not registered with IANA, but is perhaps meaningful in some computer software. The Unicode standard uses Latin-1 to refer to characters in the U+0000 to U+00FF range.

    ~ ~ ~ ~ ~ ~ ~ ~ ~ ~

    ISO-8859-1 is the standard encoding used by the X Window System on most Unix machines. In IBM's code page system, it is known as code page number 819.

    Related Topics:
    X Window System - Unix - Code page

    ~ ~ ~ ~ ~ ~ ~ ~ ~ ~

    The following table shows the ISO-8859-1 character map. Control characters, the space character, the no-break space character, and the soft hyphen character are represented by two-, three-, or four-letter abbreviations for their names. All other characters are represented literally.

    ~ ~ ~ ~ ~ ~ ~ ~ ~ ~

    There are additional parts to the ISO/IEC 8859 standard that have corresponding IANA-approved character maps. For example, ISO/IEC 8859-10 (Latin alphabet no. 6) is very similar to character map ISO-8859-10. Each of the ISO/IEC 8859-x parts encodes characters in the same way: they cover the ASCII range (hex 20-7E) plus 96 additional characters in the A0-FF range, for a total of 191 characters. The ISO-8859-x maps each add the ISO 646 C0 "control" characters from 00-1F, a control character at 7F, and control characters in the 80-9F range, thus encompassing a total of 256 characters. ISO-8859-1 is unique among these maps in that its coded characters are equivalent to the first 256 code points of Unicode.

    Related Topics:
    ASCII - Unicode

    ~ ~ ~ ~ ~ ~ ~ ~ ~ ~