
The Extended ASCII character set extends the basic 7-bit ASCII range to 8-bits, to provide additional character encoding for broader character representation scenarios.
It builds upon ISO/IEC 2022 which defined the initial 8-bit ASCII character set, with the C0 character set range replicating 7-bit ASCII and the C1 range covering extended control characters using character codes 128-159.
The main problem with the basic ASCII character set (ASCII codes 0-127) is that it was defined and published to target English language data only. The extended character set aims to address that for Latin-based alphabets. The revised standard is split into 16 parts. For example, part 1 refers to ‘Western European’ languages and is denoted by a -1 suffix on the standard (ISO/IEC 8859-1).
For each higher-range ‘part’ (codes 128+) the first 32 characters should follow the ISO/IEC 2022 C1 character control set. Codes 128 and 129 were deprecated in a later draft of the C1 set and are available for use by specific encoding schemes. In ISO/IEC 8859-1, code 128 is the EURO currency symbol (€).
Dec | Hex | Abbreviation | Notation | Description |
128 | 80 | PAD | Padding Character (Removed in Later Drafts, Often Used for Euro Currency Symbol) | |
129 | 81 | HOP | High Octed Present (Removed in Later Drafts) | |
130 | 82 | BPH | Break Permitted Here | |
131 | 83 | NBH | No Break Here | |
132 | 84 | IND | Index | |
133 | 85 | NEL | Next Line (Equivalent to CR+LF) | |
134 | 86 | SSA | Start of Selected Area | |
135 | 87 | ESA | End of Selected Area | |
136 | 88 | HTS | Horiztonal Tabulation Set | |
137 | 89 | HTJ | Horizontal Tabulation With Justification | |
138 | 8A | VTS | Vertical Tabulation Set | |
139 | 8B | PLD | Partial Line Forward (Used for Subscript and Superscript) | |
140 | 8C | PLU | Partial Line Backward (Used for Subscript and Superscript) | |
141 | 8D | RI | Reverse Line-Feed | |
142 | 8E | SS2 | Single-Shift 2 | |
143 | 8F | SS3 | Single-Shift 3 | |
144 | 90 | DCS | Device Control String | |
145 | 91 | PU1 | Private Use 1 | |
146 | 92 | PU2 | Private Use 2 | |
147 | 93 | STS | Set Transmit State | |
148 | 94 | CCH | Cancel Character (Destructive Backspace) | |
149 | 95 | MW | Message Waiting | |
150 | 96 | SPA | Start of Protected Area | |
151 | 97 | EPA | End of Protected Area | |
152 | 98 | SOS | Start of String | |
153 | 99 | SGCI | Single Graphic Character Introducer | |
154 | 9A | SCI | Single Character Introducer | |
155 | 9B | CSI | Control Sequence Introducer | |
156 | 9C | ST | String Terminator | |
157 | 9D | OSC | Operating System Command | |
158 | 9E | PM | Privacy Message | |
159 | 9F | APC | Application Program Command |
The remaining ISO/IEC 8859-1 Western Europe encoding scheme is described below.
Dec | Hex | Abbreviation | Notation | Description |
160 | A0 | NBSP | Non-Breaking Space | |
161 | A1 | ¡ | Inverted Exclamation (Used in Spanish, Etc.) | |
162 | A2 | ¢ | Cent Currency Symbol | |
163 | A3 | £ | UK Pound Currency Symbol | |
164 | A4 | ¤ | Curency Symbol (Unspecified) | |
165 | A5 | ¥ | Chinese Yuan or Japanese Yen Currency Symbol | |
166 | A6 | ¦ | Vertical Bar | |
167 | A7 | § | Section Symbol | |
168 | A8 | ¨ | Diaeresis or Umlaut (An Accent Character, Normally Displayed Over A Vowel) | |
169 | A9 | © | Copyright Symbol | |
170 | AA | ª | Ordinal Indicator | |
171 | AB | « | Guillemet (Opening Latin Quotation Mark) | |
172 | AC | ¬ | Negation Symbol | |
173 | AD | SHY | Soft Hyphen Symbol | |
174 | AE | ® | Registered Trademark Symbol | |
175 | AF | ¯ | Macron (An Accent Character, Normally Displayed Over A Vowel) | |
176 | B0 | ° | Degree Symbol | |
177 | B1 | ± | Plus-Minus Symbol | |
178 | B2 | ² | Square Mathematical Operator Symbol | |
179 | B3 | ³ | Cube Mathematical Operator Symbol | |
180 | B4 | ´ | Acute (An Accent Character, Normally Displayed Over A Vowel) | |
181 | B5 | µ | Micro Symbol (Represents One Millionth) | |
182 | B6 | ¶ | Pilcrow (End Of Paragraph Symbol) | |
183 | B7 | · | Interpunct (Interword Separator) | |
184 | B8 | ¸ | Cedilla (A Descent Character, Normally Displayed Under A ‘C’ Character) | |
185 | B9 | ¹ | Superscript and SubScript | |
186 | BA | º | Ordinal Indicator | |
187 | BB | » | Guillemet (Closing Latin Quotation Mark) | |
188 | BC | ¼ | Quarter Mathematical Symbol | |
189 | BD | ½ | Half Mathematical Symbol | |
190 | BE | ¾ | Three-Quarters Mathematical Symbol | |
191 | BF | ¿ | Inverted Question Mark (Used In Spanish, Etc.) | |
192 | C0 | À | A-Grave (Capital A Character with Grave Applied) | |
193 | C1 | Á | A-Acute (Capital A Character with Acute Applied) | |
194 | C2 | Â | A-Circumflex (Capital A Character with Circumflex Applied) | |
195 | C3 | Ã | A-Tilde (Capital A Character with Tilde Applied) | |
196 | C4 | Ä | A-Diaeresis or A-Umlaut (Capital A Character with Diaeresis Applied) | |
197 | C5 | Å | A-Ring (A Separate Character In Its Own Right!) | |
198 | C6 | Æ | Ash (Capital Formed from A and E, A Separate Character In Its Own Right!) | |
199 | C7 | Ç | C-Cedilla (Capital C with Cedilla Applied) | |
200 | C8 | È | E-Grave (Capital E Character with Grave Applied) | |
201 | C9 | É | E-Acute (Capital E Character with Acute Applied) | |
202 | CA | Ê | E-Circumflex (Capital E Character with Circumflex Applied) | |
203 | CB | Ë | E-Diaeresis or E-Umlaut (Capital E Character with Diaeresis Applied) | |
204 | CC | Ì | I-Grave (Capital I Character with Grave Applied) | |
205 | CD | Í | I-Acute (Capital I Character with Acute Applied) | |
206 | CE | Î | I-Circumflex (Capital I Character with Circumflex Applied) | |
207 | CF | Ï | I-Diaeresis or I-Umlaut (Capital I Character with Diaeresis Applied) | |
208 | D0 | Ð | Eth (A ‘D’ Character Used In Old English And Other Languages) | |
209 | D1 | Ñ | N-Tilde (Capital N Character with Tilde Applied) | |
210 | D2 | Ò | O-Grave (Capital O Character with Grave Applied) | |
211 | D3 | Ó | O-Acute (Capital O Character with Acute Applied) | |
212 | D4 | Ô | O-Circumflex (Capital O Character with Circumflex Applied) | |
213 | D5 | Õ | O-Tilde (Capital O Character with Tilde Applied) | |
214 | D6 | Ö | O-Diaeresis or O-Umlaut (Capital O Character with Diaeresis Applied) | |
215 | D7 | × | Multiplication Symbol | |
216 | D8 | Ø | O-Slash (Capital O Character with a Forward-Slash Strike Through) | |
217 | D9 | Ù | U-Grave (Capital U Character with Grave Applied) | |
218 | DA | Ú | U-Acute (Capital U Character with Acute Applied) | |
219 | DB | Û | U-Circumflex (Capital U Character with Circumflex Applied) | |
220 | DC | Ü | U-Diaeresis or U-Umlaut (Capital U Character with Diaeresis Applied) | |
221 | DD | Ý | Y-Acute (Capital Y Character with Acute Applied) | |
222 | DE | Þ | Thorn Character (Capital, Old English and other Languages) | |
223 | DF | ß | Eszett (German B Character) | |
224 | E0 | à | A-Grave (Lower-Case A Character with Grave Applied) | |
225 | E1 | á | A-Acute (Lower-Case A Character with Acute Applied) | |
226 | E2 | â | A-Circumflex (Lower-Case A Character with Circumflex Applied) | |
227 | E3 | ã | A-Tilde (Lower-Case A Character with Tilde Applied) | |
228 | E4 | ä | A-Diaeresis or A-Umlaut (Lower-Case A Character with Diaeresis Applied) | |
229 | E5 | å | A-Ring Lower-Case (A Separate Character In Its Own Right!) | |
230 | E6 | æ | Ash Lower-Case (Lower-Case Formed from A and E, A Separate Character In Its Own Right!) | |
231 | E7 | ç | C-Cedilla (Lower-Case C with Cedilla Applied) | |
232 | E8 | è | E-Grave (Lower-Case E Character with Grave Applied) | |
233 | E9 | é | E-Acute (Lower-Case E Character with Acute Applied) | |
234 | EA | ê | E-Circumflex (Lower-Case E Character with Circumflex Applied) | |
235 | EB | ë | E-Diaeresis or E-Umlaut (Lower-Case E Character with Diaeresis Applied) | |
236 | EC | ì | I-Grave (Lower-Case I Character with Grave Applied) | |
237 | ED | í | I-Acute (Lower-Case I Character with Acute Applied) | |
238 | EE | î | I-Circumflex (Lower-Case I Character with Circumflex Applied) | |
239 | EF | ï | I-Diaeresis or I-Umlaut (Lower-Case I Character with Diaeresis Applied) | |
240 | F0 | ð | Eth (A ‘d’ Character Used In Old English And Other Languages) | |
241 | F1 | ñ | N-Tilde (Lower-Case N Character with Tilde Applied) | |
242 | F2 | ò | O-Grave (Lower-Case O Character with Grave Applied) | |
243 | F3 | ó | O-Acute (Lower-Case O Character with Acute Applied) | |
244 | F4 | ô | O-Circumflex (Lower-Case O Character with Circumflex Applied) | |
245 | F5 | õ | O-Tilde (Lower-Case O Character with Tilde Applied) | |
246 | F6 | ö | O-Diaeresis or O-Umlaut (Lower-Case O Character with Diaeresis Applied) | |
247 | F7 | ÷ | Division Symbol | |
248 | F8 | ø | O-Slash (Lower-Case O Character with a Forward-Slash Strike Through) | |
249 | F9 | ù | U-Grave (Lower-Case U Character with Grave Applied) | |
250 | FA | ú | U-Acute (Lower-Case U Character with Acute Applied) | |
251 | FB | û | U-Circumflex (Lower-Case U Character with Circumflex Applied) | |
252 | FC | ü | U-Diaeresis or U-Umlaut (Lower-Case U Character with Diaeresis Applied) | |
253 | FD | ý | Y-Acute (Lower-Case Y Character with Acute Applied) | |
254 | FE | þ | Thorn Character (Lower-Case, Old English and other Languages) | |
255 | FF | ÿ | Y-Acute (Lower-Case Y Character with Acute Applied) |