National Replacement Character Set
The National Replacement Character Set, or NRCS for short, was a feature supported by later models of Digital's (DEC) computer terminal systems, starting with the VT200 series in 1983. NRCS allowed individual characters from one character set to be replaced by one from another set, allowing the construction of different character sets on the fly. It was used to customize the character set to different local languages, without having to change the terminal's ROM for different counties, or alternately, include many different sets in a larger ROM. Many 3rd party terminals and terminal emulators supporting VT200 codes also supported NRCS.
Description
ASCII is a 7-bit standard, allowing a total of 128 characters in the character set. Some of these are reserved as control characters, leaving 96 printable characters. This set of 96 printable characters includes upper and lower case letters, numbers, and basic math and punctuation.
ASCII does not have enough room to include other common characters such as multi-national currency symbols or the various accented letters common in European languages. This led to a number of country-specific varieties of 7-bit ASCII with certain characters replaced. For instance, the UK standard simply replaced ASCII's hash mark, #
, with the pound symbol, £
. This normally led to different models of a given computer terminal or printer, differing only in the glyphs stored in ROM. These were standardized as part of ISO/IEC 646.[1][2]
On an 8-bit clean serial link, ASCII can be expanded to support a total of 256 characters. In this case, instead of replacing the characters in the original printable characters range from 32 to 127, new characters are added in the 128 to 255 range. This offers enough room for a single character set to include all the variety of characters used in North America and western Europe. This capability led to the introduction of the ISO/IEC 8859-1 standard character set containing 191 characters of what it calls the "Latin alphabet no. 1", but normally referred to as "ISO Latin". Windows-1252 is a slightly expanded superset of ISO Latin.[2]
NRCS was introduced to solve the problem of requiring different terminals for each country by allowing characters in the basic 7-bit ASCII set to be re-defined by copying the glyph from the DEC's version of ISO Latin, the Multinational Character Set (MCS). This meant that the ROM had to store only two character sets, standard ASCII and MCS, and could build any required local ASCII variant on the fly. For instance, instead of having a separate "UK ASCII" version of the terminal with a modified glyph in ROM, the terminal included an NRCS with instructions to replace the hash mark glyph with the pound. When used in the UK, typing Shift 3
produced the pound, the same keys pressed on a US terminal produced hash.[2]
The NRCS could be set through a setup command, or more commonly, by replacing the keyboard with a model that sent back a code when first booted. That way simply plugging in a UK keyboard, which had a pound sign on the 3 key, automatically set the NRSC to that same replacement.[2]
NRC Sets
DEC terminals from the VT220 on had 12 different NRCS sets in addition to standard ASCII:[2]
Character set | Code page | 0x23 | 0x40 | 0x5B | 0x5C | 0x5D | 0x5E | 0x5F | 0x60 | 0x7B | 0x7C | 0x7D | 0x7E |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Standard ASCII | 367 | # | @ | [ | \ | ] | ^ | _ | ` | { | | | } | ~ |
United Kingdom[3] | 1101[4] | £ | @ | [ | \ | ] | ^ | _ | ` | { | | | } | ~ |
Denmark/Norway[3] (Alternate) | 1107[5] | # | @ | Æ | Ø | Å | ^ | _ | ` | æ | ø | å | ~ |
Denmark/Norway | 1105[6] | # | Ä | Æ | Ø | Å | Ü | _ | ä | æ | ø | å | ü |
Dutch[3] | 1102[7] | £ | ¾ | ij[3][7] (ÿ[2]) | ½ | | | ^ | _ | ` | ¨ | ƒ | ¼ | ´ |
Finnish[3] | 1103[8] | # | @ | Ä | Ö | Å | Ü | _ | é | ä | ö | å | ü |
French[3] | 1104[9] | £ | à | ° | ç | § | ^ | _ | ` | é | ù | è | ¨ |
French Canadian[3] | 1020[10] | # | à | â | ç | ê | î | _ | ô | é | ù | è | û |
German[3] | # | § | Ä | Ö | Ü | ^ | _ | ` | ä | ö | ü | ß | |
Italian[3] | £ | § | ° | ç | é | ^ | _ | ù | à | ò | è | ì | |
Portuguese[2] | # | @ | Ã | Ç | Õ | ^ | _ | ` | ã | ç | õ | ~ | |
Spanish[3] | 1023[11] | £ | § | ¡ | Ñ | ¿ | ^ | _ | ` | ° | ñ | ç | ~ |
Swedish[3] | 1106[12] | # | É | Ä | Ö | Å | Ü | _ | é | ä | ö | å | ü |
Swiss[3] | 1021[13] | ù | à | é | ç | ê | î | è | ô | ä | ö | ü | û |
References
- ↑ Hartman Kennelly, Cynthia (1991). Unch, Jacqueline, ed. Digital Guide To Developing International Software (1 ed.). Digital Equipment Corporation. ISBN 1-55558-063-7. EY-F577E-DP.
- 1 2 3 4 5 6 7 DEC (June 1987). "Appendix E". VT320 Programming Summary. Digital Press. (The provided link goes to a digitized version, which contains some subtle OCR errors, therefore isn't a reliable reference for the character set mappings)
- 1 2 3 4 5 6 7 8 9 10 11 12 "VT220 Programmer Reference Manual" (2 ed.). Digital Equipment Corporation (DEC). 1984 [1983].
- ↑ "SBCS code page information - CPGID: 01101 / Name: British NRC Set". IBM Software: Globalization: Coded character sets and related resources: Code pages by CPGID: Code page identifiers. 1. IBM. 1992-10-01. Archived from the original on 2016-12-05. Retrieved 2016-12-05.
- ↑ "SBCS code page information - CPGID: 01107 / Name: Norwegian/Danish NRC Alternate". IBM Software: Globalization: Coded character sets and related resources: Code pages by CPGID: Code page identifiers. 1. IBM. 1992-10-01. Archived from the original on 2016-12-05. Retrieved 2016-12-05.
- ↑ "SBCS code page information - CPGID: 01105 / Name: Norwegian/Danish NRC Set". IBM Software: Globalization: Coded character sets and related resources: Code pages by CPGID: Code page identifiers. 1. IBM. 1992-10-01. Archived from the original on 2016-12-05. Retrieved 2016-12-05.
- 1 2 "SBCS code page information - CPGID: 01102 / Name: Dutch NRC Set". IBM Software: Globalization: Coded character sets and related resources: Code pages by CPGID: Code page identifiers. 1. IBM. 1992-10-01. Archived from the original on 2016-12-05. Retrieved 2016-12-05.
- ↑ "SBCS code page information - CPGID: 01103 / Name: Finnish NRC Set". IBM Software: Globalization: Coded character sets and related resources: Code pages by CPGID: Code page identifiers. 1. IBM. 1992-10-01. Archived from the original on 2016-12-05. Retrieved 2016-12-05.
- ↑ "SBCS code page information - CPGID: 01104 / Name: French NRC Set". IBM Software: Globalization: Coded character sets and related resources: Code pages by CPGID: Code page identifiers. 1. IBM. 1992-10-01. Archived from the original on 2016-12-05. Retrieved 2016-12-05.
- ↑ "SBCS code page information - CPGID: 01020 / Name: Canadian (French) Variant". IBM Software: Globalization: Coded character sets and related resources: Code pages by CPGID: Code page identifiers. 1. IBM. 1992-10-01. Archived from the original on 2016-12-05. Retrieved 2016-12-05.
- ↑ "SBCS code page information - CPGID: 01023 / Name: Spain Variant". IBM Software: Globalization: Coded character sets and related resources: Code pages by CPGID: Code page identifiers. 1. IBM. 1992-10-01. Archived from the original on 2016-12-05. Retrieved 2016-12-05.
- ↑ "SBCS code page information - CPGID: 01106 / Name: Swedish British NRC Set". IBM Software: Globalization: Coded character sets and related resources: Code pages by CPGID: Code page identifiers. 1. IBM. 1992-10-01. Archived from the original on 2016-12-05. Retrieved 2016-12-05.
- ↑ "SBCS code page information - CPGID: 01021 / Name: Switzerland Variant". IBM Software: Globalization: Coded character sets and related resources: Code pages by CPGID: Code page identifiers. 1. IBM. 1992-10-01. Archived from the original on 2016-12-05. Retrieved 2016-12-05.