Unicode symbols
In computing, a Unicode symbol is a Unicode character which is not part of a script used to write a natural language, but is nonetheless available for use as part of a text.
Many of the symbols are drawn from existing character sets or ISO or other national and international standards. The Unicode Standard states that "The universe of symbols is rich and open-ended."[1] This makes the issue of what symbols to encode and how symbols should be encoded more complicated than the issues surrounding writing systems. Unicode focuses on symbols that make sense in a one-dimensional plain-text context. For example, the typical two-dimensional arrangement of electronic diagram symbols justifies their exclusion.[2] (Box-drawing characters are a partial exception, for legacy purposes, and a number of electronic diagram symbols are indeed encoded in Unicode's Miscellaneous Technical block.) For adequate treatment in plain text, symbols must also be displayable in a monochromatic setting. Even with these limitations – monochromatic, one-dimensional and standards-based – the domain of potential Unicode symbols is extensive. (However, emojis – ideograms, graphic symbols – that where admitted into Unicode, allow colors while the colors are not standardized.)
Symbol Block Table
Code | Glyph | Description | # |
---|---|---|---|
U+2013 | – | En dash | 0903 |
U+2014 | — | Em dash | 0904 |
U+2015 | ― | Horizontal bar | 0905 |
U+2017 | ‗ | Double low line | 0906 |
U+2018 | ‘ | Left single quotation mark | 0907 |
U+2019 | ’ | Right single quotation mark | 0908 |
U+201A | ‚ | Single low-9 quotation mark | 0909 |
U+201B | ‛ | Single high-reversed-9 quotation mark | 0910 |
U+201C | “ | Left double quotation mark | 0911 |
U+201D | ” | Right double quotation mark | 0912 |
U+201E | „ | Double low-9 quotation mark | 0913 |
U+2020 | † | Dagger | 0914 |
U+2021 | ‡ | Double dagger | 0915 |
U+2022 | • | Bullet | 0916 |
U+2026 | … | Horizontal ellipsis | 0917 |
U+2030 | ‰ | Per mille sign | 0918 |
U+2032 | ′ | Prime | 0919 |
U+2033 | ″ | Double prime | 0920 |
U+2039 | ‹ | Single left-pointing angle quotation mark | 0921 |
U+203A | › | Single right-pointing angle quotation mark | 0922 |
U+203C | ‼ | Double exclamation mark | 0923 |
U+203E | ‾ | Overline | 0924 |
U+2044 | ⁄ | Fraction slash | 0925 |
U+204A | ⁊ | Tironian et sign | 0926 |
Symbol block list
The following Unicode ranges encode Symbols
- Alphanumeric variants (based on Latin characters in Unicode)
- General Punctuation (U+2000–U+206F)
- Superscripts and Subscripts (U+2070–U+209F)
- Currency Symbols (U+20A0–U+20CF)
- Letterlike Symbols (U+2100–U+214F)
- Number Forms (U+2150–U+218F)
- Phonetic Symbols (including IPA)
- Enclosed variants
- Enclosed Alphanumerics (U+2460–U+24FF)
- Enclosed Alphanumeric Supplement (1F100–1F1FF)
- Enclosed Ideographic Supplement (1F200–1F2FF)
- Arrows
- Arrows (U+2190–U+21FF)
- Supplemental Arrows-A (U+27F0–U+27FF)
- Supplemental Arrows-B (U+2900–U+297F)
- Supplemental Arrows-C (U+1F800-U+1F8FF)
- Miscellaneous Symbols and Arrows (U+2B00–U+2BFF)
- Dingbat arrows (U+2794–U+27BF)
- Mathematical
- Mathematical Operators (U+2200–U+22FF)
- Miscellaneous Mathematical Symbols-A (U+27C0–U+27EF)
- Miscellaneous Mathematical Symbols-B (U+2980–U+29FF)
- Supplemental Mathematical Operators (U+2A00–U+2AFF)
- Mathematical Alphanumeric Symbols (U+1D400–U+1D7FF)
- Technical
- Miscellaneous Technical (U+2300–U+23FF)
- Control Pictures (U+2400–U+243F)
- Character Recognition (U+2440–U+245F)
- Musical
- Byzantine Musical Symbols (U+1D000–U+1D0FF)
- Musical Symbols (U+1D100–U+1D1FF)
- Ancient Greek Musical Notation (U+1D200–U+1D24F)
- Games
- Mahjong Tiles (U+1F000–U+1F02F)
- Domino Tiles (U+1F030–U+1F09F)
- Playing Cards (U+1F0A0–U+1F0FF)
- Emoji and emoticons
- Miscellaneous Symbols (U+2600–U+26FF)
- Emoticons (U+1F600–U+1F64F)
- Miscellaneous Symbols and Pictographs (U+1F300–U+1F5FF)
- Transport and Map Symbols (U+1F680..U+1F6FF)
- Dingbats (U+2700–U+27BF)
- Additional emoji can be found in the following Unicode blocks: Arrows, CJK Symbols and Punctuation, Enclosed Alphanumeric Supplement, Enclosed CJK Letters and Months, Enclosed Ideographic Supplement, General Punctuation, Geometric Shapes, Latin-1 Supplement, Letterlike Symbols, Mahjong Tiles, Miscellaneous Symbols and Arrows, Miscellaneous Technical, Playing Cards, and Supplemental Arrows-B.
- Miscellaneous
- Combining Diacritical Marks for Symbols (U+20D0–U+20FF)
- Box Drawing (U+2500–U+257F)
- Block Elements (U+2580–U+259F)
- Geometric Shapes (U+25A0–U+25FF)
- Geometric Shapes Extended (U+1F780-U+1F7FF)
- Ornamental Dingbats (U+1F650-U+1F67F)
- Miscellaneous Symbols and Arrows (U+2B00–U+2BFF)
- Arabic Mathematical Alphabetic Symbols (1EE00–1EEFF)
- Alchemical Symbols (1F700–1F77F)
See also
Notes
- ↑ The Unicode Consortium. The Unicode Standard, Version 6.2.0, ISBN 978-1-936213-07-8, 2012, , Chapter 15, Symbols
- ↑ Unicode Standard 5.0; Chapter 12 (p302)
References
External links
- Unicode character code charts
- FileFormat.Info – The Digital Rosetta Stone
- Draft Unicode Technical Report #25: Unicode Support for Mathematics
- decodeunicode.org – Unicode-wiki with all 98,884 graphical Unicode 5.0 characters as GIF images in three sizes (including full text search) – English/German