Search references for UNICODE. Phrases containing UNICODE
See searches and references containing UNICODE!UNICODE
Character encoding standard
uncommon Unicode characters. Without proper rendering support, you may see question marks, boxes, or other symbols. Unicode (also known as The Unicode Standard
Unicode
Symbols for emotional cues in text
This article contains Unicode emoticons or emoji. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the
Emoji
Unicode block containing some special codepoints and two non-characters
Specials is a short Unicode block of characters allocated at the very end of the Basic Multilingual Plane, at U+FFF0–FFFF, containing these code points:
Specials_(Unicode_block)
scripts in Unicode include: Ahom (Unicode block) Balinese (Unicode block) Batak (Unicode block) Bhaiksuki (Unicode block) Buhid (Unicode block) Buginese
List_of_Unicode_characters
Purposely unassigned Unicode code points
In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the standard. Three Private Use
Private_Use_Areas
ASCII-compatible variable-width encoding of Unicode
used for electronic communication. Defined by the Unicode Standard, the name is derived from Unicode Transformation Format – 8-bit. As of 2026, almost
UTF-8
Ninth letter of the Latin alphabet
LETTER I The positions 0x49 and 0x69 were used by ASCII and inherited by Unicode. EBCDIC used 0xC9 and 0x89 for I and i. Brown & Kiddle (1870) The institutes
I
Script used to write the Greek language
character list in Unicode Unicode collation charts – including Greek and Coptic letters, sorted by shape Examples of Greek handwriting Greek Unicode Issues (Nick
Greek_alphabet
Unicode denominator & numerator glyphs
rendering support, you may see question marks, boxes, or other symbols. Unicode has subscripted and superscripted versions of a number of characters including
Unicode subscripts and superscripts
Unicode_subscripts_and_superscripts
Input characters using their Unicode code points
Unicode input is a method to encode specific characters that are not directly available on a physical keyboard. Characters can be entered either by selecting
Unicode_input
Organization maintaining the Unicode Standard
The Unicode Consortium (legally Unicode, Inc.) is a 501(c)(3) non-profit organization incorporated and based in Mountain View, California, U.S. Its primary
Unicode_Consortium
Emoji and others representing or depicting heart shapes
found its way into many character sets and encodings, including those of Unicode. Some characters depict the shape directly, others reference it in a more
Hearts_in_Unicode
Block of Unicode symbols
may see question marks, boxes, or other symbols. Geometric Shapes is a Unicode block of 96 symbols at code point range U+25A0–25FF. The BLACK CIRCLE is
Geometric Shapes (Unicode block)
Geometric_Shapes_(Unicode_block)
Unicode block
symbols in Unicode Unicode input "Unicode character database". The Unicode Standard. Retrieved 2023-07-26. "Enumerated Versions of The Unicode Standard"
Arrows_(Unicode_block)
Punctuation and accent mark (~, ◌̃)
vs Unicode mapping table", JIS X 0213:2004, X 0213, archived from the original on 24 May 2009, retrieved 28 April 2009. Shift-JIS to Unicode, Unicode, archived
Tilde
Named range of Unicode code points
A Unicode block is one of several contiguous ranges of numeric character codes (code points) of the Unicode character set that are defined by the Unicode
Unicode_block
Continuous group of 65536 Unicode code points
In the Unicode standard, a plane is a contiguous group of 65,536 (216) code points. There are 17 planes, identified by the numbers 0 to 16, which corresponds
Plane_(Unicode)
Unicode character
The byte order mark (BOM) is a particular usage of the special Unicode character code, U+FEFF ZERO WIDTH NO-BREAK SPACE, whose appearance as a magic number
Byte_order_mark
marks, boxes, or other symbols. The Unicode Standard encodes almost all standard characters used in mathematics. Unicode Technical Report #25 provides comprehensive
Mathematical operators and symbols in Unicode
Mathematical_operators_and_symbols_in_Unicode
Punctuation mark used to join words
entity. In character encoding for use with computers, it is represented in Unicode by any of several characters. These include the dual-use hyphen-minus,
Hyphen
Symbols used in pre-19th-century chemistry
This article contains Unicode alchemical symbols. Without proper rendering support, you may see question marks, boxes, or other symbols instead of alchemical
Alchemical_symbol
Computer text file character representing blank space
three-character-cells-wide SPACE symbol "SPC" (analogous to Unicode's single-cell-wide U+2420). The Braille Patterns Unicode block contains U+2800 ⠀ BRAILLE PATTERN BLANK
Whitespace_character
Using numbers to represent text characters
representing more characters were created, such as ASCII, ISO/IEC 8859, and Unicode encodings such as UTF-8 and UTF-16. The most popular character encoding
Character_encoding
Punctuation mark
"Small Form Variants" (PDF). The Unicode Standard. Unicode Consortium. "Ogham Code Chart" (PDF). The Unicode Standard. Unicode Consortium. Archived (PDF) from
Bracket
Characters for drawing frames and boxes
screen and portraying drop shadows. Unicode includes 128 such characters in the Box Drawing block. In many Unicode fonts, only the subset that is also
Box-drawing_characters
Typographic symbol class
This article contains Unicode emoticons or emoji. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the
Dingbat
Graphemes for various number systems
A numeral (often called number in Unicode) is a character that denotes a number. The decimal number digits 0–9 are used widely in various writing systems
Numerals_in_Unicode
Letter of the Latin alphabet; used in the German language
"ISO/IEC 8859-1:1998 to Unicode". Unicode Consortium. Whistler, Ken (2015-12-02) [1999-07-27]. "ISO/IEC 8859-2:1999 to Unicode". Unicode Consortium. Whistler
ß
Computer font that maps glyphs to code points defined in the Unicode Standard
Unicode font is a computer font that maps glyphs to code points defined in the Unicode Standard. The term has become archaic because the vast majority
Unicode_font
Set of alphabetic symbols that allow for special handling
The regional indicator symbols are a set of 26 alphabetic Unicode characters (A–Z) intended to be used to encode ISO 3166-1 alpha-2 two-letter country
Regional_indicator_symbol
Unicode text character not part of a natural language script
In computing, a Unicode symbol is a Unicode character which is not part of a script used to write a natural language, but is nonetheless available for
Unicode_symbol
Sequence of characters that forms a search pattern
for Unicode. In most respects it makes no difference what the character set is, but some issues do arise when extending regexes to support Unicode. Supported
Regular_expression
Unicode code point property names and their uses
The Unicode Standard assigns various properties to each Unicode character and code point. The properties can be used to handle characters (code points)
Unicode_character_property
Unicode character block
rendering support, you may see question marks, boxes, or other symbols. In Unicode, the Sumero-Akkadian Cuneiform script is covered in three blocks in the
Cuneiform_(Unicode_block)
This article contains uncommon Unicode characters. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the
List_of_numeral_systems
Text characters representing chess pieces
rendering support, you may see question marks, boxes, or other symbols. Unicode has text representations of chess pieces. These allow to produce the symbols
Chess_symbols_in_Unicode
Unicode character block
symbols in Unicode "Unicode 1.0.1 Addendum" (PDF). The Unicode Standard. 1992-11-03. Retrieved 2016-07-09. "Unicode character database". The Unicode Standard
Combining_Diacritical_Marks
Glyph combining two or more letterforms
handle Unicode, and have the correct Unicode fonts installed, some or all of these will display correctly. See also the provided graphic. Unicode maintains
Ligature_(writing)
String collation algorithm
The Unicode collation algorithm (UCA) is an algorithm defined in Unicode Technical Report #10, which is a customizable method to produce binary keys from
Unicode_collation_algorithm
Something that represents an idea, process, or physical entity
"Unicode Technical Report #28: Unicode 3.2". Unicode Consortium. 27 March 2002. Retrieved 23 June 2022. Jenkins, John H. (26 August 2021). "Unicode Standard
Symbol
Originally, these icons consisted of ASCII art, and later, Shift JIS art and Unicode art. In recent times, graphical icons, both static and animated, have joined
List_of_emoticons
script. For a far more comprehensive list of symbols and signs, see List of Unicode characters. For other languages and symbol sets (especially in mathematics
List of typographical symbols and punctuation marks
List_of_typographical_symbols_and_punctuation_marks
Slanting line punctuation mark (/)
5 also slash mark: DIAGONAL : 4 "Unicode 1.1 Composite Name List, including default properties". Unicode.org. Unicode Consortium. 5 July 1995. Archived
Slash_(punctuation)
Graphical symbol or pictogram used to point or indicate direction
Modifier Letters Unicode blocks. Dingbat Box-drawing character Box Drawing (Unicode Block) Block Elements (Unicode Block) Geometric Shapes (Unicode block) HTML
Arrow_(symbol)
Japanese syllabary
added to the Unicode Standard in October, 1991 with the release of version 1.0. The Unicode block for Hiragana is U+3040–U+309F: The Unicode hiragana block
Hiragana
Unicode character block
This article contains Unicode Braille characters. Without proper rendering support, you may see question marks, boxes, or other symbols instead of Braille
Braille_Patterns
definition of a Cyrillic letter for this list is a character encoded in the Unicode standard that a has script property of 'Cyrillic' and the general category
List_of_Cyrillic_letters
Unicode block
This article contains Unicode emoticons or emoji. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the
Miscellaneous_Symbols
assignments, Unicode resolved this issue. Fonts which support a wide range of Unicode scripts and Unicode symbols are sometimes referred to as "pan-Unicode fonts"
List_of_typefaces
Emoji featuring laughing crying face
part of the Emoticons block of Unicode and was added to the Unicode Standard in 2010 in Unicode 6.0, the first Unicode release intended to release emoji
Face_with_Tears_of_Joy_emoji
Aspect of the Unicode standard
Unicode equivalence is the specification by the Unicode character encoding standard that some sequences of code points represent essentially the same
Unicode_equivalence
Complete list of the characters available on most computers
rendering support, you may see question marks, boxes, or other symbols. The Unicode Consortium and the ISO/IEC JTC 1/SC 2/WG 2 jointly collaborate on the list
Universal Character Set characters
Universal_Character_Set_characters
You may need rendering support to display the Unicode emoticons or emoji in this article correctly. Unicode 17.0 specifies a total of 3,953 emoji using
List_of_emojis
Japanese syllabary
to the Unicode standard in October 2010 with the release of version 6.0. The Unicode block for Kana Supplement is U+1B000–U+1B0FF: The Unicode block for
Katakana
Mathematical symbol representing infinity
Components for Unicode. Unicode Consortium. Retrieved 2022-02-19 – via GitHub. "IBM-970". International Components for Unicode. Unicode Consortium. May
Infinity_symbol
Characters from the Latin script encoded in the Unicode Standard
Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended
Latin_script_in_Unicode
definition of a Latin-script letter for this list is a character encoded in the Unicode Standard that has a script property of 'Latin' and the general category
List_of_Latin-script_letters
Punctuation mark with various forms
that involve these keys. Also, techniques using their Unicode code points are available; see Unicode input. Macintosh character sets have always had curved
Quotation_mark
Subset of characters in Unicode
v t e In Unicode, a script is a collection of letters and other written signs used to represent textual information in one or more writing systems. Some
Script_(Unicode)
Alternative width characters in East Asian typography
character, hence the name. Halfwidth and Fullwidth Forms is also the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth
Halfwidth_and_fullwidth_forms
Unicode character block
The Unicode Standard. Retrieved 2025-10-13. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2025-10-13. The Unicode Consortium
Arabic_(Unicode_block)
Alternative forms for the Cyrillic letter O
Cyrillic letter O. They were proposed for inclusion into Unicode in 2007 and incorporated as in Unicode 5.1. Monocular O (Ꙩ ꙩ) is one of the rare glyph variants
Cyrillic_O_variants
Twentieth letter of the Latin alphabet
This article contains uncommon Unicode characters. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the
T
Family of abugida writing systems
13: South and Central Asia-II" (PDF). The Unicode Standard, Version 11.0. Mountain View, California: Unicode, Inc. June 2018. ISBN 978-1-936213-19-1. Aditya
Brahmic_scripts
Unicode character block
is a Unicode block containing runic characters. It was introduced in Unicode 3.0 (1999), with eight additional characters introduced in Unicode 7.0 (2014)
Runic_(Unicode_block)
Neo-grotesque sans-serif typeface
Arial Unicode MS is a TrueType font and the extended version of the font Arial. Compared to Arial, it includes higher line height, omits kerning pairs
Arial_Unicode_MS
Unicode character block
Block Elements is a Unicode block containing square block symbols of various fill and shading. Used along with block elements are box-drawing characters
Block_Elements
Twenty-first letter in the Greek alphabet
descends from phi. Like other Greek letters, lowercase phi (encoded as the Unicode character U+03C6 φ GREEK SMALL LETTER PHI) is used as a mathematical or
Phi
Unicode character block
The Basic Latin Unicode block, sometimes informally called C0 Controls and Basic Latin, is the first block of the Unicode standard, and the only block
Basic_Latin_(Unicode_block)
Writing system
shown to conform to the Unicode definition of a character: this aspect is the responsibility of the typeface designer. The Unicode 5.1 standard, released
Cyrillic_script
Currency symbol for the Indian Rupee (INR)
the end of 2010. The sign was adopted by the Unicode Consortium and was given a codepoint (20B9) in Unicode with the coming of version 6.0 on 12 October
Indian_rupee_sign
Japanese syllabic writing systems
2026. "Kana Supplement" (PDF). Unicode 15.1. Unicode. Retrieved 11 March 2024. "Kana Extended-A" (PDF). Unicode 15.1. Unicode. Retrieved 11 March 2024. 關根江山
Kana
Unicode character block
This article contains uncommon Unicode characters. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the
Musical Symbols (Unicode block)
Musical_Symbols_(Unicode_block)
Unicode character block
symbols. Look up Appendix:Unicode/Egyptian Hieroglyphs in Wiktionary, the free dictionary. Egyptian Hieroglyphs is a Unicode block containing the Gardiner's
Egyptian Hieroglyphs (Unicode block)
Egyptian_Hieroglyphs_(Unicode_block)
Native alphabet of the Korean language
not fully supported in Unicode until the late 2000s. Private Use Areas were often used for that purpose until official Unicode implementation was added
Hangul
Unicode script encoding
As of Unicode version 17.0, Cyrillic script is encoded across several blocks: Cyrillic: U+0400–U+04FF, 256 characters Cyrillic Supplement: U+0500–U+052F
Cyrillic_script_in_Unicode
System of Chinese character radicals
that order characters by radical and stroke count. They are encoded in Unicode alongside other CJK characters, under the block "Kangxi radicals", while
Kangxi_radicals
Many scripts in Unicode, such as Arabic, have special orthographic rules that require certain combinations of letterforms to be combined into special
Arabic_script_in_Unicode
Brahmic script
represented by combining multiple Unicode code points, as can be seen in the Unicode Tamil Syllabary below. In Unicode 5.1, named sequences were added for
Tamil_script
Typographical symbol
The symbol -, known in Unicode as hyphen-minus, is the form of hyphen most commonly used in digital documents. On most keyboards, it is the only character
Hyphen-minus
Markup language and file format
across the Internet. It is a textual data format with strong support via Unicode for different human languages. Although the design of XML focuses on documents
XML
Relationship between Unicode characters and HTML
multilingual text represented with the Unicode universal character set. Key to the relationship between Unicode and HTML is the relationship between the
Unicode_and_HTML
Non-printing format effectors and control codes included in Unicode
Many Unicode characters are used to control the interpretation or display of text, but these characters themselves have no visual or spatial representation
Unicode_control_characters
Characters representing cultural, political and religious symbols
rendering support, you may see question marks, boxes, or other symbols. Unicode contains a number of characters that represent various cultural, political
Religious and political symbols in Unicode
Religious_and_political_symbols_in_Unicode
Encoding for shared Han characters
characters were identified and named CJK Unified Ideographs. As of Unicode 17.0, Unicode defines a total of 101,996 characters. The term ideographs is a
CJK_Unified_Ideographs
Control characters in bidirectional text
bidirectional text. Unicode defines three such characters: the left-to-right mark, the right-to-left mark and the Arabic letter mark. In Unicode, the three marks
Implicit_directional_marks
Unicode character block
symbols "Unicode character database". The Unicode Standard. Retrieved 2023-07-26. "Enumerated Versions of The Unicode Standard". The Unicode Standard
Number_Forms
Unicode character block
Symbols is a Unicode block containing arrows, dots, enclosures, and overlays for modifying symbol characters. Its block name in Unicode 1.0 was simply
Combining Diacritical Marks for Symbols
Combining_Diacritical_Marks_for_Symbols
Unicode block
in Unicode Unicode symbols Mathematical operators and symbols in Unicode Mathematical Alphanumeric Symbols (Unicode block) Currency Symbols (Unicode block)
Letterlike_Symbols
Unicode block (U+1D000..U+1D0FF)
(Unicode block) Ancient Greek Musical Notation (Unicode block) Znamenny Musical Notation (Unicode block) "Unicode character database". The Unicode Standard
Byzantine_Musical_Symbols
Fonts for dingbats
This article contains Unicode emoticons or emoji. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the
Wingdings
Punctuation mark (,)
their names in the Unicode Standard are 'letter with a cedilla'. They were introduced to the Unicode standard before 1992 and, per Unicode Consortium policy
Comma
Unicode character block
This article contains Unicode emoticons or emoji. Without proper rendering support, you may see question marks, boxes, or other symbols instead of the
Emoticons_(Unicode_block)
Unicode for depicting playing cards' fonts and symbols
Unicode is a computing industry standard for the handling of fonts and symbols. Within it is a set of code points representing playing cards, and another
Playing_cards_in_Unicode
Fourth letter of the Latin alphabet
fractions; split-complex numbers. (Unicode U+1D53B 𝔻 MATHEMATICAL DOUBLE-STRUCK CAPITAL D) ₫: Vietnamese đồng (Unicode U+20AB ₫ DONG SIGN) ⅆ: may be used
D
Symbol often denoting 'yes' or 'correct'
Hand gesture indicating approval or disapproval Unicode input – Input characters using their Unicode code points X mark – Symbol with multiple meanings
Check_mark
Diacritical mark
Various precomposed letters with a macron below are defined in Unicode: Note that the Unicode character names of precomposed characters whose decompositions
Macron_below
Unicode block
marks, boxes, or other symbols. Mathematical Alphanumeric Symbols is a Unicode block comprising styled forms of Latin and Greek letters and decimal digits
Mathematical Alphanumeric Symbols
Mathematical_Alphanumeric_Symbols
Unicode character block
This article contains Unicode alchemical symbols. Without proper rendering support, you may see question marks, boxes, or other symbols instead of alchemical
Alchemical Symbols (Unicode block)
Alchemical_Symbols_(Unicode_block)
Effort to map CJK characters in Unicode
boxes, or other symbols. Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the Han
Han_unification
UNICODE
UNICODE
UNICODE
UNICODE
Boy/Male
Greek
Avenger.
Boy/Male
Bengali, Hindu, Indian, Kannada, Marathi, Tamil, Telugu
Lord Murugan
Boy/Male
Muslim
Servant of the merciful one
Boy/Male
Muslim/Islamic
Symbol
Boy/Male
Tamil
Flower
Girl/Female
Tamil
Jayitri | ஜாயிதà¯à®°à¯€
Victorious
Girl/Female
Arabic, Muslim
Beautiful; Sweet
Boy/Male
Indian
Blessed, Auspicious, Oath, Right hand, Right wing, Right side
Girl/Female
Spanish American Greek French
Violet.
Boy/Male
Indian
Moon
UNICODE
UNICODE
UNICODE
UNICODE
UNICODE