Unicode Character Identifier (Javascript)info |
home | sources:
html -
css -
js
Automatic
DetailedChars: 0
✕
Unicode Character Identifier
Enter characters in the text field and click Submit to display character identification results. If the first checkbox
is ticked, the text will be processed automatically as you input it. If the second checkbox is ticked, the output will
include much more detailed information about each character.
The basic output looks like this:
a - U+0061 - LATIN SMALL LETTER A
[character] - [Unicode value] - [name]
In the detailed output, there are a number of character attributes that may displayed. Some, but not all, of these
are present on a given character. In some cases an attribute field may be omitted from the output because
it is missing, or because it has the "normal" or default value, as explained below.
General category: One of several classifications for a character.
Canonical combining classes: This attribute indicates how a combining character is ordered with the character
it combines with. Characters with a default value of 0 here do not display this attribute in their output.
Bidrectional category: Indicates the character's text directionality, such as left-to-right for Latin
characters.
Character decomposition mapping: Some characters can be represented by two or more "combining" characters, such
as an accented letter being represented by the unaccented letter plus the accent character. Other characters have a
single "normalized" form associated with them, such as full-width letters associated with their ordinary Latin
counterparts. The codes for any such characters appear in this field.
Decimal/digit/numeric value: If a character has an associated numeric value, it is indicated in one or more of
these attributes, with each accepting a different interpretation of a numeric value.
Mirrored: Whether a character is considered a mirrored form of another character (such as ">"). This attribute
always exists but is only shown in the output if the chracter is mirrored.
Lowercase mapping: The code for the lowercase equivalent of a character appears in this field.
Uppercase mapping: The code for the uppercase equivalent of a character appears in this field.
Titlecase mapping: The code for the titlecase equivalent of a character appears in this field.
This attribute is only shown in the output if it is different from "Uppercase mapping."