java.lang.Object | ||
↳ | java.lang.Character.Subset | |
↳ | java.lang.Character.UnicodeBlock |
A family of character subsets representing the character blocks in the Unicode specification. Character blocks generally define characters used for a specific script or purpose. A character is contained by at most one Unicode block.
Fields | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
AEGEAN_NUMBERS | Constant for the "Aegean Numbers" Unicode character block. | ||||||||||
ALPHABETIC_PRESENTATION_FORMS | Constant for the "Alphabetic Presentation Forms" Unicode character block. | ||||||||||
ARABIC | Constant for the "Arabic" Unicode character block. | ||||||||||
ARABIC_PRESENTATION_FORMS_A | Constant for the "Arabic Presentation Forms-A" Unicode character block. | ||||||||||
ARABIC_PRESENTATION_FORMS_B | Constant for the "Arabic Presentation Forms-B" Unicode character block. | ||||||||||
ARMENIAN | Constant for the "Armenian" Unicode character block. | ||||||||||
ARROWS | Constant for the "Arrows" Unicode character block. | ||||||||||
BASIC_LATIN | Constant for the "Basic Latin" Unicode character block. | ||||||||||
BENGALI | Constant for the "Bengali" Unicode character block. | ||||||||||
BLOCK_ELEMENTS | Constant for the "Block Elements" Unicode character block. | ||||||||||
BOPOMOFO | Constant for the "Bopomofo" Unicode character block. | ||||||||||
BOPOMOFO_EXTENDED | Constant for the "Bopomofo Extended" Unicode character block. | ||||||||||
BOX_DRAWING | Constant for the "Box Drawing" Unicode character block. | ||||||||||
BRAILLE_PATTERNS | Constant for the "Braille Patterns" Unicode character block. | ||||||||||
BUHID | Constant for the "Buhid" Unicode character block. | ||||||||||
BYZANTINE_MUSICAL_SYMBOLS | Constant for the "Byzantine Musical Symbols" Unicode character block. | ||||||||||
CHEROKEE | Constant for the "Cherokee" Unicode character block. | ||||||||||
CJK_COMPATIBILITY | Constant for the "CJK Compatibility" Unicode character block. | ||||||||||
CJK_COMPATIBILITY_FORMS | Constant for the "CJK Compatibility Forms" Unicode character block. | ||||||||||
CJK_COMPATIBILITY_IDEOGRAPHS | Constant for the "CJK Compatibility Ideographs" Unicode character block. | ||||||||||
CJK_COMPATIBILITY_IDEOGRAPHS_SUPPLEMENT | Constant for the "CJK Compatibility Ideographs Supplement" Unicode character block. | ||||||||||
CJK_RADICALS_SUPPLEMENT | Constant for the "CJK Radicals Supplement" Unicode character block. | ||||||||||
CJK_SYMBOLS_AND_PUNCTUATION | Constant for the "CJK Symbols and Punctuation" Unicode character block. | ||||||||||
CJK_UNIFIED_IDEOGRAPHS | Constant for the "CJK Unified Ideographs" Unicode character block. | ||||||||||
CJK_UNIFIED_IDEOGRAPHS_EXTENSION_A | Constant for the "CJK Unified Ideographs Extension A" Unicode character block. | ||||||||||
CJK_UNIFIED_IDEOGRAPHS_EXTENSION_B | Constant for the "CJK Unified Ideographs Extension B" Unicode character block. | ||||||||||
COMBINING_DIACRITICAL_MARKS | Constant for the "Combining Diacritical Marks" Unicode character block. | ||||||||||
COMBINING_HALF_MARKS | Constant for the "Combining Half Marks" Unicode character block. | ||||||||||
COMBINING_MARKS_FOR_SYMBOLS | Constant for the "Combining Diacritical Marks for Symbols" Unicode character block. | ||||||||||
CONTROL_PICTURES | Constant for the "Control Pictures" Unicode character block. | ||||||||||
CURRENCY_SYMBOLS | Constant for the "Currency Symbols" Unicode character block. | ||||||||||
CYPRIOT_SYLLABARY | Constant for the "Cypriot Syllabary" Unicode character block. | ||||||||||
CYRILLIC | Constant for the "Cyrillic" Unicode character block. | ||||||||||
CYRILLIC_SUPPLEMENTARY | Constant for the "Cyrillic Supplementary" Unicode character block. | ||||||||||
DESERET | Constant for the "Deseret" Unicode character block. | ||||||||||
DEVANAGARI | Constant for the "Devanagari" Unicode character block. | ||||||||||
DINGBATS | Constant for the "Dingbats" Unicode character block. | ||||||||||
ENCLOSED_ALPHANUMERICS | Constant for the "Enclosed Alphanumerics" Unicode character block. | ||||||||||
ENCLOSED_CJK_LETTERS_AND_MONTHS | Constant for the "Enclosed CJK Letters and Months" Unicode character block. | ||||||||||
ETHIOPIC | Constant for the "Ethiopic" Unicode character block. | ||||||||||
GENERAL_PUNCTUATION | Constant for the "General Punctuation" Unicode character block. | ||||||||||
GEOMETRIC_SHAPES | Constant for the "Geometric Shapes" Unicode character block. | ||||||||||
GEORGIAN | Constant for the "Georgian" Unicode character block. | ||||||||||
GOTHIC | Constant for the "Gothic" Unicode character block. | ||||||||||
GREEK | Constant for the "Greek and Coptic" Unicode character block. | ||||||||||
GREEK_EXTENDED | Constant for the "Greek Extended" Unicode character block. | ||||||||||
GUJARATI | Constant for the "Gujarati" Unicode character block. | ||||||||||
GURMUKHI | Constant for the "Gurmukhi" Unicode character block. | ||||||||||
HALFWIDTH_AND_FULLWIDTH_FORMS | Constant for the "Halfwidth and Fullwidth Forms" Unicode character block. | ||||||||||
HANGUL_COMPATIBILITY_JAMO | Constant for the "Hangul Compatibility Jamo" Unicode character block. | ||||||||||
HANGUL_JAMO | Constant for the "Hangul Jamo" Unicode character block. | ||||||||||
HANGUL_SYLLABLES | Constant for the "Hangul Syllables" Unicode character block. | ||||||||||
HANUNOO | Constant for the "Hanunoo" Unicode character block. | ||||||||||
HEBREW | Constant for the "Hebrew" Unicode character block. | ||||||||||
HIGH_PRIVATE_USE_SURROGATES | Constant for the "High Private Use Surrogates" Unicode character block. | ||||||||||
HIGH_SURROGATES | Constant for the "High Surrogates" Unicode character block. | ||||||||||
HIRAGANA | Constant for the "Hiragana" Unicode character block. | ||||||||||
IDEOGRAPHIC_DESCRIPTION_CHARACTERS | Constant for the "Ideographic Description Characters" Unicode character block. | ||||||||||
IPA_EXTENSIONS | Constant for the "IPA Extensions" Unicode character block. | ||||||||||
KANBUN | Constant for the "Kanbun" Unicode character block. | ||||||||||
KANGXI_RADICALS | Constant for the "Kangxi Radicals" Unicode character block. | ||||||||||
KANNADA | Constant for the "Kannada" Unicode character block. | ||||||||||
KATAKANA | Constant for the "Katakana" Unicode character block. | ||||||||||
KATAKANA_PHONETIC_EXTENSIONS | Constant for the "Katakana Phonetic Extensions" Unicode character block. | ||||||||||
KHMER | Constant for the "Khmer" Unicode character block. | ||||||||||
KHMER_SYMBOLS | Constant for the "Khmer Symbols" Unicode character block. | ||||||||||
LAO | Constant for the "Lao" Unicode character block. | ||||||||||
LATIN_1_SUPPLEMENT | Constant for the "Latin-1 Supplement" Unicode character block. | ||||||||||
LATIN_EXTENDED_A | Constant for the "Latin Extended-A" Unicode character block. | ||||||||||
LATIN_EXTENDED_ADDITIONAL | Constant for the "Latin Extended Additional" Unicode character block. | ||||||||||
LATIN_EXTENDED_B | Constant for the "Latin Extended-B" Unicode character block. | ||||||||||
LETTERLIKE_SYMBOLS | Constant for the "Letterlike Symbols" Unicode character block. | ||||||||||
LIMBU | Constant for the "Limbu" Unicode character block. | ||||||||||
LINEAR_B_IDEOGRAMS | Constant for the "Linear B Ideograms" Unicode character block. | ||||||||||
LINEAR_B_SYLLABARY | Constant for the "Linear B Syllabary" Unicode character block. | ||||||||||
LOW_SURROGATES | Constant for the "Low Surrogates" Unicode character block. | ||||||||||
MALAYALAM | Constant for the "Malayalam" Unicode character block. | ||||||||||
MATHEMATICAL_ALPHANUMERIC_SYMBOLS | Constant for the "Mathematical Alphanumeric Symbols" Unicode character block. | ||||||||||
MATHEMATICAL_OPERATORS | Constant for the "Mathematical Operators" Unicode character block. | ||||||||||
MISCELLANEOUS_MATHEMATICAL_SYMBOLS_A | Constant for the "Miscellaneous Mathematical Symbols-A" Unicode character block. | ||||||||||
MISCELLANEOUS_MATHEMATICAL_SYMBOLS_B | Constant for the "Miscellaneous Mathematical Symbols-B" Unicode character block. | ||||||||||
MISCELLANEOUS_SYMBOLS | Constant for the "Miscellaneous Symbols" Unicode character block. | ||||||||||
MISCELLANEOUS_SYMBOLS_AND_ARROWS | Constant for the "Miscellaneous Symbols and Arrows" Unicode character block. | ||||||||||
MISCELLANEOUS_TECHNICAL | Constant for the "Miscellaneous Technical" Unicode character block. | ||||||||||
MONGOLIAN | Constant for the "Mongolian" Unicode character block. | ||||||||||
MUSICAL_SYMBOLS | Constant for the "Musical Symbols" Unicode character block. | ||||||||||
MYANMAR | Constant for the "Myanmar" Unicode character block. | ||||||||||
NUMBER_FORMS | Constant for the "Number Forms" Unicode character block. | ||||||||||
OGHAM | Constant for the "Ogham" Unicode character block. | ||||||||||
OLD_ITALIC | Constant for the "Old Italic" Unicode character block. | ||||||||||
OPTICAL_CHARACTER_RECOGNITION | Constant for the "Optical Character Recognition" Unicode character block. | ||||||||||
ORIYA | Constant for the "Oriya" Unicode character block. | ||||||||||
OSMANYA | Constant for the "Osmanya" Unicode character block. | ||||||||||
PHONETIC_EXTENSIONS | Constant for the "Phonetic Extensions" Unicode character block. | ||||||||||
PRIVATE_USE_AREA | Constant for the "Private Use Area" Unicode character block. | ||||||||||
RUNIC | Constant for the "Runic" Unicode character block. | ||||||||||
SHAVIAN | Constant for the "Shavian" Unicode character block. | ||||||||||
SINHALA | Constant for the "Sinhala" Unicode character block. | ||||||||||
SMALL_FORM_VARIANTS | Constant for the "Small Form Variants" Unicode character block. | ||||||||||
SPACING_MODIFIER_LETTERS | Constant for the "Spacing Modifier Letters" Unicode character block. | ||||||||||
SPECIALS | Constant for the "Specials" Unicode character block. | ||||||||||
SUPERSCRIPTS_AND_SUBSCRIPTS | Constant for the "Superscripts and Subscripts" Unicode character block. | ||||||||||
SUPPLEMENTAL_ARROWS_A | Constant for the "Supplemental Arrows-A" Unicode character block. | ||||||||||
SUPPLEMENTAL_ARROWS_B | Constant for the "Supplemental Arrows-B" Unicode character block. | ||||||||||
SUPPLEMENTAL_MATHEMATICAL_OPERATORS | Constant for the "Supplemental Mathematical Operators" Unicode character block. | ||||||||||
SUPPLEMENTARY_PRIVATE_USE_AREA_A | Constant for the "Supplementary Private Use Area-A" Unicode character block. | ||||||||||
SUPPLEMENTARY_PRIVATE_USE_AREA_B | Constant for the "Supplementary Private Use Area-B" Unicode character block. | ||||||||||
SURROGATES_AREA |
This field is deprecated.
As of J2SE 5, use HIGH_SURROGATES ,
HIGH_PRIVATE_USE_SURROGATES , and
LOW_SURROGATES . These new constants match
the block definitions of the Unicode Standard.
The of(char) and of(int) methods
return the new constants, not SURROGATES_AREA.
|
||||||||||
SYRIAC | Constant for the "Syriac" Unicode character block. | ||||||||||
TAGALOG | Constant for the "Tagalog" Unicode character block. | ||||||||||
TAGBANWA | Constant for the "Tagbanwa" Unicode character block. | ||||||||||
TAGS | Constant for the "Tags" Unicode character block. | ||||||||||
TAI_LE | Constant for the "Tai Le" Unicode character block. | ||||||||||
TAI_XUAN_JING_SYMBOLS | Constant for the "Tai Xuan Jing Symbols" Unicode character block. | ||||||||||
TAMIL | Constant for the "Tamil" Unicode character block. | ||||||||||
TELUGU | Constant for the "Telugu" Unicode character block. | ||||||||||
THAANA | Constant for the "Thaana" Unicode character block. | ||||||||||
THAI | Constant for the "Thai" Unicode character block. | ||||||||||
TIBETAN | Constant for the "Tibetan" Unicode character block. | ||||||||||
UGARITIC | Constant for the "Ugaritic" Unicode character block. | ||||||||||
UNIFIED_CANADIAN_ABORIGINAL_SYLLABICS | Constant for the "Unified Canadian Aboriginal Syllabics" Unicode character block. | ||||||||||
VARIATION_SELECTORS | Constant for the "Variation Selectors" Unicode character block. | ||||||||||
VARIATION_SELECTORS_SUPPLEMENT | Constant for the "Variation Selectors Supplement" Unicode character block. | ||||||||||
YIJING_HEXAGRAM_SYMBOLS | Constant for the "Yijing Hexagram Symbols" Unicode character block. | ||||||||||
YI_RADICALS | Constant for the "Yi Radicals" Unicode character block. | ||||||||||
YI_SYLLABLES | Constant for the "Yi Syllables" Unicode character block. |
Public Methods | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Returns the UnicodeBlock with the given name.
| |||||||||||
Returns the object representing the Unicode block containing the
given character, or
null if the character is not a
member of a defined block. | |||||||||||
Returns the object representing the Unicode block
containing the given character (Unicode code point), or
null if the character is not a member of a
defined block. |
[Expand]
Inherited Methods | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
From class
java.lang.Character.Subset
| |||||||||||
From class
java.lang.Object
|
Constant for the "Aegean Numbers" Unicode character block.
Constant for the "Alphabetic Presentation Forms" Unicode character block.
Constant for the "Arabic" Unicode character block.
Constant for the "Arabic Presentation Forms-A" Unicode character block.
Constant for the "Arabic Presentation Forms-B" Unicode character block.
Constant for the "Armenian" Unicode character block.
Constant for the "Arrows" Unicode character block.
Constant for the "Basic Latin" Unicode character block.
Constant for the "Bengali" Unicode character block.
Constant for the "Block Elements" Unicode character block.
Constant for the "Bopomofo" Unicode character block.
Constant for the "Bopomofo Extended" Unicode character block.
Constant for the "Box Drawing" Unicode character block.
Constant for the "Braille Patterns" Unicode character block.
Constant for the "Byzantine Musical Symbols" Unicode character block.
Constant for the "Cherokee" Unicode character block.
Constant for the "CJK Compatibility" Unicode character block.
Constant for the "CJK Compatibility Forms" Unicode character block.
Constant for the "CJK Compatibility Ideographs" Unicode character block.
Constant for the "CJK Compatibility Ideographs Supplement" Unicode character block.
Constant for the "CJK Radicals Supplement" Unicode character block.
Constant for the "CJK Symbols and Punctuation" Unicode character block.
Constant for the "CJK Unified Ideographs" Unicode character block.
Constant for the "CJK Unified Ideographs Extension A" Unicode character block.
Constant for the "CJK Unified Ideographs Extension B" Unicode character block.
Constant for the "Combining Diacritical Marks" Unicode character block.
Constant for the "Combining Half Marks" Unicode character block.
Constant for the "Combining Diacritical Marks for Symbols" Unicode character block.
This block was previously known as "Combining Marks for Symbols".
Constant for the "Control Pictures" Unicode character block.
Constant for the "Currency Symbols" Unicode character block.
Constant for the "Cypriot Syllabary" Unicode character block.
Constant for the "Cyrillic" Unicode character block.
Constant for the "Cyrillic Supplementary" Unicode character block.
Constant for the "Deseret" Unicode character block.
Constant for the "Devanagari" Unicode character block.
Constant for the "Dingbats" Unicode character block.
Constant for the "Enclosed Alphanumerics" Unicode character block.
Constant for the "Enclosed CJK Letters and Months" Unicode character block.
Constant for the "Ethiopic" Unicode character block.
Constant for the "General Punctuation" Unicode character block.
Constant for the "Geometric Shapes" Unicode character block.
Constant for the "Georgian" Unicode character block.
Constant for the "Gothic" Unicode character block.
Constant for the "Greek and Coptic" Unicode character block.
This block was previously known as the "Greek" block.
Constant for the "Greek Extended" Unicode character block.
Constant for the "Gujarati" Unicode character block.
Constant for the "Gurmukhi" Unicode character block.
Constant for the "Halfwidth and Fullwidth Forms" Unicode character block.
Constant for the "Hangul Compatibility Jamo" Unicode character block.
Constant for the "Hangul Jamo" Unicode character block.
Constant for the "Hangul Syllables" Unicode character block.
Constant for the "Hanunoo" Unicode character block.
Constant for the "Hebrew" Unicode character block.
Constant for the "High Private Use Surrogates" Unicode character block. This block represents codepoint values in the high surrogate range: 0xDB80 through 0xDBFF
Constant for the "High Surrogates" Unicode character block. This block represents codepoint values in the high surrogate range: 0xD800 through 0xDB7F
Constant for the "Hiragana" Unicode character block.
Constant for the "Ideographic Description Characters" Unicode character block.
Constant for the "IPA Extensions" Unicode character block.
Constant for the "Kanbun" Unicode character block.
Constant for the "Kangxi Radicals" Unicode character block.
Constant for the "Kannada" Unicode character block.
Constant for the "Katakana" Unicode character block.
Constant for the "Katakana Phonetic Extensions" Unicode character block.
Constant for the "Khmer Symbols" Unicode character block.
Constant for the "Latin-1 Supplement" Unicode character block.
Constant for the "Latin Extended-A" Unicode character block.
Constant for the "Latin Extended Additional" Unicode character block.
Constant for the "Latin Extended-B" Unicode character block.
Constant for the "Letterlike Symbols" Unicode character block.
Constant for the "Linear B Ideograms" Unicode character block.
Constant for the "Linear B Syllabary" Unicode character block.
Constant for the "Low Surrogates" Unicode character block. This block represents codepoint values in the high surrogate range: 0xDC00 through 0xDFFF
Constant for the "Malayalam" Unicode character block.
Constant for the "Mathematical Alphanumeric Symbols" Unicode character block.
Constant for the "Mathematical Operators" Unicode character block.
Constant for the "Miscellaneous Mathematical Symbols-A" Unicode character block.
Constant for the "Miscellaneous Mathematical Symbols-B" Unicode character block.
Constant for the "Miscellaneous Symbols" Unicode character block.
Constant for the "Miscellaneous Symbols and Arrows" Unicode character block.
Constant for the "Miscellaneous Technical" Unicode character block.
Constant for the "Mongolian" Unicode character block.
Constant for the "Musical Symbols" Unicode character block.
Constant for the "Myanmar" Unicode character block.
Constant for the "Number Forms" Unicode character block.
Constant for the "Old Italic" Unicode character block.
Constant for the "Optical Character Recognition" Unicode character block.
Constant for the "Osmanya" Unicode character block.
Constant for the "Phonetic Extensions" Unicode character block.
Constant for the "Private Use Area" Unicode character block.
Constant for the "Shavian" Unicode character block.
Constant for the "Sinhala" Unicode character block.
Constant for the "Small Form Variants" Unicode character block.
Constant for the "Spacing Modifier Letters" Unicode character block.
Constant for the "Specials" Unicode character block.
Constant for the "Superscripts and Subscripts" Unicode character block.
Constant for the "Supplemental Arrows-A" Unicode character block.
Constant for the "Supplemental Arrows-B" Unicode character block.
Constant for the "Supplemental Mathematical Operators" Unicode character block.
Constant for the "Supplementary Private Use Area-A" Unicode character block.
Constant for the "Supplementary Private Use Area-B" Unicode character block.
This field is deprecated.
As of J2SE 5, use HIGH_SURROGATES
,
HIGH_PRIVATE_USE_SURROGATES
, and
LOW_SURROGATES
. These new constants match
the block definitions of the Unicode Standard.
The of(char)
and of(int)
methods
return the new constants, not SURROGATES_AREA.
Constant for the "Syriac" Unicode character block.
Constant for the "Tagalog" Unicode character block.
Constant for the "Tagbanwa" Unicode character block.
Constant for the "Tai Le" Unicode character block.
Constant for the "Tai Xuan Jing Symbols" Unicode character block.
Constant for the "Telugu" Unicode character block.
Constant for the "Thaana" Unicode character block.
Constant for the "Tibetan" Unicode character block.
Constant for the "Ugaritic" Unicode character block.
Constant for the "Unified Canadian Aboriginal Syllabics" Unicode character block.
Constant for the "Variation Selectors" Unicode character block.
Constant for the "Variation Selectors Supplement" Unicode character block.
Constant for the "Yijing Hexagram Symbols" Unicode character block.
Constant for the "Yi Radicals" Unicode character block.
Constant for the "Yi Syllables" Unicode character block.
Returns the UnicodeBlock with the given name. Block
names are determined by The Unicode Standard. The file
Blocks-<version>.txt defines blocks for a particular
version of the standard. The Character
class specifies
the version of the standard that it supports.
This method accepts block names in the following forms:
BASIC_LATIN
block if
provided with the "BASIC_LATIN" name. This form replaces all spaces and
hyphens in the canonical name with underscores.
If the Unicode Standard changes block names, both the previous and current names will be accepted.
blockName | A UnicodeBlock name. |
---|
UnicodeBlock
instance identified
by blockName
IllegalArgumentException | if blockName is an
invalid name |
---|---|
NullPointerException | if blockName is null |
Returns the object representing the Unicode block containing the
given character, or null
if the character is not a
member of a defined block.
Note: This method cannot handle supplementary
characters. To support all Unicode characters,
including supplementary characters, use the of(int)
method.
c | The character in question |
---|
UnicodeBlock
instance representing the
Unicode block of which this character is a member, or
null
if the character is not a member of any
Unicode block
Returns the object representing the Unicode block
containing the given character (Unicode code point), or
null
if the character is not a member of a
defined block.
codePoint | the character (Unicode code point) in question. |
---|
UnicodeBlock
instance representing the
Unicode block of which this character is a member, or
null
if the character is not a member of any
Unicode blockIllegalArgumentException | if the specified
codePoint is an invalid Unicode code point. |
---|