Character (Java SE 23 & JDK 23)

Nested Class Summary

Nested Classes

static class

Instances of this class represent particular subsets of the Unicode character set.

static final class

A family of character subsets representing the character blocks in the Unicode specification.

static enum
Field Summary

Fields

static final int

The number of bytes used to represent a char value in unsigned binary form.

static final byte

General category "Mc" in the Unicode specification.

static final byte

General category "Pc" in the Unicode specification.

static final byte

General category "Cc" in the Unicode specification.

static final byte

General category "Sc" in the Unicode specification.

static final byte

General category "Pd" in the Unicode specification.

static final byte

General category "Nd" in the Unicode specification.

static final byte

Weak bidirectional character type "AN" in the Unicode specification.

static final byte

Weak bidirectional character type "BN" in the Unicode specification.

static final byte

Weak bidirectional character type "CS" in the Unicode specification.

static final byte

Weak bidirectional character type "EN" in the Unicode specification.

static final byte

Weak bidirectional character type "ES" in the Unicode specification.

static final byte

Weak bidirectional character type "ET" in the Unicode specification.

static final byte

Weak bidirectional character type "FSI" in the Unicode specification.

static final byte

Strong bidirectional character type "L" in the Unicode specification.

static final byte

Strong bidirectional character type "LRE" in the Unicode specification.

static final byte

Weak bidirectional character type "LRI" in the Unicode specification.

static final byte

Strong bidirectional character type "LRO" in the Unicode specification.

static final byte

Weak bidirectional character type "NSM" in the Unicode specification.

static final byte

Neutral bidirectional character type "ON" in the Unicode specification.

static final byte

Neutral bidirectional character type "B" in the Unicode specification.

static final byte

Weak bidirectional character type "PDF" in the Unicode specification.

static final byte

Weak bidirectional character type "PDI" in the Unicode specification.

static final byte

Strong bidirectional character type "R" in the Unicode specification.

static final byte

Strong bidirectional character type "AL" in the Unicode specification.

static final byte

Strong bidirectional character type "RLE" in the Unicode specification.

static final byte

Weak bidirectional character type "RLI" in the Unicode specification.

static final byte

Strong bidirectional character type "RLO" in the Unicode specification.

static final byte

Neutral bidirectional character type "S" in the Unicode specification.

static final byte

Undefined bidirectional character type.

static final byte

Neutral bidirectional character type "WS" in the Unicode specification.

static final byte

General category "Me" in the Unicode specification.

static final byte

General category "Pe" in the Unicode specification.

static final byte

General category "Pf" in the Unicode specification.

static final byte

General category "Cf" in the Unicode specification.

static final byte

General category "Pi" in the Unicode specification.

static final byte

General category "Nl" in the Unicode specification.

static final byte

General category "Zl" in the Unicode specification.

static final byte

General category "Ll" in the Unicode specification.

static final byte

General category "Sm" in the Unicode specification.

static final int

static final char

static final char

static final int

The maximum radix available for conversion to and from strings.

static final char

The maximum value of a Unicode surrogate code unit in the UTF-16 encoding, constant '\uDFFF'.

static final char

The constant value of this field is the largest value of type char, '\uFFFF'.

static final int

static final char

static final char

static final int

The minimum radix available for conversion to and from strings.

static final int

static final char

The minimum value of a Unicode surrogate code unit in the UTF-16 encoding, constant '\uD800'.

static final char

The constant value of this field is the smallest value of type char, '\u0000'.

static final byte

General category "Lm" in the Unicode specification.

static final byte

General category "Sk" in the Unicode specification.

static final byte

General category "Mn" in the Unicode specification.

static final byte

General category "Lo" in the Unicode specification.

static final byte

General category "No" in the Unicode specification.

static final byte

General category "Po" in the Unicode specification.

static final byte

General category "So" in the Unicode specification.

static final byte

General category "Zp" in the Unicode specification.

static final byte

General category "Co" in the Unicode specification.

static final int

The number of bits used to represent a char value in unsigned binary form, constant 16.

static final byte

General category "Zs" in the Unicode specification.

static final byte

General category "Ps" in the Unicode specification.

static final byte

General category "Cs" in the Unicode specification.

static final byte

General category "Lt" in the Unicode specification.

The Class instance representing the primitive type char.

static final byte

General category "Cn" in the Unicode specification.

static final byte

General category "Lu" in the Unicode specification.
Constructor Summary

Constructors

Character(char value)

Deprecated, for removal: This API element is subject to removal in a future version.
Method Summary

static int

charCount(int codePoint)

Determines the number of char values needed to represent the specified character (Unicode code point).

char

charValue()

Returns the value of this Character object.

static int

codePointAt(char[] a, int index)

Returns the code point at the given index of the char array.

static int

codePointAt(char[] a, int index, int limit)

Returns the code point at the given index of the char array, where only array elements with index less than limit can be used.

static int

Returns the code point at the given index of the CharSequence.

static int

codePointBefore(char[] a, int index)

Returns the code point preceding the given index of the char array.

static int

codePointBefore(char[] a, int index, int start)

Returns the code point preceding the given index of the char array, where only array elements with index greater than or equal to start can be used.

static int

Returns the code point preceding the given index of the CharSequence.

static int

codePointCount(char[] a, int offset, int count)

Returns the number of Unicode code points in a subarray of the char array argument.

static int

Returns the number of Unicode code points in the text range of the specified char sequence.

static int

Returns the code point value of the Unicode character specified by the given character name.

static int

compare(char x, char y)

Compares two char values numerically.

int

Compares two Character objects numerically.

Returns an Optional containing the nominal descriptor for this instance.

static int

digit(char ch, int radix)

Returns the numeric value of the character ch in the specified radix.

static int

digit(int codePoint, int radix)

Returns the numeric value of the specified character (Unicode code point) in the specified radix.

boolean

Compares this object against the specified object.

static char

forDigit(int digit, int radix)

Determines the character representation for a specific digit in the specified radix.

static byte

getDirectionality(char ch)

Returns the Unicode directionality property for the given character.

static byte

getDirectionality(int codePoint)

Returns the Unicode directionality property for the given character (Unicode code point).

getName(int codePoint)

Returns the name of the specified character codePoint, or null if the code point is unassigned.

static int

getNumericValue(char ch)

Returns the int value that the specified Unicode character represents.

static int

getNumericValue(int codePoint)

Returns the int value that the specified character (Unicode code point) represents.

static int

getType(char ch)

Returns a value indicating a character's general category.

static int

getType(int codePoint)

Returns a value indicating a character's general category.

int

hashCode()

Returns a hash code for this Character; equal to the result of invoking charValue().

static int

hashCode(char value)

Returns a hash code for a char value; compatible with Character.hashCode().

static char

highSurrogate(int codePoint)

static boolean

isAlphabetic(int codePoint)

Determines if the specified character (Unicode code point) is alphabetic.

static boolean

isBmpCodePoint(int codePoint)

Determines whether the specified character (Unicode code point) is in the Basic Multilingual Plane (BMP).

static boolean

isDefined(char ch)

Determines if a character is defined in Unicode.

static boolean

isDefined(int codePoint)

Determines if a character (Unicode code point) is defined in Unicode.

static boolean

isDigit(char ch)

Determines if the specified character is a digit.

static boolean

isDigit(int codePoint)

Determines if the specified character (Unicode code point) is a digit.

static boolean

isEmoji(int codePoint)

Determines if the specified character (Unicode code point) is an Emoji.

static boolean

isEmojiComponent(int codePoint)

Determines if the specified character (Unicode code point) is an Emoji Component.

static boolean

isEmojiModifier(int codePoint)

Determines if the specified character (Unicode code point) is an Emoji Modifier.

static boolean

isEmojiModifierBase(int codePoint)

Determines if the specified character (Unicode code point) is an Emoji Modifier Base.

static boolean

isEmojiPresentation(int codePoint)

Determines if the specified character (Unicode code point) has the Emoji Presentation property by default.

static boolean

isExtendedPictographic(int codePoint)

Determines if the specified character (Unicode code point) is an Extended Pictographic.

static boolean

isHighSurrogate(char ch)

static boolean

isIdentifierIgnorable(char ch)

Determines if the specified character should be regarded as an ignorable character in a Java identifier or a Unicode identifier.

static boolean

isIdentifierIgnorable(int codePoint)

Determines if the specified character (Unicode code point) should be regarded as an ignorable character in a Java identifier or a Unicode identifier.

static boolean

isIdeographic(int codePoint)

Determines if the specified character (Unicode code point) is a CJKV (Chinese, Japanese, Korean and Vietnamese) ideograph, as defined by the Unicode Standard.

static boolean

isISOControl(char ch)

Determines if the specified character is an ISO control character.

static boolean

isISOControl(int codePoint)

Determines if the referenced character (Unicode code point) is an ISO control character.

static boolean

isJavaIdentifierPart(char ch)

Determines if the specified character may be part of a Java identifier as other than the first character.

static boolean

isJavaIdentifierPart(int codePoint)

Determines if the character (Unicode code point) may be part of a Java identifier as other than the first character.

static boolean

isJavaIdentifierStart(char ch)

Determines if the specified character is permissible as the first character in a Java identifier.

static boolean

isJavaIdentifierStart(int codePoint)

Determines if the character (Unicode code point) is permissible as the first character in a Java identifier.

static boolean

isJavaLetter(char ch)

static boolean

isJavaLetterOrDigit(char ch)

static boolean

isLetter(char ch)

Determines if the specified character is a letter.

static boolean

isLetter(int codePoint)

Determines if the specified character (Unicode code point) is a letter.

static boolean

isLetterOrDigit(char ch)

Determines if the specified character is a letter or digit.

static boolean

isLetterOrDigit(int codePoint)

Determines if the specified character (Unicode code point) is a letter or digit.

static boolean

isLowerCase(char ch)

Determines if the specified character is a lowercase character.

static boolean

isLowerCase(int codePoint)

Determines if the specified character (Unicode code point) is a lowercase character.

static boolean

isLowSurrogate(char ch)

static boolean

isMirrored(char ch)

Determines whether the character is mirrored according to the Unicode specification.

static boolean

isMirrored(int codePoint)

Determines whether the specified character (Unicode code point) is mirrored according to the Unicode specification.

static boolean

isSpace(char ch)

static boolean

isSpaceChar(char ch)

Determines if the specified character is a Unicode space character.

static boolean

isSpaceChar(int codePoint)

Determines if the specified character (Unicode code point) is a Unicode space character.

static boolean

isSupplementaryCodePoint(int codePoint)

Determines whether the specified character (Unicode code point) is in the supplementary character range.

static boolean

isSurrogate(char ch)

Determines if the given char value is a Unicode surrogate code unit.

static boolean

isSurrogatePair(char high, char low)

static boolean

isTitleCase(char ch)

Determines if the specified character is a titlecase character.

static boolean

isTitleCase(int codePoint)

Determines if the specified character (Unicode code point) is a titlecase character.

static boolean

isUnicodeIdentifierPart(char ch)

Determines if the specified character may be part of a Unicode identifier as other than the first character.

static boolean

isUnicodeIdentifierPart(int codePoint)

Determines if the specified character (Unicode code point) may be part of a Unicode identifier as other than the first character.

static boolean

isUnicodeIdentifierStart(char ch)

Determines if the specified character is permissible as the first character in a Unicode identifier.

static boolean

isUnicodeIdentifierStart(int codePoint)

Determines if the specified character (Unicode code point) is permissible as the first character in a Unicode identifier.

static boolean

isUpperCase(char ch)

Determines if the specified character is an uppercase character.

static boolean

isUpperCase(int codePoint)

Determines if the specified character (Unicode code point) is an uppercase character.

static boolean

isValidCodePoint(int codePoint)

static boolean

isWhitespace(char ch)

Determines if the specified character is white space according to Java.

static boolean

isWhitespace(int codePoint)

Determines if the specified character (Unicode code point) is white space according to Java.

static char

lowSurrogate(int codePoint)

static int

offsetByCodePoints(char[] a, int start, int count, int index, int codePointOffset)

Returns the index within the given char subarray that is offset from the given index by codePointOffset code points.

static int

Returns the index within the given char sequence that is offset from the given index by codePointOffset code points.

static char

reverseBytes(char ch)

Returns the value obtained by reversing the order of the bytes in the specified char value.

static char[]

toChars(int codePoint)

Converts the specified character (Unicode code point) to its UTF-16 representation stored in a char array.

static int

toChars(int codePoint, char[] dst, int dstIndex)

Converts the specified character (Unicode code point) to its UTF-16 representation.

static int

toCodePoint(char high, char low)

Converts the specified surrogate pair to its supplementary code point value.

static char

toLowerCase(char ch)

Converts the character argument to lowercase using case mapping information from the UnicodeData file.

static int

toLowerCase(int codePoint)

Converts the character (Unicode code point) argument to lowercase using case mapping information from the UnicodeData file.

toString()

Returns a String object representing this Character's value.

toString(char c)

Returns a String object representing the specified char.

toString(int codePoint)

Returns a String object representing the specified character (Unicode code point).

static char

toTitleCase(char ch)

Converts the character argument to titlecase using case mapping information from the UnicodeData file.

static int

toTitleCase(int codePoint)

Converts the character (Unicode code point) argument to titlecase using case mapping information from the UnicodeData file.

static char

toUpperCase(char ch)

Converts the character argument to uppercase using case mapping information from the UnicodeData file.

static int

toUpperCase(int codePoint)

Converts the character (Unicode code point) argument to uppercase using case mapping information from the UnicodeData file.

valueOf(char c)

Returns a Character instance representing the specified char value.

Field Details
- MIN_RADIX
- MAX_RADIX
- MIN_VALUE
- MAX_VALUE
- TYPE
- UNASSIGNED
- UPPERCASE_LETTER
- LOWERCASE_LETTER
- TITLECASE_LETTER
- MODIFIER_LETTER
- OTHER_LETTER
- NON_SPACING_MARK
- ENCLOSING_MARK
- COMBINING_SPACING_MARK
- DECIMAL_DIGIT_NUMBER
- LETTER_NUMBER
- OTHER_NUMBER
- SPACE_SEPARATOR
- LINE_SEPARATOR
- PARAGRAPH_SEPARATOR
- CONTROL
- FORMAT
- PRIVATE_USE
- SURROGATE
- DASH_PUNCTUATION
- START_PUNCTUATION
- END_PUNCTUATION
- CONNECTOR_PUNCTUATION
- OTHER_PUNCTUATION
- MATH_SYMBOL
- CURRENCY_SYMBOL
- MODIFIER_SYMBOL
- OTHER_SYMBOL
- INITIAL_QUOTE_PUNCTUATION
- FINAL_QUOTE_PUNCTUATION
- DIRECTIONALITY_UNDEFINED
- DIRECTIONALITY_LEFT_TO_RIGHT
- DIRECTIONALITY_RIGHT_TO_LEFT
- DIRECTIONALITY_RIGHT_TO_LEFT_ARABIC
- DIRECTIONALITY_EUROPEAN_NUMBER
- DIRECTIONALITY_EUROPEAN_NUMBER_SEPARATOR
- DIRECTIONALITY_EUROPEAN_NUMBER_TERMINATOR
- DIRECTIONALITY_ARABIC_NUMBER
- DIRECTIONALITY_COMMON_NUMBER_SEPARATOR
- DIRECTIONALITY_NONSPACING_MARK
- DIRECTIONALITY_BOUNDARY_NEUTRAL
- DIRECTIONALITY_PARAGRAPH_SEPARATOR
- DIRECTIONALITY_SEGMENT_SEPARATOR
- DIRECTIONALITY_WHITESPACE
- DIRECTIONALITY_OTHER_NEUTRALS
- DIRECTIONALITY_LEFT_TO_RIGHT_EMBEDDING
- DIRECTIONALITY_LEFT_TO_RIGHT_OVERRIDE
- DIRECTIONALITY_RIGHT_TO_LEFT_EMBEDDING
- DIRECTIONALITY_RIGHT_TO_LEFT_OVERRIDE
- DIRECTIONALITY_POP_DIRECTIONAL_FORMAT
- DIRECTIONALITY_LEFT_TO_RIGHT_ISOLATE
- DIRECTIONALITY_RIGHT_TO_LEFT_ISOLATE
- DIRECTIONALITY_FIRST_STRONG_ISOLATE
- DIRECTIONALITY_POP_DIRECTIONAL_ISOLATE
- MIN_HIGH_SURROGATE
- MAX_HIGH_SURROGATE
- MIN_LOW_SURROGATE
- MAX_LOW_SURROGATE
- MIN_SURROGATE
- MAX_SURROGATE
- MIN_SUPPLEMENTARY_CODE_POINT
- MIN_CODE_POINT
- MAX_CODE_POINT
- SIZE
- BYTES
Constructor Details
- Character
Method Details
- describeConstable
- valueOf
- charValue
- hashCode
- hashCode
- equals
- toString
- toString
- toString
- isValidCodePoint
- isBmpCodePoint
- isSupplementaryCodePoint
- isHighSurrogate
- isLowSurrogate
- isSurrogate
- isSurrogatePair
- charCount
- toCodePoint
- codePointAt
- codePointAt
- codePointAt
- codePointBefore
- codePointBefore
- codePointBefore
- highSurrogate
- lowSurrogate
- toChars
- toChars
- codePointCount
- codePointCount
- offsetByCodePoints
- offsetByCodePoints
- isLowerCase
- isLowerCase
- isUpperCase
- isUpperCase
- isTitleCase
- isTitleCase
- isDigit
- isDigit
- isDefined
- isDefined
- isLetter
- isLetter
- isLetterOrDigit
- isLetterOrDigit
- isJavaLetter
- isJavaLetterOrDigit
- isAlphabetic
- isIdeographic
- isJavaIdentifierStart
- isJavaIdentifierStart
- isJavaIdentifierPart
- isJavaIdentifierPart
- isUnicodeIdentifierStart
- isUnicodeIdentifierStart
- isUnicodeIdentifierPart
- isUnicodeIdentifierPart
- isIdentifierIgnorable
- isIdentifierIgnorable
- isEmoji
- isEmojiPresentation
- isEmojiModifier
- isEmojiModifierBase
- isEmojiComponent
- isExtendedPictographic
- toLowerCase
- toLowerCase
- toUpperCase
- toUpperCase
- toTitleCase
- toTitleCase
- digit
- digit
- getNumericValue
- getNumericValue
- isSpace
- isSpaceChar
- isSpaceChar
- isWhitespace
- isWhitespace
- isISOControl
- isISOControl
- getType
- getType
- forDigit
- getDirectionality
- getDirectionality
- isMirrored
- isMirrored
- compareTo
- compare
- reverseBytes
- getName
- codePointOf

Nested Class Summary

Field Summary

Constructor Summary

Method Summary

Field Details

MIN_RADIX

MAX_RADIX

MIN_VALUE

MAX_VALUE

TYPE

UNASSIGNED

UPPERCASE_LETTER

LOWERCASE_LETTER

TITLECASE_LETTER

MODIFIER_LETTER

OTHER_LETTER

NON_SPACING_MARK

ENCLOSING_MARK

COMBINING_SPACING_MARK

DECIMAL_DIGIT_NUMBER

LETTER_NUMBER

OTHER_NUMBER

SPACE_SEPARATOR

LINE_SEPARATOR

PARAGRAPH_SEPARATOR

CONTROL

FORMAT

PRIVATE_USE

SURROGATE

DASH_PUNCTUATION

START_PUNCTUATION

END_PUNCTUATION

CONNECTOR_PUNCTUATION

OTHER_PUNCTUATION

MATH_SYMBOL

CURRENCY_SYMBOL

MODIFIER_SYMBOL

OTHER_SYMBOL

INITIAL_QUOTE_PUNCTUATION

FINAL_QUOTE_PUNCTUATION

DIRECTIONALITY_UNDEFINED

DIRECTIONALITY_LEFT_TO_RIGHT

DIRECTIONALITY_RIGHT_TO_LEFT

DIRECTIONALITY_RIGHT_TO_LEFT_ARABIC

DIRECTIONALITY_EUROPEAN_NUMBER

DIRECTIONALITY_EUROPEAN_NUMBER_SEPARATOR

DIRECTIONALITY_EUROPEAN_NUMBER_TERMINATOR

DIRECTIONALITY_ARABIC_NUMBER

DIRECTIONALITY_COMMON_NUMBER_SEPARATOR

DIRECTIONALITY_NONSPACING_MARK

DIRECTIONALITY_BOUNDARY_NEUTRAL

DIRECTIONALITY_PARAGRAPH_SEPARATOR

DIRECTIONALITY_SEGMENT_SEPARATOR

DIRECTIONALITY_WHITESPACE

DIRECTIONALITY_OTHER_NEUTRALS

DIRECTIONALITY_LEFT_TO_RIGHT_EMBEDDING

DIRECTIONALITY_LEFT_TO_RIGHT_OVERRIDE

DIRECTIONALITY_RIGHT_TO_LEFT_EMBEDDING

DIRECTIONALITY_RIGHT_TO_LEFT_OVERRIDE

DIRECTIONALITY_POP_DIRECTIONAL_FORMAT

DIRECTIONALITY_LEFT_TO_RIGHT_ISOLATE

DIRECTIONALITY_RIGHT_TO_LEFT_ISOLATE

DIRECTIONALITY_FIRST_STRONG_ISOLATE

DIRECTIONALITY_POP_DIRECTIONAL_ISOLATE

MIN_HIGH_SURROGATE

MAX_HIGH_SURROGATE

MIN_LOW_SURROGATE

MAX_LOW_SURROGATE

MIN_SURROGATE

MAX_SURROGATE

MIN_SUPPLEMENTARY_CODE_POINT

MIN_CODE_POINT

MAX_CODE_POINT

SIZE

BYTES

Constructor Details

Character

Method Details

describeConstable

valueOf