public class CharEncoding extends Object
java.lang.Object
   ↳ org.apache.commons.codec.CharEncoding

Class Overview

Character encoding names required of every implementation of the Java platform. From the Java documentation Standard charsets:

Every implementation of the Java platform is required to support the following character encodings. Consult the release documentation for your implementation to see if any other encodings are supported.

  • US-ASCII
    Seven-bit ASCII, a.k.a. ISO646-US, a.k.a. the Basic Latin block of the Unicode character set.
  • ISO-8859-1
    ISO Latin Alphabet No. 1, a.k.a. ISO-LATIN-1.
  • UTF-8
    Eight-bit Unicode Transformation Format.
  • UTF-16BE
    Sixteen-bit Unicode Transformation Format, big-endian byte order.
  • UTF-16LE
    Sixteen-bit Unicode Transformation Format, little-endian byte order.
  • UTF-16
    Sixteen-bit Unicode Transformation Format, byte order specified by a mandatory initial byte-order mark (either order accepted on input, big-endian used on output).
This would perhaps best belong in the [lang] project. Even if a similar interface is defined in [lang], it is not foreseen that [codec] would be made to depend on [lang].
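
The requirement quoted above can be checked at run time. The following is a minimal sketch that uses only the standard `java.nio.charset.Charset` class; the class name `RequiredCharsetsCheck` and the `REQUIRED` array are hypothetical and exist only for this illustration. The names are copied from the list above rather than read from `CharEncoding`, so the example runs without [codec] on the classpath.

```java
import java.nio.charset.Charset;

final class RequiredCharsetsCheck {
    // Assumption: names copied by hand from the list above. In a project that
    // depends on [codec], the CharEncoding constants would be used instead.
    static final String[] REQUIRED = {
        "US-ASCII", "ISO-8859-1", "UTF-8", "UTF-16BE", "UTF-16LE", "UTF-16"
    };

    /** Returns true if the running JVM reports support for every listed name. */
    static boolean allSupported() {
        for (String name : REQUIRED) {
            if (!Charset.isSupported(name)) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Print the support status of each required encoding name.
        for (String name : REQUIRED) {
            System.out.println(name + " supported: " + Charset.isSupported(name));
        }
    }
}
```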

Summary

Constants
String ISO_8859_1

ISO Latin Alphabet No. 1, a.k.a. ISO-LATIN-1.

String US_ASCII

Seven-bit ASCII, also known as ISO646-US, also known as the Basic Latin block of the Unicode character set.

String UTF_16

Sixteen-bit Unicode Transformation Format, with the byte order specified by a mandatory initial byte-order mark (either order accepted on input, big-endian used on output).

Every implementation of the Java platform is required to support this character encoding.

String UTF_16BE

Sixteen-bit Unicode Transformation Format, big-endian byte order.

String UTF_16LE

Sixteen-bit Unicode Transformation Format, little-endian byte order.

String UTF_8

Eight-bit Unicode Transformation Format.

Public Constructors
CharEncoding()
Inherited Methods
From class java.lang.Object

Constants

public static final String ISO_8859_1

ISO Latin Alphabet No. 1, a.k.a. ISO-LATIN-1.

Every implementation of the Java platform is required to support this character encoding.

Constant Value: "ISO-8859-1"

public static final String US_ASCII

Seven-bit ASCII, also known as ISO646-US, also known as the Basic Latin block of the Unicode character set.

Every implementation of the Java platform is required to support this character encoding.

Constant Value: "US-ASCII"

public static final String UTF_16

Sixteen-bit Unicode Transformation Format, with the byte order specified by a mandatory initial byte-order mark (either order accepted on input, big-endian used on output).

Every implementation of the Java platform is required to support this character encoding.

Constant Value: "UTF-16"
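
The byte-order-mark behavior described for "UTF-16" can be observed directly. This is a small sketch using only the standard library; `Utf16BomDemo` and its helper methods are hypothetical names introduced for illustration, and the literal "UTF-16" stands in for the `UTF_16` constant.

```java
import java.nio.charset.Charset;
import java.util.Arrays;

final class Utf16BomDemo {
    /** Encodes text using the charset with the given name. */
    static byte[] encode(String text, String charsetName) {
        return text.getBytes(Charset.forName(charsetName));
    }

    /** Decodes bytes using the charset with the given name. */
    static String decode(byte[] bytes, String charsetName) {
        return new String(bytes, Charset.forName(charsetName));
    }

    public static void main(String[] args) {
        // Output side: a big-endian byte-order mark (FE FF) precedes the data.
        System.out.println(Arrays.toString(encode("A", "UTF-16"))); // [-2, -1, 0, 65]

        // Input side: data starting with a little-endian mark (FF FE) is accepted too.
        byte[] littleEndian = {(byte) 0xFF, (byte) 0xFE, 0x41, 0x00};
        System.out.println(decode(littleEndian, "UTF-16")); // A
    }
}
```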

public static final String UTF_16BE

Sixteen-bit Unicode Transformation Format, big-endian byte order.

Every implementation of the Java platform is required to support this character encoding.

Constant Value: "UTF-16BE"

public static final String UTF_16LE

Sixteen-bit Unicode Transformation Format, little-endian byte order.

Every implementation of the Java platform is required to support this character encoding.

Constant Value: "UTF-16LE"

public static final String UTF_8

Eight-bit Unicode Transformation Format.

Every implementation of the Java platform is required to support this character encoding.

Constant Value: "UTF-8"
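
Because the constant values are plain encoding names, they can be passed to standard APIs that take a charset name, such as `String.getBytes(String)`. The sketch below illustrates this under stated assumptions: `EncodingNamesDemo` and `encodedLength` are hypothetical, and the two fields are local copies of the constant values shown above so that the example compiles without [codec]. With [codec] available, `CharEncoding.UTF_8` and `CharEncoding.ISO_8859_1` would presumably be used in their place.

```java
import java.io.UnsupportedEncodingException;

final class EncodingNamesDemo {
    // Local copies of two constant values listed above (illustration only).
    static final String UTF_8 = "UTF-8";
    static final String ISO_8859_1 = "ISO-8859-1";

    /** Returns the number of bytes the text occupies in the named encoding. */
    static int encodedLength(String text, String encodingName) {
        try {
            return text.getBytes(encodingName).length;
        } catch (UnsupportedEncodingException e) {
            // Not expected for the encodings every Java platform must support.
            throw new IllegalStateException(encodingName + " is not supported", e);
        }
    }

    public static void main(String[] args) {
        String text = "caf\u00e9"; // the word "café"
        System.out.println(encodedLength(text, ISO_8859_1)); // 4
        System.out.println(encodedLength(text, UTF_8));      // 5
    }
}
```

Referring to a shared constant rather than retyping the literal avoids misspelled names, which would otherwise surface only at run time as an `UnsupportedEncodingException`.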

Public Constructors

public CharEncoding ()