public final class

UTF8StreamJsonParser

extends ParserBase
java.lang.Object
   ↳ com.fasterxml.jackson.core.JsonParser
     ↳ com.fasterxml.jackson.core.base.ParserMinimalBase
       ↳ com.fasterxml.jackson.core.base.ParserBase
         ↳ com.fasterxml.jackson.core.json.UTF8StreamJsonParser

Class Overview

This is a concrete implementation of JsonParser, which is based on a java.io.InputStream as the input source.

Summary

[Expand]
Inherited Constants
From class com.fasterxml.jackson.core.base.ParserBase
From class com.fasterxml.jackson.core.base.ParserMinimalBase
Fields
protected boolean _bufferRecyclable Flag that indicates whether the input buffer is recycable (and needs to be returned to recycler once we are done) or not.
protected byte[] _inputBuffer Current buffer from which data is read; generally data is read into buffer from input source, but in some cases pre-loaded buffer is handed to the parser.
protected InputStream _inputStream
protected ObjectCodec _objectCodec Codec used for data binding when (if) requested; typically full ObjectMapper, but that abstract is not part of core package.
protected int[] _quadBuffer Temporary buffer used for name parsing.
protected final BytesToNameCanonicalizer _symbols Symbol table that contains field names encountered so far
protected boolean _tokenIncomplete Flag that indicates that the current token has not yet been fully processed, and needs to be finished for some access (or skipped to obtain the next token)
[Expand]
Inherited Fields
From class com.fasterxml.jackson.core.base.ParserBase
From class com.fasterxml.jackson.core.base.ParserMinimalBase
From class com.fasterxml.jackson.core.JsonParser
Public Constructors
UTF8StreamJsonParser(IOContext ctxt, int features, InputStream in, ObjectCodec codec, BytesToNameCanonicalizer sym, byte[] inputBuffer, int start, int end, boolean bufferRecyclable)
Public Methods
void close()
Closes the parser so that no further iteration or data access can be made; will also close the underlying input source if parser either owns the input source, or feature AUTO_CLOSE_SOURCE is enabled.
byte[] getBinaryValue(Base64Variant b64variant)
Method that can be used to read (and consume -- results may not be accessible using other methods after the call) base64-encoded binary data included in the current textual JSON value.
ObjectCodec getCodec()
Accessor for ObjectCodec associated with this parser, if any.
Object getEmbeddedObject()
Accessor that can be called if (and only if) the current token is VALUE_EMBEDDED_OBJECT.
Object getInputSource()
Method that can be used to get access to object that is used to access input being parsed; this is usually either InputStream or Reader, depending on what parser was constructed with.
String getText()
Method for accessing textual representation of the current token; if no current token (before first call to nextToken(), or after encountering end-of-input), returns null.
char[] getTextCharacters()
Method similar to getText(), but that will return underlying (unmodifiable) character array that contains textual value, instead of constructing a String object to contain this information.
int getTextLength()
Accessor used with getTextCharacters(), to know length of String stored in returned buffer.
int getTextOffset()
Accessor used with getTextCharacters(), to know offset of the first text content character within buffer.
static int[] growArrayBy(int[] arr, int more)
Boolean nextBooleanValue()
Method that fetches next token (as if calling nextToken()) and if it is VALUE_TRUE or VALUE_FALSE returns matching Boolean value; otherwise return null.
boolean nextFieldName(SerializableString str)
Method that fetches next token (as if calling nextToken()) and verifies whether it is FIELD_NAME with specified name and returns result of that comparison.
int nextIntValue(int defaultValue)
Method that fetches next token (as if calling nextToken()) and if it is VALUE_NUMBER_INT returns 32-bit int value; otherwise returns specified default value It is functionally equivalent to:
  return (nextToken() == JsonToken.VALUE_NUMBER_INT) ? getIntValue() : defaultValue;
but may be faster for parser to process, and can therefore be used if caller expects to get a String value next from input.
long nextLongValue(long defaultValue)
Method that fetches next token (as if calling nextToken()) and if it is VALUE_NUMBER_INT returns 64-bit long value; otherwise returns specified default value It is functionally equivalent to:
  return (nextToken() == JsonToken.VALUE_NUMBER_INT) ? getLongValue() : defaultValue;
but may be faster for parser to process, and can therefore be used if caller expects to get a String value next from input.
String nextTextValue()
Method that fetches next token (as if calling nextToken()) and if it is VALUE_STRING returns contained String value; otherwise returns null.
JsonToken nextToken()
Main iteration method, which will advance stream enough to determine type of the next token, if any.
int readBinaryValue(Base64Variant b64variant, OutputStream out)
Similar to readBinaryValue(OutputStream) but allows explicitly specifying base64 variant to use.
int releaseBuffered(OutputStream out)
Method that can be called to push back any content that has been read but not consumed by the parser.
void setCodec(ObjectCodec c)
Setter that allows defining ObjectCodec associated with this parser, if any.
Version version()
Accessor for getting version of the core package, given a parser instance.
Protected Methods
void _closeInput()
byte[] _decodeBase64(Base64Variant b64variant)
Efficient handling for incremental parsing of base64-encoded textual content.
int _decodeCharForError(int firstByte)
final char _decodeEscaped()
Method that sub-classes must implement to support escaped sequences in base64-encoded sections.
void _finishString()
final String _getText2(JsonToken t)
JsonToken _handleApostropheValue()
JsonToken _handleInvalidNumberStart(int ch, boolean negative)
Method called if expected numeric value (due to leading sign) does not look like a number
JsonToken _handleUnexpectedValue(int c)
Method for handling cases where first non-space character of an expected value token is not legal for standard JSON content.
final Name _handleUnusualFieldName(int ch)
Method called when we see non-white space character other than double quote, when expecting a field name.
final boolean _loadToHaveAtLeast(int minAvailable)
Helper method that will try to load at least specified number bytes in input buffer, possible moving existing data around if necessary
final void _matchToken(String matchStr, int i)
final Name _parseApostropheFieldName()
final Name _parseFieldName(int i)
int _readBinary(Base64Variant b64variant, OutputStream out, byte[] buffer)
void _releaseBuffers()
Method called to release internal buffers owned by the base reader.
void _reportInvalidChar(int c)
void _reportInvalidInitial(int mask)
void _reportInvalidOther(int mask)
void _reportInvalidOther(int mask, int ptr)
void _reportInvalidToken(String matchedPart, String msg)
final void _skipCR()
We actually need to check the character value here (to see if we have \n following \r).
final void _skipLF()
void _skipString()
Method called to skim through rest of unparsed String value, if it is not needed.
final boolean loadMore()
Name parseEscapedFieldName(int[] quads, int qlen, int currQuad, int ch, int currQuadBytes)
Slower parsing method which is generally branched to when an escape sequence is detected (or alternatively for long names, or ones crossing input buffer boundary).
Name parseLongFieldName(int q)
final Name parseMediumFieldName(int q2, int[] codes)
final JsonToken parseNumberText(int c)
Initial parsing method for number values.
Name slowParseFieldName()
Method called when not even first 8 bytes are guaranteed to come consequtively.
[Expand]
Inherited Methods
From class com.fasterxml.jackson.core.base.ParserBase
From class com.fasterxml.jackson.core.base.ParserMinimalBase
From class com.fasterxml.jackson.core.JsonParser
From class java.lang.Object
From interface com.fasterxml.jackson.core.Versioned
From interface java.io.Closeable
From interface java.lang.AutoCloseable

Fields

protected boolean _bufferRecyclable

Flag that indicates whether the input buffer is recycable (and needs to be returned to recycler once we are done) or not.

If it is not, it also means that parser can NOT modify underlying buffer.

protected byte[] _inputBuffer

Current buffer from which data is read; generally data is read into buffer from input source, but in some cases pre-loaded buffer is handed to the parser.

protected InputStream _inputStream

protected ObjectCodec _objectCodec

Codec used for data binding when (if) requested; typically full ObjectMapper, but that abstract is not part of core package.

protected int[] _quadBuffer

Temporary buffer used for name parsing.

protected final BytesToNameCanonicalizer _symbols

Symbol table that contains field names encountered so far

protected boolean _tokenIncomplete

Flag that indicates that the current token has not yet been fully processed, and needs to be finished for some access (or skipped to obtain the next token)

Public Constructors

public UTF8StreamJsonParser (IOContext ctxt, int features, InputStream in, ObjectCodec codec, BytesToNameCanonicalizer sym, byte[] inputBuffer, int start, int end, boolean bufferRecyclable)

Public Methods

public void close ()

Closes the parser so that no further iteration or data access can be made; will also close the underlying input source if parser either owns the input source, or feature AUTO_CLOSE_SOURCE is enabled. Whether parser owns the input source depends on factory method that was used to construct instance (so check JsonFactory for details, but the general idea is that if caller passes in closable resource (such as InputStream or Reader) parser does NOT own the source; but if it passes a reference (such as java.io.File or java.net.URL and creates stream or reader it does own them.

Throws
IOException

public byte[] getBinaryValue (Base64Variant b64variant)

Method that can be used to read (and consume -- results may not be accessible using other methods after the call) base64-encoded binary data included in the current textual JSON value. It works similar to getting String value via getText() and decoding result (except for decoding part), but should be significantly more performant.

Note that non-decoded textual contents of the current token are not guaranteed to be accessible after this method is called. Current implementation, for example, clears up textual content during decoding. Decoded binary content, however, will be retained until parser is advanced to the next event.

Parameters
b64variant Expected variant of base64 encoded content (see Base64Variants for definitions of "standard" variants).
Returns
  • Decoded binary data

public ObjectCodec getCodec ()

Accessor for ObjectCodec associated with this parser, if any. Codec is used by readValueAs(Class) method (and its variants).

public Object getEmbeddedObject ()

Accessor that can be called if (and only if) the current token is VALUE_EMBEDDED_OBJECT. For other token types, null is returned.

Note: only some specialized parser implementations support embedding of objects (usually ones that are facades on top of non-streaming sources, such as object trees).

public Object getInputSource ()

Method that can be used to get access to object that is used to access input being parsed; this is usually either InputStream or Reader, depending on what parser was constructed with. Note that returned value may be null in some cases; including case where parser implementation does not want to exposed raw source to caller. In cases where input has been decorated, object returned here is the decorated version; this allows some level of interaction between users of parser and decorator object.

In general use of this accessor should be considered as "last effort", i.e. only used if no other mechanism is applicable.

public String getText ()

Method for accessing textual representation of the current token; if no current token (before first call to nextToken(), or after encountering end-of-input), returns null. Method can be called for any token type.

public char[] getTextCharacters ()

Method similar to getText(), but that will return underlying (unmodifiable) character array that contains textual value, instead of constructing a String object to contain this information. Note, however, that:

  • Textual contents are not guaranteed to start at index 0 (rather, call getTextOffset()) to know the actual offset
  • Length of textual contents may be less than the length of returned buffer: call getTextLength() for actual length of returned content.

Note that caller MUST NOT modify the returned character array in any way -- doing so may corrupt current parser state and render parser instance useless.

The only reason to call this method (over getText()) is to avoid construction of a String object (which will make a copy of contents).

public int getTextLength ()

Accessor used with getTextCharacters(), to know length of String stored in returned buffer.

Returns
  • Number of characters within buffer returned by getTextCharacters() that are part of textual content of the current token.

public int getTextOffset ()

Accessor used with getTextCharacters(), to know offset of the first text content character within buffer.

Returns
  • Offset of the first character within buffer returned by getTextCharacters() that is part of textual content of the current token.

public static int[] growArrayBy (int[] arr, int more)

public Boolean nextBooleanValue ()

Method that fetches next token (as if calling nextToken()) and if it is VALUE_TRUE or VALUE_FALSE returns matching Boolean value; otherwise return null. It is functionally equivalent to:

  JsonToken t = nextToken();
  if (t == JsonToken.VALUE_TRUE) return Boolean.TRUE;
  if (t == JsonToken.VALUE_FALSE) return Boolean.FALSE;
  return null;
but may be faster for parser to process, and can therefore be used if caller expects to get a String value next from input.

public boolean nextFieldName (SerializableString str)

Method that fetches next token (as if calling nextToken()) and verifies whether it is FIELD_NAME with specified name and returns result of that comparison. It is functionally equivalent to:

  return (nextToken() == JsonToken.FIELD_NAME) && str.getValue().equals(getCurrentName());
but may be faster for parser to verify, and can therefore be used if caller expects to get such a property name from input next.

Parameters
str Property name to compare next token to (if next token is JsonToken.FIELD_NAME)

public int nextIntValue (int defaultValue)

Method that fetches next token (as if calling nextToken()) and if it is VALUE_NUMBER_INT returns 32-bit int value; otherwise returns specified default value It is functionally equivalent to:

  return (nextToken() == JsonToken.VALUE_NUMBER_INT) ? getIntValue() : defaultValue;
but may be faster for parser to process, and can therefore be used if caller expects to get a String value next from input.

public long nextLongValue (long defaultValue)

Method that fetches next token (as if calling nextToken()) and if it is VALUE_NUMBER_INT returns 64-bit long value; otherwise returns specified default value It is functionally equivalent to:

  return (nextToken() == JsonToken.VALUE_NUMBER_INT) ? getLongValue() : defaultValue;
but may be faster for parser to process, and can therefore be used if caller expects to get a String value next from input.

public String nextTextValue ()

Method that fetches next token (as if calling nextToken()) and if it is VALUE_STRING returns contained String value; otherwise returns null. It is functionally equivalent to:

  return (nextToken() == JsonToken.VALUE_STRING) ? getText() : null;
but may be faster for parser to process, and can therefore be used if caller expects to get a String value next from input.

public JsonToken nextToken ()

Main iteration method, which will advance stream enough to determine type of the next token, if any. If none remaining (stream has no content other than possible white space before ending), null will be returned.

Returns
  • Next token from the stream, if any found, or null to indicate end-of-input

public int readBinaryValue (Base64Variant b64variant, OutputStream out)

Similar to readBinaryValue(OutputStream) but allows explicitly specifying base64 variant to use.

Parameters
b64variant base64 variant to use
out Output stream to use for passing decoded binary data
Returns
  • Number of bytes that were decoded and written via OutputStream

public int releaseBuffered (OutputStream out)

Method that can be called to push back any content that has been read but not consumed by the parser. This is usually done after reading all content of interest using parser. Content is released by writing it to given stream if possible; if underlying input is byte-based it can released, if not (char-based) it can not.

Returns
  • -1 if the underlying content source is not byte based (that is, input can not be sent to OutputStream; otherwise number of bytes released (0 if there was nothing to release)
Throws
IOException

public void setCodec (ObjectCodec c)

Setter that allows defining ObjectCodec associated with this parser, if any. Codec is used by readValueAs(Class) method (and its variants).

public Version version ()

Accessor for getting version of the core package, given a parser instance. Left for sub-classes to implement.

Protected Methods

protected void _closeInput ()

Throws
IOException

protected byte[] _decodeBase64 (Base64Variant b64variant)

Efficient handling for incremental parsing of base64-encoded textual content.

protected int _decodeCharForError (int firstByte)

protected final char _decodeEscaped ()

Method that sub-classes must implement to support escaped sequences in base64-encoded sections. Sub-classes that do not need base64 support can leave this as is

protected void _finishString ()

protected final String _getText2 (JsonToken t)

protected JsonToken _handleApostropheValue ()

protected JsonToken _handleInvalidNumberStart (int ch, boolean negative)

Method called if expected numeric value (due to leading sign) does not look like a number

protected JsonToken _handleUnexpectedValue (int c)

Method for handling cases where first non-space character of an expected value token is not legal for standard JSON content.

protected final Name _handleUnusualFieldName (int ch)

Method called when we see non-white space character other than double quote, when expecting a field name. In standard mode will just throw an expection; but in non-standard modes may be able to parse name.

protected final boolean _loadToHaveAtLeast (int minAvailable)

Helper method that will try to load at least specified number bytes in input buffer, possible moving existing data around if necessary

Throws
IOException

protected final void _matchToken (String matchStr, int i)

protected final Name _parseApostropheFieldName ()

protected final Name _parseFieldName (int i)

protected int _readBinary (Base64Variant b64variant, OutputStream out, byte[] buffer)

protected void _releaseBuffers ()

Method called to release internal buffers owned by the base reader. This may be called along with _closeInput() (for example, when explicitly closing this reader instance), or separately (if need be).

Throws
IOException

protected void _reportInvalidChar (int c)

protected void _reportInvalidInitial (int mask)

protected void _reportInvalidOther (int mask)

protected void _reportInvalidOther (int mask, int ptr)

protected void _reportInvalidToken (String matchedPart, String msg)

protected final void _skipCR ()

We actually need to check the character value here (to see if we have \n following \r).

Throws
IOException

protected final void _skipLF ()

Throws
IOException

protected void _skipString ()

Method called to skim through rest of unparsed String value, if it is not needed. This can be done bit faster if contents need not be stored for future access.

protected final boolean loadMore ()

Throws
IOException

protected Name parseEscapedFieldName (int[] quads, int qlen, int currQuad, int ch, int currQuadBytes)

Slower parsing method which is generally branched to when an escape sequence is detected (or alternatively for long names, or ones crossing input buffer boundary). In any case, needs to be able to handle more exceptional cases, gets slower, and hance is offlined to a separate method.

protected Name parseLongFieldName (int q)

protected final Name parseMediumFieldName (int q2, int[] codes)

protected final JsonToken parseNumberText (int c)

Initial parsing method for number values. It needs to be able to parse enough input to be able to determine whether the value is to be considered a simple integer value, or a more generic decimal value: latter of which needs to be expressed as a floating point number. The basic rule is that if the number has no fractional or exponential part, it is an integer; otherwise a floating point number.

Because much of input has to be processed in any case, no partial parsing is done: all input text will be stored for further processing. However, actual numeric value conversion will be deferred, since it is usually the most complicated and costliest part of processing.

protected Name slowParseFieldName ()

Method called when not even first 8 bytes are guaranteed to come consequtively. Happens rarely, so this is offlined; plus we'll also do full checks for escaping etc.