public abstract class CharStream extends Reader
CharStream adds correctOffset(int) functionality over Reader. All Tokenizers accept a CharStream instead of
Reader as input, which enables arbitrary character-based filtering before tokenization. The
correctOffset(int) method corrects offsets to account for the removal or insertion of characters, so that the offsets reported in the tokens match the character offsets of the original Reader.
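The offset correction described above can be illustrated with a minimal, self-contained sketch. It does not use the library itself: SketchCharStream, DropCharStream, and CorrectOffsetDemo are hypothetical names introduced only for this example, and the bookkeeping shown (a running count of removed characters per emitted position) is one assumed way to implement the mapping, not necessarily how any shipped filter does it.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the abstract class documented here; not the library source.
abstract class SketchCharStream extends Reader {
    public abstract int correctOffset(int currentOff);
}

// Hypothetical filter that drops every occurrence of one character and records
// how many characters were removed before each emitted position.
class DropCharStream extends SketchCharStream {
    private final Reader in;
    private final char drop;
    // removedBefore.get(i) = characters dropped before the i-th emitted character
    private final List<Integer> removedBefore = new ArrayList<>();
    private int removed = 0;

    DropCharStream(Reader in, char drop) {
        this.in = in;
        this.drop = drop;
    }

    @Override
    public int read(char[] buf, int off, int len) throws IOException {
        int n = 0;
        while (n < len) {
            int c = in.read();
            if (c == -1) {
                break; // underlying reader is exhausted
            }
            if (c == drop) {
                removed++; // character is filtered out, so later offsets shift
                continue;
            }
            removedBefore.add(removed);
            buf[off + n] = (char) c;
            n++;
        }
        return (n == 0 && len > 0) ? -1 : n;
    }

    @Override
    public int correctOffset(int currentOff) {
        // Map an offset in the filtered text back to the original text.
        if (currentOff >= 0 && currentOff < removedBefore.size()) {
            return currentOff + removedBefore.get(currentOff);
        }
        // Offsets past the last emitted character shift by the total removed so far.
        return currentOff + removed;
    }

    @Override
    public void close() throws IOException {
        in.close();
    }
}

class CorrectOffsetDemo {
    public static void main(String[] args) throws IOException {
        DropCharStream stream = new DropCharStream(new StringReader("a-b-c"), '-');
        char[] buf = new char[16];
        int n = stream.read(buf, 0, buf.length);
        String filtered = new String(buf, 0, n);
        for (int i = 0; i < n; i++) {
            System.out.println(filtered.charAt(i) + ": filtered offset " + i
                    + " -> original offset " + stream.correctOffset(i));
        }
    }
}
```

In this sketch the input "a-b-c" is filtered to "abc", and the filtered offset 1 (the character b) maps back to offset 2 in the original text. A tokenizer reading from such a stream would pass its token offsets through correctOffset(int) before reporting them.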
|Constructor and Description|
|CharStream()|

|Modifier and Type|Method and Description|
|abstract int|correctOffset(int) Called by CharFilter(s) and Tokenizer to correct token offset.|
Methods inherited from class java.io.Reader: close, mark, markSupported, read, read, read, read, ready, reset, skip