- All Implemented Interfaces:
- Closeable, Readable
- Direct Known Subclasses:
- CharFilter, CharReader
public abstract class CharStream
- extends Reader
Reader. All Tokenizers accept a
CharStream instead of
Reader as input, which enables
arbitrary character based filtering before tokenization.
correctOffset(int) method fixed offsets to account for
removal or insertion of characters, so that the offsets
reported in the tokens match the character offsets of the
Called by CharFilter(s) and Tokenizer to correct token offset.
|Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
public abstract int correctOffset(int currentOff)
- Called by CharFilter(s) and Tokenizer to correct token offset.
currentOff - offset as seen in the output
- corrected offset based on the input
Copyright © 2000-2011 Apache Software Foundation. All Rights Reserved.