public abstract class Tokenizer extends TokenStream
This is an abstract class; subclasses must override TokenStream.incrementToken()
NOTE: Subclasses overriding TokenStream.incrementToken() must
call AttributeSource.clearAttributes() before
setting attributes.
AttributeSource.AttributeFactory, AttributeSource.State| Modifier and Type | Field and Description |
|---|---|
protected Reader |
input
The text source for this Tokenizer.
|
| Modifier | Constructor and Description |
|---|---|
protected |
Tokenizer(AttributeSource.AttributeFactory factory,
Reader input)
Construct a token stream processing the given input using the given AttributeFactory.
|
protected |
Tokenizer(AttributeSource source,
Reader input)
Construct a token stream processing the given input using the given AttributeSource.
|
protected |
Tokenizer(Reader input)
Construct a token stream processing the given input.
|
| Modifier and Type | Method and Description |
|---|---|
void |
close()
Releases resources associated with this stream.
|
protected int |
correctOffset(int currentOff)
Return the corrected offset.
|
void |
setReader(Reader input)
Expert: Set a new reader on the Tokenizer.
|
end, incrementToken, resetaddAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreStateprotected Reader input
protected Tokenizer(Reader input)
protected Tokenizer(AttributeSource.AttributeFactory factory, Reader input)
protected Tokenizer(AttributeSource source, Reader input)
public void close()
throws IOException
NOTE:
The default implementation closes the input Reader, so
be sure to call super.close() when overriding this method.
close in interface Closeableclose in class TokenStreamIOExceptionprotected final int correctOffset(int currentOff)
input is a CharFilter subclass
this method calls CharFilter.correctOffset(int), else returns currentOff.currentOff - offset as seen in the outputCharFilter.correctOffset(int)public final void setReader(Reader input) throws IOException
IOExceptionCopyright © 2000-2013 Apache Software Foundation. All Rights Reserved.