public final class SentenceTokenizer extends Tokenizer
The output tokens can then be broken into words with WordTokenFilter
AttributeSource.AttributeFactory, AttributeSource.State| Constructor and Description |
|---|
SentenceTokenizer(AttributeSource.AttributeFactory factory,
Reader reader) |
SentenceTokenizer(Reader reader) |
| Modifier and Type | Method and Description |
|---|---|
void |
end() |
boolean |
incrementToken() |
void |
reset() |
close, correctOffset, setReaderaddAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreStatepublic SentenceTokenizer(Reader reader)
public SentenceTokenizer(AttributeSource.AttributeFactory factory, Reader reader)
public boolean incrementToken()
throws IOException
incrementToken in class TokenStreamIOExceptionpublic void reset()
throws IOException
reset in class TokenStreamIOExceptionpublic void end()
end in class TokenStreamCopyright © 2000-2013 Apache Software Foundation. All Rights Reserved.