public final class SentenceTokenizer extends Tokenizer
The output tokens can then be broken into words with WordTokenFilter
AttributeSource.AttributeFactory, AttributeSource.State| Constructor and Description |
|---|
SentenceTokenizer(AttributeSource.AttributeFactory factory,
Reader reader) |
SentenceTokenizer(AttributeSource source,
Reader reader) |
SentenceTokenizer(Reader reader) |
| Modifier and Type | Method and Description |
|---|---|
void |
end() |
boolean |
incrementToken() |
void |
reset() |
void |
setReader(Reader input) |
close, correctOffsetaddAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreStatepublic SentenceTokenizer(Reader reader)
public SentenceTokenizer(AttributeSource source, Reader reader)
public SentenceTokenizer(AttributeSource.AttributeFactory factory, Reader reader)
public boolean incrementToken()
throws IOException
incrementToken in class TokenStreamIOExceptionpublic void reset()
throws IOException
reset in class TokenStreamIOExceptionpublic void setReader(Reader input) throws IOException
setReader in class TokenizerIOExceptionpublic void end()
end in class TokenStreamCopyright © 2000-2012 Apache Software Foundation. All Rights Reserved.