public class MorfologikFilter extends TokenFilter
A TokenFilter that uses the Morfologik library.
MorfologikFilter contains a MorphosyntacticTagsAttribute, which provides morphosyntactic
annotations for produced lemmas. See the Morfologik documentation for details.

Nested classes inherited from class AttributeSource: AttributeSource.AttributeFactory, AttributeSource.State

| Constructor and Description |
|---|
| MorfologikFilter(TokenStream in, morfologik.stemming.PolishStemmer.DICTIONARY dict, Version version)<br>Builds a filter for the given PolishStemmer.DICTIONARY enum. |
| Modifier and Type | Method and Description |
|---|---|
| boolean | incrementToken()<br>Retrieves the next token (possibly from the list of lemmas). |
| void | reset()<br>Resets the stems accumulator and hands over to the superclass. |
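
The two methods above describe an accumulator pattern: one input token may yield several lemmas, which are buffered and emitted one per call, and reset() discards the buffer. The following is a minimal, Lucene-free sketch of that pattern, not the actual MorfologikFilter implementation. The class LemmaFilterSketch, its fields, the collect helper, and the toy dictionary are all hypothetical, and passing unknown words through unchanged is an assumption of the sketch.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Deque;
import java.util.HashMap;
import java.util.Iterator;
import java.util.List;
import java.util.Map;

/** Simplified model of a lemma-emitting filter. All names here are hypothetical. */
public class LemmaFilterSketch {
    private final List<String> source;             // stands in for the wrapped TokenStream
    private final Map<String, List<String>> dict;  // toy dictionary: surface form -> lemmas
    private final Deque<String> pending = new ArrayDeque<>(); // lemma accumulator
    private Iterator<String> input;
    private String current;                        // stands in for the term attribute

    public LemmaFilterSketch(List<String> source, Map<String, List<String>> dict) {
        this.source = source;
        this.dict = dict;
        this.input = source.iterator();
    }

    /** Advances to the next output token; returns false once the input is exhausted. */
    public boolean incrementToken() {
        if (!pending.isEmpty()) {
            // Still draining lemmas produced by the previous input token.
            current = pending.poll();
            return true;
        }
        if (!input.hasNext()) {
            return false;
        }
        String token = input.next();
        List<String> lemmas = dict.get(token);
        if (lemmas == null || lemmas.isEmpty()) {
            current = token; // assumption: unknown words pass through unchanged
        } else {
            pending.addAll(lemmas);
            current = pending.poll();
        }
        return true;
    }

    /** Clears the accumulator and rewinds the input (models handing over to the superclass). */
    public void reset() {
        pending.clear();
        input = source.iterator();
    }

    public String term() {
        return current;
    }

    /** Helper: drains the filter into a list of emitted terms. */
    public static List<String> collect(LemmaFilterSketch filter) {
        List<String> out = new ArrayList<>();
        while (filter.incrementToken()) {
            out.add(filter.term());
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, List<String>> toy = new HashMap<>();
        toy.put("kotki", Arrays.asList("kotek", "kotka")); // toy data: one form, two lemmas
        LemmaFilterSketch filter = new LemmaFilterSketch(Arrays.asList("kotki", "xyz"), toy);
        System.out.println(collect(filter)); // prints [kotek, kotka, xyz]
    }
}
```

Clearing the buffer in reset() matters when a stream is abandoned mid-token: without it, leftover lemmas from the previous pass would be emitted at the start of the next one.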
Methods inherited from class TokenFilter: close, end

Methods inherited from class AttributeSource: addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState

public MorfologikFilter(TokenStream in, morfologik.stemming.PolishStemmer.DICTIONARY dict, Version version)

Parameters:
in - input token stream
dict - PolishStemmer.DICTIONARY enum
version - Lucene version compatibility for lowercasing.

public final boolean incrementToken() throws IOException

Specified by: incrementToken in class TokenStream
Throws: IOException

public void reset() throws IOException

Overrides: reset in class TokenFilter
Throws: IOException

Copyright © 2000-2012 Apache Software Foundation. All Rights Reserved.