org.apache.lucene.analysis.morfologik
Class MorfologikFilter
java.lang.Object
org.apache.lucene.util.AttributeSource
org.apache.lucene.analysis.TokenStream
org.apache.lucene.analysis.TokenFilter
org.apache.lucene.analysis.morfologik.MorfologikFilter
- All Implemented Interfaces:
- Closeable
public class MorfologikFilter
- extends TokenFilter
A TokenFilter that uses the Morfologik library to transform input tokens into lemmas
annotated with morphosyntactic (POS) tags. Applies to Polish only.
MorfologikFilter contains a MorphosyntacticTagsAttribute, which provides morphosyntactic
annotations for produced lemmas. See the Morfologik documentation for details.
- See Also:
- Morfologik project page
Method Summary
boolean incrementToken()
- Retrieves the next token (possibly from the list of lemmas).
void reset()
- Resets stems accumulator and hands over to superclass.
Methods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
MorfologikFilter
public MorfologikFilter(TokenStream in,
Version version)
- Creates a MorfologikFilter.
- Parameters:
in - input token stream
version - Lucene version compatibility for lowercasing.
incrementToken
public final boolean incrementToken()
throws IOException
- Retrieves the next token (possibly from the list of lemmas).
- Specified by:
incrementToken in class TokenStream
- Throws:
IOException
reset
public void reset()
throws IOException
- Resets stems accumulator and hands over to superclass.
- Overrides:
reset in class TokenFilter
- Throws:
IOException
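The contract described by incrementToken() and reset() can be illustrated with a simplified, standard-library-only model. This is a sketch under stated assumptions, not the Lucene or Morfologik implementation: the class LemmaFilterSketch, its methods term(), drain() and demo(), and the hand-written dictionary entries are all hypothetical. It assumes that one input token may yield several lemmas, that these are buffered and emitted one per call, and that tokens missing from the dictionary pass through unchanged.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Deque;
import java.util.HashMap;
import java.util.Iterator;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Simplified stand-in for a lemmatizing token filter. NOT the Lucene classes.
class LemmaFilterSketch {
    private final List<String> source;              // upstream tokens (hypothetical input)
    private final Map<String, List<String>> lemmas; // toy dictionary (hypothetical data)
    private final Deque<String> pending = new ArrayDeque<>(); // lemma accumulator
    private Iterator<String> input;
    private String current;                         // plays the role of a term attribute

    LemmaFilterSketch(List<String> source, Map<String, List<String>> lemmas) {
        this.source = source;
        this.lemmas = lemmas;
        this.input = source.iterator();
    }

    // Returns true if a token was produced; read it with term().
    boolean incrementToken() {
        if (pending.isEmpty()) {
            if (!input.hasNext()) {
                return false;                       // upstream exhausted
            }
            String token = input.next();
            List<String> found = lemmas.get(token.toLowerCase(Locale.ROOT));
            if (found == null || found.isEmpty()) {
                current = token;                    // assumption: unknown words pass through
                return true;
            }
            pending.addAll(found);                  // buffer every lemma of this token
        }
        current = pending.poll();                   // emit the next buffered lemma
        return true;
    }

    String term() {
        return current;
    }

    // Clears the accumulator and rewinds the upstream, mirroring reset().
    void reset() {
        pending.clear();
        input = source.iterator();
        current = null;
    }

    // Consumes the filter to the end and collects the emitted terms.
    static List<String> drain(LemmaFilterSketch filter) {
        List<String> out = new ArrayList<>();
        while (filter.incrementToken()) {
            out.add(filter.term());
        }
        return out;
    }

    // Builds a filter over three tokens with illustrative, hand-written entries.
    static LemmaFilterSketch demo() {
        Map<String, List<String>> dict = new HashMap<>();
        dict.put("psy", Arrays.asList("pies"));
        dict.put("lata", Arrays.asList("lato", "rok"));
        return new LemmaFilterSketch(Arrays.asList("Psy", "lata", "xyz"), dict);
    }

    public static void main(String[] args) {
        System.out.println(drain(demo()));
    }
}
```

In this model, a token with two dictionary entries produces two consecutive results from incrementToken(), and calling reset() part-way through discards any lemmas still buffered before the stream starts over.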
Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.