org.apache.lucene.analysis.morfologik
Class MorfologikFilter

java.lang.Object
  extended by org.apache.lucene.util.AttributeSource
      extended by org.apache.lucene.analysis.TokenStream
          extended by org.apache.lucene.analysis.TokenFilter
              extended by org.apache.lucene.analysis.morfologik.MorfologikFilter
All Implemented Interfaces:
Closeable

public class MorfologikFilter
extends TokenFilter

A TokenFilter that uses the Morfologik library to replace input tokens with their lemmas. MorfologikFilter adds a MorphosyntacticTagsAttribute to the stream, which provides morphosyntactic annotations for the lemmas it produces. See the Morfologik documentation for details.

See Also:
Morfologik project page

Nested Class Summary
 
Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource
AttributeSource.AttributeFactory, AttributeSource.State
 
Constructor Summary
MorfologikFilter(TokenStream in, morfologik.stemming.PolishStemmer.DICTIONARY dict, Version version)
          Builds a filter for the given PolishStemmer.DICTIONARY enum value.
 
Method Summary
 boolean incrementToken()
          Retrieves the next token (possibly from the list of lemmas).
 void reset()
          Resets the stems accumulator and delegates to the superclass.
 
Methods inherited from class org.apache.lucene.analysis.TokenFilter
close, end
 
Methods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState
 
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

MorfologikFilter

public MorfologikFilter(TokenStream in,
                        morfologik.stemming.PolishStemmer.DICTIONARY dict,
                        Version version)
Builds a filter for the given PolishStemmer.DICTIONARY enum value.

Parameters:
in - input token stream
dict - the PolishStemmer.DICTIONARY enum value that selects the dictionary to use
version - the Lucene version, used for lowercasing compatibility

Method Detail

incrementToken

public final boolean incrementToken()
                             throws IOException
Retrieves the next token (possibly from the list of lemmas).

Specified by:
incrementToken in class TokenStream
Throws:
IOException

reset

public void reset()
           throws IOException
Resets the stems accumulator and delegates to the superclass.

Overrides:
reset in class TokenFilter
Throws:
IOException
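
The interplay between incrementToken() and reset() described above can be illustrated with a small, Lucene-free model. This is only a sketch under stated assumptions, not the MorfologikFilter source: the class LemmaQueueSketch, its in-memory dictionary, and the sample word forms are all hypothetical. It assumes that lookups are done on the lowercased token and that a token with no dictionary entry is passed through unchanged.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Deque;
import java.util.HashMap;
import java.util.Iterator;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical, Lucene-free model of a lemma-expanding filter.
// It is NOT the MorfologikFilter implementation; names and behavior are assumptions.
class LemmaQueueSketch {
    private final List<String> input;
    private final Map<String, List<String>> dictionary;
    // Plays the role of the "stems accumulator": lemmas not yet emitted.
    private final Deque<String> pendingLemmas = new ArrayDeque<String>();
    private Iterator<String> tokens;
    private String term;

    LemmaQueueSketch(List<String> input, Map<String, List<String>> dictionary) {
        this.input = input;
        this.dictionary = dictionary;
        this.tokens = input.iterator();
    }

    // Advances to the next output term: a queued lemma if one is pending,
    // otherwise the lemmas (or the unchanged form) of the next input token.
    boolean incrementToken() {
        if (pendingLemmas.isEmpty()) {
            if (!tokens.hasNext()) {
                return false; // input exhausted
            }
            String surface = tokens.next();
            // Assumption: the lookup key is the lowercased surface form.
            List<String> lemmas = dictionary.get(surface.toLowerCase(Locale.ROOT));
            if (lemmas == null || lemmas.isEmpty()) {
                term = surface; // assumption: unknown tokens pass through
                return true;
            }
            pendingLemmas.addAll(lemmas);
        }
        term = pendingLemmas.poll();
        return true;
    }

    // Clears queued lemmas and rewinds the input, so no stale lemma leaks
    // into the next pass over the stream.
    void reset() {
        pendingLemmas.clear();
        tokens = input.iterator();
    }

    String term() {
        return term;
    }

    // Collects every remaining output term into a list.
    static List<String> drain(LemmaQueueSketch filter) {
        List<String> out = new ArrayList<String>();
        while (filter.incrementToken()) {
            out.add(filter.term());
        }
        return out;
    }

    public static void main(String[] args) {
        // Illustrative entries only; not taken from the Morfologik dictionary.
        Map<String, List<String>> dict = new HashMap<String, List<String>>();
        dict.put("lata", Arrays.asList("lato", "rok"));
        dict.put("kotami", Arrays.asList("kot"));
        LemmaQueueSketch filter =
            new LemmaQueueSketch(Arrays.asList("Lata", "kotami", "xyz"), dict);
        System.out.println(drain(filter)); // prints [lato, rok, kot, xyz]
    }
}
```

In this model an ambiguous form yields one output term per lemma, which is why a single input token can make incrementToken() return true more than once. Calling reset() midway discards any lemmas still queued, mirroring the role of the stems accumulator in the description above.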


Copyright © 2000-2013 Apache Software Foundation. All Rights Reserved.