public class MorfologikFilter extends TokenFilter
TokenFilter
using Morfologik library to transform input tokens into lemma and
morphosyntactic (POS) tokens. Applies to Polish only.
MorfologikFilter contains a MorphosyntacticTagsAttribute
, which provides morphosyntactic
annotations for produced lemmas. See the Morfologik documentation for details.
AttributeSource.AttributeFactory, AttributeSource.State
Constructor and Description |
---|
MorfologikFilter(TokenStream in,
Version version)
Creates MorfologikFilter
|
Modifier and Type | Method and Description |
---|---|
boolean |
incrementToken()
Retrieves the next token (possibly from the list of lemmas).
|
void |
reset()
Resets stems accumulator and hands over to superclass.
|
close, end
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
public MorfologikFilter(TokenStream in, Version version)
in
- input token streamversion
- Lucene version compatibility for lowercasing.public final boolean incrementToken() throws IOException
incrementToken
in class TokenStream
IOException
public void reset() throws IOException
reset
in class TokenFilter
IOException
Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.