public final class HunspellStemFilter extends TokenFilter
Note: This filter is aware of the KeywordAttribute
. To prevent
certain terms from being passed to the stemmer
KeywordAttribute.isKeyword()
should be set to true
in a previous TokenStream
.
Note: For including the original term as well as the stemmed version, see
KeywordRepeatFilterFactory
AttributeSource.AttributeFactory, AttributeSource.State
input
Constructor and Description |
---|
HunspellStemFilter(TokenStream input,
HunspellDictionary dictionary)
Create a
HunspellStemFilter which deduplicates stems and has a maximum
recursion level of 2. |
HunspellStemFilter(TokenStream input,
HunspellDictionary dictionary,
boolean dedup)
Create a
HunspellStemFilter which has a maximum recursion level of 2. |
HunspellStemFilter(TokenStream input,
HunspellDictionary dictionary,
boolean dedup,
int recursionCap)
Creates a new HunspellStemFilter that will stem tokens from the given TokenStream using affix rules in the provided
HunspellDictionary
|
HunspellStemFilter(TokenStream input,
HunspellDictionary dictionary,
int recursionCap)
Creates a new HunspellStemFilter that will stem tokens from the given TokenStream using affix rules in the provided
HunspellDictionary
|
Modifier and Type | Method and Description |
---|---|
boolean |
incrementToken() |
void |
reset() |
close, end
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
public HunspellStemFilter(TokenStream input, HunspellDictionary dictionary)
HunspellStemFilter
which deduplicates stems and has a maximum
recursion level of 2.public HunspellStemFilter(TokenStream input, HunspellDictionary dictionary, int recursionCap)
input
- TokenStream whose tokens will be stemmeddictionary
- HunspellDictionary containing the affix rules and words that will be used to stem the tokensrecursionCap
- maximum level of recursion stemmer can go into, defaults to 2
public HunspellStemFilter(TokenStream input, HunspellDictionary dictionary, boolean dedup)
HunspellStemFilter
which has a maximum recursion level of 2.public HunspellStemFilter(TokenStream input, HunspellDictionary dictionary, boolean dedup, int recursionCap)
input
- TokenStream whose tokens will be stemmeddictionary
- HunspellDictionary containing the affix rules and words that will be used to stem the tokensdedup
- true if only unique terms should be output.recursionCap
- maximum level of recursion stemmer can go into, defaults to 2
public boolean incrementToken() throws IOException
incrementToken
in class TokenStream
IOException
public void reset() throws IOException
reset
in class TokenFilter
IOException
Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.