public final class HunspellStemFilter extends TokenFilter
Note: This filter is aware of the KeywordAttribute. To prevent
certain terms from being passed to the stemmer
KeywordAttribute.isKeyword() should be set to true
in a previous TokenStream.
Note: For including the original term as well as the stemmed version, see
KeywordRepeatFilterFactory
AttributeSource.AttributeFactory, AttributeSource.Stateinput| Constructor and Description |
|---|
HunspellStemFilter(TokenStream input,
HunspellDictionary dictionary)
Create a
HunspellStemFilter which deduplicates stems and has a maximum
recursion level of 2. |
HunspellStemFilter(TokenStream input,
HunspellDictionary dictionary,
boolean dedup)
Create a
HunspellStemFilter which has a maximum recursion level of 2. |
HunspellStemFilter(TokenStream input,
HunspellDictionary dictionary,
boolean dedup,
int recursionCap)
Creates a new HunspellStemFilter that will stem tokens from the given TokenStream using affix rules in the provided
HunspellDictionary
|
HunspellStemFilter(TokenStream input,
HunspellDictionary dictionary,
int recursionCap)
Creates a new HunspellStemFilter that will stem tokens from the given TokenStream using affix rules in the provided
HunspellDictionary
|
| Modifier and Type | Method and Description |
|---|---|
boolean |
incrementToken() |
void |
reset() |
close, endaddAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toStringpublic HunspellStemFilter(TokenStream input, HunspellDictionary dictionary)
HunspellStemFilter which deduplicates stems and has a maximum
recursion level of 2.public HunspellStemFilter(TokenStream input, HunspellDictionary dictionary, int recursionCap)
input - TokenStream whose tokens will be stemmeddictionary - HunspellDictionary containing the affix rules and words that will be used to stem the tokensrecursionCap - maximum level of recursion stemmer can go into, defaults to 2public HunspellStemFilter(TokenStream input, HunspellDictionary dictionary, boolean dedup)
HunspellStemFilter which has a maximum recursion level of 2.public HunspellStemFilter(TokenStream input, HunspellDictionary dictionary, boolean dedup, int recursionCap)
input - TokenStream whose tokens will be stemmeddictionary - HunspellDictionary containing the affix rules and words that will be used to stem the tokensdedup - true if only unique terms should be output.recursionCap - maximum level of recursion stemmer can go into, defaults to 2public boolean incrementToken()
throws IOException
incrementToken in class TokenStreamIOExceptionpublic void reset()
throws IOException
reset in class TokenFilterIOExceptionCopyright © 2000-2014 Apache Software Foundation. All Rights Reserved.