SnowballFilter
with
RussianStemmer
instead, which has the
same functionality. This filter will be removed in Lucene 4.0@Deprecated public final class RussianStemFilter extends TokenFilter
TokenFilter
that stems Russian words.
The implementation was inspired by GermanStemFilter.
The input should be filtered by LowerCaseFilter
before passing it to RussianStemFilter ,
because RussianStemFilter only works with lowercase characters.
To prevent terms from being stemmed use an instance of
KeywordMarkerFilter
or a custom TokenFilter
that sets
the KeywordAttribute
before this TokenStream
.
KeywordMarkerFilter
AttributeSource.AttributeFactory, AttributeSource.State
input
Constructor and Description |
---|
RussianStemFilter(TokenStream in)
Deprecated.
|
Modifier and Type | Method and Description |
---|---|
boolean |
incrementToken()
Deprecated.
Returns the next token in the stream, or null at EOS
|
void |
setStemmer(org.apache.lucene.analysis.ru.RussianStemmer stemmer)
Deprecated.
Set a alternative/custom
RussianStemmer for this filter. |
close, end, reset
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
public RussianStemFilter(TokenStream in)
public final boolean incrementToken() throws IOException
incrementToken
in class TokenStream
IOException
public void setStemmer(org.apache.lucene.analysis.ru.RussianStemmer stemmer)
RussianStemmer
for this filter.