Class HunspellStemFilter

  • All Implemented Interfaces:
    Closeable, AutoCloseable

    public final class HunspellStemFilter
    extends TokenFilter
    TokenFilter that uses hunspell affix rules and words to stem tokens. Since hunspell supports a word having multiple stems, this filter can emit multiple tokens for each consumed token

    Note: This filter is aware of the KeywordAttribute. To prevent certain terms from being passed to the stemmer KeywordAttribute.isKeyword() should be set to true in a previous TokenStream.

    Note: For including the original term as well as the stemmed version, see KeywordRepeatFilterFactory

    WARNING: This API is experimental and might change in incompatible ways in the next release.