Class KStemFilter

All Implemented Interfaces:
Closeable, AutoCloseable, Unwrappable<TokenStream>

public final class KStemFilter extends TokenFilter
A high-performance kstem filter for english.

See "Viewing Morphology as an Inference Process" (Krovetz, R., Proceedings of the Sixteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 191-203, 1993).

All terms must already be lowercased for this filter to work correctly.

Note: This filter is aware of the KeywordAttribute. To prevent certain terms from being passed to the stemmer KeywordAttribute.isKeyword() should be set to true in a previous TokenStream.

Note: For including the original term as well as the stemmed version, see KeywordRepeatFilterFactory

  • Constructor Details

  • Method Details

    • incrementToken

      public boolean incrementToken() throws IOException
      Returns the next, stemmed, input Token.
      Specified by:
      incrementToken in class TokenStream
      Returns:
      The stemmed form of a token.
      Throws:
      IOException - If there is a low-level I/O error.