Class ClassicAnalyzer

All Implemented Interfaces:
Closeable, AutoCloseable

public final class ClassicAnalyzer extends StopwordAnalyzerBase
Filters ClassicTokenizer with ClassicFilter, LowerCaseFilter and StopFilter, using a list of English stop words.

ClassicAnalyzer was named StandardAnalyzer in Lucene versions prior to 3.1. As of 3.1, StandardAnalyzer implements Unicode text segmentation, as specified by UAX#29.

Since:
3.1
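The filter chain described above can be observed by running a short string through the analyzer. The following is a minimal sketch, not part of the original documentation. It assumes a Lucene 9.x layout where the class lives in `org.apache.lucene.analysis.classic` (earlier releases use `org.apache.lucene.analysis.standard`). The class name `ClassicAnalyzerDemo`, the helper `tokens`, and the field name `"body"` are hypothetical.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.TokenStream;
import org.apache.lucene.analysis.classic.ClassicAnalyzer; // assumed package (Lucene 9.x)
import org.apache.lucene.analysis.tokenattributes.CharTermAttribute;

public class ClassicAnalyzerDemo {

    // Hypothetical helper: collects the terms an analyzer emits for a text.
    static List<String> tokens(Analyzer analyzer, String text) throws IOException {
        List<String> out = new ArrayList<>();
        // "body" is an arbitrary field name chosen for illustration.
        try (TokenStream ts = analyzer.tokenStream("body", text)) {
            CharTermAttribute term = ts.addAttribute(CharTermAttribute.class);
            ts.reset();                      // required before the first incrementToken()
            while (ts.incrementToken()) {
                out.add(term.toString());
            }
            ts.end();                        // finalizes end-of-stream state
        }
        return out;
    }

    public static void main(String[] args) throws IOException {
        // Default constructor: uses STOP_WORDS_SET, so "The" should be
        // lowercased and then dropped as an English stop word.
        try (Analyzer analyzer = new ClassicAnalyzer()) {
            System.out.println(tokens(analyzer, "The Quick Brown Fox"));
        }
    }
}
```

The analyzer is opened in a try-with-resources block because `ClassicAnalyzer` implements `Closeable`.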
  • Field Details

    • DEFAULT_MAX_TOKEN_LENGTH

      public static final int DEFAULT_MAX_TOKEN_LENGTH
      Default maximum allowed token length
    • STOP_WORDS_SET

      public static final CharArraySet STOP_WORDS_SET
      An unmodifiable set containing some common English words that are usually not useful for searching.
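Because `STOP_WORDS_SET` is unmodifiable, extending it means working on a copy. The sketch below illustrates one way to do that. It assumes `CharArraySet` is in `org.apache.lucene.analysis` and `ClassicAnalyzer` in `org.apache.lucene.analysis.classic`, as in Lucene 9.x. The class `StopWordsSetDemo` and the method `extendedStopWords` are hypothetical names.

```java
import org.apache.lucene.analysis.CharArraySet;
import org.apache.lucene.analysis.classic.ClassicAnalyzer; // assumed package (Lucene 9.x)

public class StopWordsSetDemo {

    // Hypothetical helper: returns a modifiable copy of the default
    // stop words with extra entries added.
    static CharArraySet extendedStopWords(String... extra) {
        CharArraySet copy = CharArraySet.copy(ClassicAnalyzer.STOP_WORDS_SET);
        for (String word : extra) {
            copy.add(word);
        }
        return copy;
    }

    public static void main(String[] args) {
        // The shared default set rejects modification.
        try {
            ClassicAnalyzer.STOP_WORDS_SET.add("lucene");
        } catch (UnsupportedOperationException e) {
            System.out.println("STOP_WORDS_SET is read-only");
        }

        CharArraySet custom = extendedStopWords("lucene");
        System.out.println(custom.contains("lucene"));
        System.out.println("Default max token length: "
                + ClassicAnalyzer.DEFAULT_MAX_TOKEN_LENGTH);
    }
}
```

The resulting set can be passed to the `ClassicAnalyzer(CharArraySet)` constructor.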
  • Constructor Details

    • ClassicAnalyzer

      public ClassicAnalyzer(CharArraySet stopWords)
      Builds an analyzer with the given stop words.
      Parameters:
      stopWords - the set of stop words to remove from the token stream
    • ClassicAnalyzer

      public ClassicAnalyzer()
      Builds an analyzer with the default stop words (STOP_WORDS_SET).
    • ClassicAnalyzer

      public ClassicAnalyzer(Reader stopwords) throws IOException
      Builds an analyzer with the stop words from the given reader.
      Parameters:
      stopwords - Reader to read stop words from
      Throws:
      IOException
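The two constructors that take custom stop words can be compared side by side. This is a minimal sketch under stated assumptions: Lucene 9.x package names, and a reader that supplies one stop word per line (the format is not specified on this page and is assumed here). The class `CustomStopWordsDemo` and its helper methods are hypothetical.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.CharArraySet;
import org.apache.lucene.analysis.TokenStream;
import org.apache.lucene.analysis.classic.ClassicAnalyzer; // assumed package (Lucene 9.x)
import org.apache.lucene.analysis.tokenattributes.CharTermAttribute;

public class CustomStopWordsDemo {

    // Hypothetical helper: collects the terms an analyzer emits for a text.
    static List<String> tokens(Analyzer analyzer, String text) throws IOException {
        List<String> out = new ArrayList<>();
        try (TokenStream ts = analyzer.tokenStream("body", text)) {
            CharTermAttribute term = ts.addAttribute(CharTermAttribute.class);
            ts.reset();
            while (ts.incrementToken()) {
                out.add(term.toString());
            }
            ts.end();
        }
        return out;
    }

    // Stop words supplied as an in-memory set (second argument: ignore case).
    static List<String> withSet(String text) throws IOException {
        CharArraySet stopWords = new CharArraySet(Arrays.asList("quick", "brown"), true);
        try (Analyzer analyzer = new ClassicAnalyzer(stopWords)) {
            return tokens(analyzer, text);
        }
    }

    // Stop words supplied through a Reader; one word per line is assumed.
    static List<String> withReader(String text) throws IOException {
        Reader stopWords = new StringReader("fox\njumps\n");
        try (Analyzer analyzer = new ClassicAnalyzer(stopWords)) {
            return tokens(analyzer, text);
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println(withSet("The Quick Brown Fox"));
        System.out.println(withReader("The Quick Brown Fox"));
    }
}
```

In both cases the custom list replaces the default one, so words from `STOP_WORDS_SET` such as "the" are no longer removed unless they are included explicitly.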
  • Method Details