Class StopwordAnalyzerBase

  • All Implemented Interfaces:
    Closeable, AutoCloseable
    Direct Known Subclasses:
    StandardAnalyzer

    public abstract class StopwordAnalyzerBase
    extends Analyzer
    Base class for Analyzers that need to make use of stopword sets.
    Since:
    3.1
    • Field Detail

      • stopwords

        protected final CharArraySet stopwords
        An immutable stopword set
    • Constructor Detail

      • StopwordAnalyzerBase

        protected StopwordAnalyzerBase(CharArraySet stopwords)
        Creates a new instance initialized with the given stopword set
        Parameters:
        stopwords - the analyzer's stopword set
      • StopwordAnalyzerBase

        protected StopwordAnalyzerBase()
        Creates a new Analyzer with an empty stopword set
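The field and the two constructors above suggest a simple pattern: the stopword set is fixed at construction time, and the no-argument constructor falls back to an empty set. The following plain-Java sketch mirrors that shape only. It is not Lucene code: `StopwordHolderSketch` and `EnglishSketch` are hypothetical names, `java.util.Set<String>` stands in for `CharArraySet`, and the defensive copy and null handling are assumptions of the sketch.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

// Hypothetical stand-in for the pattern described above; not part of Lucene.
abstract class StopwordHolderSketch {
    // Immutable set, fixed when the instance is created.
    protected final Set<String> stopwords;

    // Copies the given words and wraps the copy in an unmodifiable view.
    // Treating null as "no stopwords" is an assumption of this sketch.
    protected StopwordHolderSketch(Set<String> stopwords) {
        this.stopwords = stopwords == null
            ? Collections.<String>emptySet()
            : Collections.unmodifiableSet(new HashSet<>(stopwords));
    }

    // No stopwords given: fall back to an empty set.
    protected StopwordHolderSketch() {
        this(null);
    }

    // Never returns null; returns an empty set when there are no stopwords.
    public Set<String> getStopwordSet() {
        return stopwords;
    }
}

// Hypothetical concrete subclass, needed because the base class is abstract.
final class EnglishSketch extends StopwordHolderSketch {
    EnglishSketch(Set<String> stopwords) { super(stopwords); }
    EnglishSketch() { super(); }

    public static void main(String[] args) {
        Set<String> words = new HashSet<>(Arrays.asList("the", "and"));
        System.out.println(new EnglishSketch(words).getStopwordSet().size()); // prints 2
        System.out.println(new EnglishSketch().getStopwordSet().isEmpty());   // prints true
    }
}
```

Because the set is wrapped in an unmodifiable view, callers of the getter cannot add or remove words, which is one way to honor the "immutable stopword set" contract.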
    • Method Detail

      • getStopwordSet

        public CharArraySet getStopwordSet()
        Returns the analyzer's stopword set or an empty set if the analyzer has no stopwords
        Returns:
        the analyzer's stopword set or an empty set if the analyzer has no stopwords
      • loadStopwordSet

        protected static CharArraySet loadStopwordSet(boolean ignoreCase,
                                                      Class<? extends Analyzer> aClass,
                                                      String resource,
                                                      String comment)
                                               throws IOException
        Creates a CharArraySet from a file resource associated with a class (see Class.getResourceAsStream(String)).
        Parameters:
        ignoreCase - true if the set should ignore the case of the stopwords, otherwise false
        aClass - a class that is associated with the given resource
        resource - name of the resource file associated with the given class
        comment - comment string to ignore in the stopword file
        Returns:
        a CharArraySet containing the distinct stopwords from the given file
        Throws:
        IOException - if loading the stopwords throws an IOException
      • loadStopwordSet

        protected static CharArraySet loadStopwordSet(Path stopwords)
                                               throws IOException
        Creates a CharArraySet from a path.
        Parameters:
        stopwords - the stopwords file to load
        Returns:
        a CharArraySet containing the distinct stopwords from the given file
        Throws:
        IOException - if loading the stopwords throws an IOException
      • loadStopwordSet

        protected static CharArraySet loadStopwordSet(Reader stopwords)
                                               throws IOException
        Creates a CharArraySet from a reader.
        Parameters:
        stopwords - the stopwords reader to load
        Returns:
        a CharArraySet containing the distinct stopwords from the given reader
        Throws:
        IOException - if loading the stopwords throws an IOException
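The loaders above all turn stopword text into a set of distinct words. As a rough illustration of what the `ignoreCase` and `comment` parameters imply, here is a plain-Java sketch that assumes a one-word-per-line file in which lines starting with the comment string are skipped. This page does not spell out the file format, so that is an assumption. The sketch is not Lucene's implementation: `StopwordFileSketch` and `load` are hypothetical names, `java.util.Set<String>` stands in for `CharArraySet`, and lowercasing the entries only approximates a case-insensitive set.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.Collections;
import java.util.LinkedHashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical stand-in; not part of Lucene.
final class StopwordFileSketch {

    // Reads one word per line, skipping blank lines and lines that start
    // with the comment prefix (assumed file format). Closes the reader.
    static Set<String> load(Reader in, String comment, boolean ignoreCase) throws IOException {
        Set<String> words = new LinkedHashSet<>();
        try (BufferedReader br = new BufferedReader(in)) {
            String line;
            while ((line = br.readLine()) != null) {
                if (line.startsWith(comment)) {
                    continue; // comment line
                }
                String word = line.trim();
                if (word.isEmpty()) {
                    continue; // blank line
                }
                // Lowercasing approximates a case-insensitive set.
                words.add(ignoreCase ? word.toLowerCase(Locale.ROOT) : word);
            }
        }
        return Collections.unmodifiableSet(words);
    }

    public static void main(String[] args) throws IOException {
        String file = "# English stopwords\nThe\nthe\n\nand\n";
        System.out.println(load(new StringReader(file), "#", true));  // prints [the, and]
        System.out.println(load(new StringReader(file), "#", false)); // prints [The, the, and]
    }
}
```

Taking a `Reader` keeps the sketch independent of where the text comes from: a class resource, a `Path`, or an in-memory string can each be wrapped in a reader before calling `load`.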