public abstract class StopwordAnalyzerBase extends ReusableAnalyzerBase
ReusableAnalyzerBase.TokenStreamComponents| Modifier and Type | Field and Description |
|---|---|
protected Version |
matchVersion |
protected CharArraySet |
stopwords
An immutable stopword set
|
| Modifier | Constructor and Description |
|---|---|
protected |
StopwordAnalyzerBase(Version version)
Creates a new Analyzer with an empty stopword set
|
protected |
StopwordAnalyzerBase(Version version,
Set<?> stopwords)
Creates a new instance initialized with the given stopword set
|
| Modifier and Type | Method and Description |
|---|---|
Set<?> |
getStopwordSet()
Returns the analyzer's stopword set or an empty set if the analyzer has no
stopwords
|
protected static CharArraySet |
loadStopwordSet(boolean ignoreCase,
Class<? extends ReusableAnalyzerBase> aClass,
String resource,
String comment)
Creates a CharArraySet from a file resource associated with a class.
|
protected static CharArraySet |
loadStopwordSet(File stopwords,
Version matchVersion)
Creates a CharArraySet from a file.
|
protected static CharArraySet |
loadStopwordSet(Reader stopwords,
Version matchVersion)
Creates a CharArraySet from a file.
|
createComponents, initReader, reusableTokenStream, tokenStreamclose, getOffsetGap, getPositionIncrementGap, getPreviousTokenStream, setPreviousTokenStreamprotected final CharArraySet stopwords
protected final Version matchVersion
protected StopwordAnalyzerBase(Version version, Set<?> stopwords)
version - the Lucene version for cross version compatibilitystopwords - the analyzer's stopword setprotected StopwordAnalyzerBase(Version version)
version - the Lucene version for cross version compatibilitypublic Set<?> getStopwordSet()
protected static CharArraySet loadStopwordSet(boolean ignoreCase, Class<? extends ReusableAnalyzerBase> aClass, String resource, String comment) throws IOException
Class.getResourceAsStream(String)).ignoreCase - true if the set should ignore the case of the
stopwords, otherwise falseaClass - a class that is associated with the given stopwordResourceresource - name of the resource file associated with the given classcomment - comment string to ignore in the stopword fileIOException - if loading the stopwords throws an IOExceptionprotected static CharArraySet loadStopwordSet(File stopwords, Version matchVersion) throws IOException
stopwords - the stopwords file to loadmatchVersion - the Lucene version for cross version compatibilityIOException - if loading the stopwords throws an IOExceptionprotected static CharArraySet loadStopwordSet(Reader stopwords, Version matchVersion) throws IOException
stopwords - the stopwords reader to loadmatchVersion - the Lucene version for cross version compatibilityIOException - if loading the stopwords throws an IOException