Class SuggestStopFilterFactory
java.lang.Object
org.apache.lucene.analysis.AbstractAnalysisFactory
org.apache.lucene.analysis.TokenFilterFactory
org.apache.lucene.search.suggest.analyzing.SuggestStopFilterFactory
- All Implemented Interfaces:
ResourceLoaderAware
Factory for SuggestStopFilter.
<fieldType name="autosuggest" class="solr.TextField" positionIncrementGap="100" autoGeneratePhraseQueries="true">
  <analyzer>
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.SuggestStopFilterFactory" ignoreCase="true" words="stopwords.txt" format="wordset"/>
  </analyzer>
</fieldType>
All attributes are optional:
- ignoreCase defaults to false
- words should be the name of a stopwords file to parse; if not specified, the factory will use EnglishAnalyzer.ENGLISH_STOP_WORDS_SET
- format defines how the words file will be parsed, and defaults to wordset. If words is not specified, then format must not be specified.
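The attribute rules above can be sketched in plain Java. This is a minimal illustration, not Lucene's implementation: the class name StopArgsSketch and its fields are hypothetical, and the exact (case-sensitive) match on the format value is an assumption of the sketch.

```java
import java.util.Map;

/** Hypothetical helper, not part of Lucene: mirrors the attribute rules described above. */
final class StopArgsSketch {
    final boolean ignoreCase;
    final String words;   // null means "fall back to the built-in English stop word set"
    final String format;  // null when no words file is given

    StopArgsSketch(Map<String, String> args) {
        // ignoreCase defaults to false when the attribute is absent.
        this.ignoreCase = Boolean.parseBoolean(args.getOrDefault("ignoreCase", "false"));
        this.words = args.get("words");
        String requestedFormat = args.get("format");

        if (words == null) {
            // format is only meaningful together with a words file.
            if (requestedFormat != null) {
                throw new IllegalArgumentException(
                    "format must not be specified without words: " + requestedFormat);
            }
            this.format = null;
        } else {
            // format defaults to wordset when a words file is given.
            String resolved = (requestedFormat == null) ? "wordset" : requestedFormat;
            // Assumption of this sketch: values are matched exactly.
            if (!resolved.equals("wordset") && !resolved.equals("snowball")) {
                throw new IllegalArgumentException("unknown format: " + resolved);
            }
            this.format = resolved;
        }
    }
}
```

With this sketch, an empty argument map yields ignoreCase = false with no words file and no format, while passing only format="snowball" is rejected with an IllegalArgumentException.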
The valid values for the format option are:
- wordset: This is the default format, which supports one word per line (including any intra-word whitespace) and allows whole line comments beginning with the "#" character. Blank lines are ignored. See WordlistLoader.getLines for details.
- snowball: This format allows for multiple words specified on each line, and trailing comments may be specified using the vertical line ("|"). Blank lines are ignored. See WordlistLoader.getSnowballWordSet for details.
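To make the difference between the two formats concrete, here is a minimal sketch of both parsing rules as described above. It is not the WordlistLoader code: the class StopwordFormatSketch and its two methods are hypothetical names, and details such as trimming surrounding whitespace are assumptions of the sketch.

```java
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical illustration of the two file formats; not Lucene's WordlistLoader. */
final class StopwordFormatSketch {

    /** wordset: one entry per line, whole-line "#" comments, blank lines ignored. */
    static Set<String> parseWordset(List<String> lines) {
        Set<String> words = new LinkedHashSet<>();
        for (String line : lines) {
            if (line.startsWith("#")) {
                continue; // whole-line comment
            }
            String word = line.trim(); // intra-word whitespace is kept
            if (!word.isEmpty()) {
                words.add(word);
            }
        }
        return words;
    }

    /** snowball: several words per line, everything after "|" is a comment. */
    static Set<String> parseSnowball(List<String> lines) {
        Set<String> words = new LinkedHashSet<>();
        for (String line : lines) {
            int bar = line.indexOf('|');
            String content = (bar >= 0) ? line.substring(0, bar) : line;
            for (String word : content.trim().split("\\s+")) {
                if (!word.isEmpty()) { // skips blank and comment-only lines
                    words.add(word);
                }
            }
        }
        return words;
    }
}
```

Note the practical consequence: a line such as "new york" is a single entry under wordset but two separate entries under snowball.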
- Since: 5.0.0
- SPI Name (case-insensitive: if the name is 'htmlStrip', 'htmlstrip' can be used when looking up the service): "suggestStop"
Field Summary
static final String FORMAT_SNOWBALL
    multiple words may be specified on each line, trailing comments start with "|"
static final String FORMAT_WORDSET
    the default format, one word per line, whole line comments start with "#"
static final String NAME
    SPI name
Fields inherited from class org.apache.lucene.analysis.AbstractAnalysisFactory
LUCENE_MATCH_VERSION_PARAM, luceneMatchVersion
Constructor Summary
SuggestStopFilterFactory()
    Default ctor for compatibility with SPI
SuggestStopFilterFactory(Map<String, String> args)
    Creates a new SuggestStopFilterFactory
Method Summary
TokenStream create(TokenStream input)
CharArraySet getStopWords()
    Returns the configured stopword set
void inform(ResourceLoader loader)
boolean isIgnoreCase()
    Whether or not to ignore case
Methods inherited from class org.apache.lucene.analysis.TokenFilterFactory
availableTokenFilters, findSPIName, forName, lookupClass, normalize, reloadTokenFilters
Methods inherited from class org.apache.lucene.analysis.AbstractAnalysisFactory
defaultCtorException, get, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitAt, splitFileNames
Field Details
NAME
SPI name
FORMAT_WORDSET
the default format, one word per line, whole line comments start with "#"
FORMAT_SNOWBALL
multiple words may be specified on each line, trailing comments start with "|"
Constructor Details
SuggestStopFilterFactory
public SuggestStopFilterFactory(Map<String, String> args)
Creates a new SuggestStopFilterFactory
SuggestStopFilterFactory
public SuggestStopFilterFactory()
Default ctor for compatibility with SPI
Method Details
inform
public void inform(ResourceLoader loader) throws IOException
- Specified by: inform in interface ResourceLoaderAware
- Throws: IOException
isIgnoreCase
public boolean isIgnoreCase()
Whether or not to ignore case
getStopWords
public CharArraySet getStopWords()
Returns the configured stopword set
create
public TokenStream create(TokenStream input)
- Specified by: create in class TokenFilterFactory