Package | Description |
---|---|
org.apache.lucene.analysis.ar |
Analyzer for Arabic.
|
org.apache.lucene.analysis.bg |
Analyzer for Bulgarian.
|
org.apache.lucene.analysis.br |
Analyzer for Brazilian Portuguese.
|
org.apache.lucene.analysis.ca |
Analyzer for Catalan.
|
org.apache.lucene.analysis.cjk |
Analyzer for Chinese, Japanese, and Korean, which indexes bigrams.
|
org.apache.lucene.analysis.commongrams |
Construct n-grams for frequently occurring terms and phrases.
|
org.apache.lucene.analysis.compound |
A filter that decomposes compound words, as found in many Germanic
languages, into their word parts.
|
org.apache.lucene.analysis.core |
Basic, general-purpose analysis components.
|
org.apache.lucene.analysis.cz |
Analyzer for Czech.
|
org.apache.lucene.analysis.da |
Analyzer for Danish.
|
org.apache.lucene.analysis.de |
Analyzer for German.
|
org.apache.lucene.analysis.el |
Analyzer for Greek.
|
org.apache.lucene.analysis.en |
Analyzer for English.
|
org.apache.lucene.analysis.es |
Analyzer for Spanish.
|
org.apache.lucene.analysis.eu |
Analyzer for Basque.
|
org.apache.lucene.analysis.fa |
Analyzer for Persian.
|
org.apache.lucene.analysis.fi |
Analyzer for Finnish.
|
org.apache.lucene.analysis.fr |
Analyzer for French.
|
org.apache.lucene.analysis.ga |
Analyzer for Irish.
|
org.apache.lucene.analysis.gl |
Analyzer for Galician.
|
org.apache.lucene.analysis.hi |
Analyzer for Hindi.
|
org.apache.lucene.analysis.hu |
Analyzer for Hungarian.
|
org.apache.lucene.analysis.hy |
Analyzer for Armenian.
|
org.apache.lucene.analysis.id |
Analyzer for Indonesian.
|
org.apache.lucene.analysis.it |
Analyzer for Italian.
|
org.apache.lucene.analysis.lv |
Analyzer for Latvian.
|
org.apache.lucene.analysis.miscellaneous |
Miscellaneous TokenStreams.
|
org.apache.lucene.analysis.nl |
Analyzer for Dutch.
|
org.apache.lucene.analysis.no |
Analyzer for Norwegian.
|
org.apache.lucene.analysis.pt |
Analyzer for Portuguese.
|
org.apache.lucene.analysis.ro |
Analyzer for Romanian.
|
org.apache.lucene.analysis.ru |
Analyzer for Russian.
|
org.apache.lucene.analysis.snowball |
TokenFilter and Analyzer implementations that use Snowball
stemmers. |
org.apache.lucene.analysis.standard |
Fast, general-purpose grammar-based tokenizers.
|
org.apache.lucene.analysis.sv |
Analyzer for Swedish.
|
org.apache.lucene.analysis.th |
Analyzer for Thai.
|
org.apache.lucene.analysis.tr |
Analyzer for Turkish.
|
org.apache.lucene.analysis.util |
Utility functions for text analysis.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
ArabicAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
ArabicAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
ArabicAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
BulgarianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
BulgarianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
BulgarianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
BrazilianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
BrazilianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
BrazilianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion words.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
CatalanAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
CatalanAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
CatalanAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
CJKAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
CJKAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
Modifier and Type | Method and Description |
---|---|
CharArraySet |
CommonGramsFilterFactory.getCommonWords() |
Constructor and Description |
---|
CommonGramsFilter(Version matchVersion,
TokenStream input,
CharArraySet commonWords)
Construct a token stream filtering the given input using a Set of common
words to create bigrams.
|
Modifier and Type | Field and Description |
---|---|
protected CharArraySet |
CompoundWordTokenFilterBase.dictionary |
Constructor and Description |
---|
CompoundWordTokenFilterBase(Version matchVersion,
TokenStream input,
CharArraySet dictionary) |
CompoundWordTokenFilterBase(Version matchVersion,
TokenStream input,
CharArraySet dictionary,
boolean onlyLongestMatch) |
CompoundWordTokenFilterBase(Version matchVersion,
TokenStream input,
CharArraySet dictionary,
int minWordSize,
int minSubwordSize,
int maxSubwordSize,
boolean onlyLongestMatch) |
DictionaryCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
CharArraySet dictionary)
Creates a new DictionaryCompoundWordTokenFilter. |
DictionaryCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
CharArraySet dictionary,
int minWordSize,
int minSubwordSize,
int maxSubwordSize,
boolean onlyLongestMatch)
Creates a new DictionaryCompoundWordTokenFilter. |
HyphenationCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
HyphenationTree hyphenator,
CharArraySet dictionary)
Creates a new HyphenationCompoundWordTokenFilter instance. |
HyphenationCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
HyphenationTree hyphenator,
CharArraySet dictionary,
int minWordSize,
int minSubwordSize,
int maxSubwordSize,
boolean onlyLongestMatch)
Creates a new HyphenationCompoundWordTokenFilter instance. |
Modifier and Type | Field and Description |
---|---|
static CharArraySet |
StopAnalyzer.ENGLISH_STOP_WORDS_SET
An unmodifiable set containing some common English words that are not usually useful
for searching.
|
Modifier and Type | Method and Description |
---|---|
CharArraySet |
StopFilterFactory.getStopWords() |
static CharArraySet |
StopFilter.makeStopSet(Version matchVersion,
List<?> stopWords)
Builds a Set from a list of stop words,
appropriate for passing into the StopFilter constructor.
|
static CharArraySet |
StopFilter.makeStopSet(Version matchVersion,
List<?> stopWords,
boolean ignoreCase)
Creates a stopword set from the given stopword list.
|
static CharArraySet |
StopFilter.makeStopSet(Version matchVersion,
String... stopWords)
Builds a Set from an array of stop words,
appropriate for passing into the StopFilter constructor.
|
static CharArraySet |
StopFilter.makeStopSet(Version matchVersion,
String[] stopWords,
boolean ignoreCase)
Creates a stopword set from the given stopword array.
|
Constructor and Description |
---|
StopAnalyzer(Version matchVersion,
CharArraySet stopWords)
Builds an analyzer with the stop words from the given set.
|
StopFilter(Version matchVersion,
TokenStream in,
CharArraySet stopWords)
Constructs a filter which removes words from the input TokenStream that are
named in the Set.
|
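The `makeStopSet` and `StopFilter` entries above describe building a stop set (optionally ignoring case) and removing the words named in it from a token stream. The following is a minimal plain-JDK sketch of that idea, not Lucene's implementation: `StopWordSketch` and both of its methods are hypothetical names, tokens are modelled as a `List<String>` rather than a `TokenStream`, and case folding via `toLowerCase(Locale.ROOT)` is an assumption made for illustration.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

/** Plain-JDK illustration of stop-word removal; hypothetical, not part of Lucene. */
final class StopWordSketch {

    /** Builds a stop set; when ignoreCase is true, entries are stored lower-cased. */
    static Set<String> makeStopSet(List<String> stopWords, boolean ignoreCase) {
        Set<String> set = new HashSet<>();
        for (String w : stopWords) {
            set.add(ignoreCase ? w.toLowerCase(Locale.ROOT) : w);
        }
        return set;
    }

    /** Returns the tokens that are not named in the stop set, in their original order. */
    static List<String> removeStopWords(List<String> tokens, Set<String> stopSet, boolean ignoreCase) {
        List<String> kept = new ArrayList<>();
        for (String t : tokens) {
            // Fold the lookup key the same way the set was built.
            String key = ignoreCase ? t.toLowerCase(Locale.ROOT) : t;
            if (!stopSet.contains(key)) {
                kept.add(t);
            }
        }
        return kept;
    }
}
```

With a stop set built from "The" and "of" and `ignoreCase` set to true, the tokens "the", "art", "OF", "war" reduce to "art", "war". In Lucene itself the same roles are played by `StopFilter.makeStopSet` and the `StopFilter` constructor listed above.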
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
CzechAnalyzer.getDefaultStopSet()
Returns a set of default Czech stop words.
|
Constructor and Description |
---|
CzechAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
CzechAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionTable)
Builds an analyzer with the given stop words and a set of words to be
excluded from the
CzechStemFilter. |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
DanishAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
DanishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
DanishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
GermanAnalyzer.getDefaultStopSet()
Returns a set of default German stop words.
|
Constructor and Description |
---|
GermanAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
GermanAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
GreekAnalyzer.getDefaultStopSet()
Returns a set of default Greek stop words.
|
Constructor and Description |
---|
GreekAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
EnglishAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
EnglishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
EnglishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
SpanishAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
SpanishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
SpanishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
BasqueAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
BasqueAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
BasqueAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
PersianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
PersianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
FinnishAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
FinnishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
FinnishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Field and Description |
---|---|
static CharArraySet |
FrenchAnalyzer.DEFAULT_ARTICLES
Default set of articles for ElisionFilter
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
FrenchAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
FrenchAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
FrenchAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclutionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
IrishAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
IrishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
IrishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
GalicianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
GalicianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
GalicianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
HindiAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
HindiAnalyzer(Version version,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
HindiAnalyzer(Version version,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
HungarianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
HungarianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
HungarianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
ArmenianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
ArmenianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
ArmenianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
IndonesianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
IndonesianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
IndonesianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
ItalianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
ItalianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
ItalianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
LatvianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
LatvianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
LatvianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
CharArraySet |
KeepWordFilterFactory.getWords() |
Constructor and Description |
---|
CapitalizationFilter(TokenStream in,
boolean onlyFirstWord,
CharArraySet keep,
boolean forceFirstLetter,
Collection<char[]> okPrefix,
int minWordLength,
int maxWordCount,
int maxTokenLength)
Creates a CapitalizationFilter with the specified parameters.
|
KeepWordFilter(Version version,
boolean enablePositionIncrements,
TokenStream in,
CharArraySet words)
Deprecated.
enablePositionIncrements=false is not supported anymore as of Lucene 4.4.
|
KeepWordFilter(Version version,
TokenStream in,
CharArraySet words)
Creates a new KeepWordFilter. |
PatternAnalyzer(Version matchVersion,
Pattern pattern,
boolean toLowerCase,
CharArraySet stopWords)
Deprecated.
Constructs a new instance with the given parameters.
|
SetKeywordMarkerFilter(TokenStream in,
CharArraySet keywordSet)
Creates a new SetKeywordMarkerFilter that marks the current token as a
keyword, via the KeywordAttribute, if the token's term buffer is contained
in the given set. |
WordDelimiterFilter(TokenStream in,
byte[] charTypeTable,
int configurationFlags,
CharArraySet protWords)
Creates a new WordDelimiterFilter
|
WordDelimiterFilter(TokenStream in,
int configurationFlags,
CharArraySet protWords)
Creates a new WordDelimiterFilter using
WordDelimiterIterator.DEFAULT_WORD_DELIM_TABLE
as its charTypeTable |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
DutchAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
DutchAnalyzer(Version matchVersion,
CharArraySet stopwords) |
DutchAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionTable) |
DutchAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionTable,
CharArrayMap<String> stemOverrideDict) |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
NorwegianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
NorwegianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
NorwegianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Field and Description |
---|---|
protected CharArraySet |
RSLPStemmerBase.RuleWithSetExceptions.exceptions |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
PortugueseAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
PortugueseAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
PortugueseAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
RomanianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
RomanianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
RomanianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
RussianAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
|
Constructor and Description |
---|
RussianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
RussianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Constructor and Description |
---|
SnowballAnalyzer(Version matchVersion,
String name,
CharArraySet stopWords)
Deprecated.
Builds the named analyzer with the given stop words.
|
Modifier and Type | Field and Description |
---|---|
static CharArraySet |
StandardAnalyzer.STOP_WORDS_SET
An unmodifiable set containing some common English words that are usually not
useful for searching.
|
static CharArraySet |
ClassicAnalyzer.STOP_WORDS_SET
An unmodifiable set containing some common English words that are usually not
useful for searching.
|
static CharArraySet |
UAX29URLEmailAnalyzer.STOP_WORDS_SET
An unmodifiable set containing some common English words that are usually not
useful for searching.
|
Constructor and Description |
---|
ClassicAnalyzer(Version matchVersion,
CharArraySet stopWords)
Builds an analyzer with the given stop words.
|
StandardAnalyzer(Version matchVersion,
CharArraySet stopWords)
Builds an analyzer with the given stop words.
|
UAX29URLEmailAnalyzer(Version matchVersion,
CharArraySet stopWords)
Builds an analyzer with the given stop words.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
SwedishAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
SwedishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
SwedishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
ThaiAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
ThaiAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
TurkishAnalyzer.getDefaultStopSet()
Returns an unmodifiable instance of the default stop words set.
|
Constructor and Description |
---|
TurkishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
TurkishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Modifier and Type | Field and Description |
---|---|
static CharArraySet |
CharArraySet.EMPTY_SET |
protected CharArraySet |
StopwordAnalyzerBase.stopwords
An immutable stopword set
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
CharArraySet.copy(Version matchVersion,
Set<?> set)
Returns a copy of the given set as a CharArraySet. |
static CharArraySet |
WordlistLoader.getSnowballWordSet(Reader reader,
CharArraySet result)
Reads stopwords from a stopword list in Snowball format.
|
static CharArraySet |
WordlistLoader.getSnowballWordSet(Reader reader,
Version matchVersion)
Reads stopwords from a stopword list in Snowball format.
|
protected CharArraySet |
AbstractAnalysisFactory.getSnowballWordSet(ResourceLoader loader,
String wordFiles,
boolean ignoreCase)
Same as
AbstractAnalysisFactory.getWordSet(ResourceLoader, String, boolean),
except the input is in Snowball format. |
CharArraySet |
StopwordAnalyzerBase.getStopwordSet()
Returns the analyzer's stopword set, or an empty set if the analyzer has no
stopwords.
|
static CharArraySet |
WordlistLoader.getWordSet(Reader reader,
CharArraySet result)
Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
static CharArraySet |
WordlistLoader.getWordSet(Reader reader,
String comment,
CharArraySet result)
Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
static CharArraySet |
WordlistLoader.getWordSet(Reader reader,
String comment,
Version matchVersion)
Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
static CharArraySet |
WordlistLoader.getWordSet(Reader reader,
Version matchVersion)
Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
protected CharArraySet |
AbstractAnalysisFactory.getWordSet(ResourceLoader loader,
String wordFiles,
boolean ignoreCase)
Returns a
CharArraySet from wordFiles, which
can be a comma-separated list of filenames. |
CharArraySet |
CharArrayMap.keySet()
Returns a CharArraySet view of the map's keys. |
protected static CharArraySet |
StopwordAnalyzerBase.loadStopwordSet(boolean ignoreCase,
Class<? extends Analyzer> aClass,
String resource,
String comment)
Creates a CharArraySet from a file resource associated with a class.
|
protected static CharArraySet |
StopwordAnalyzerBase.loadStopwordSet(File stopwords,
Version matchVersion)
Creates a CharArraySet from a file.
|
protected static CharArraySet |
StopwordAnalyzerBase.loadStopwordSet(Reader stopwords,
Version matchVersion)
Creates a CharArraySet from a Reader.
|
static CharArraySet |
CharArraySet.unmodifiableSet(CharArraySet set)
Returns an unmodifiable CharArraySet. |
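The `WordlistLoader.getWordSet` entries above describe reading one word per line, skipping comment lines, and trimming surrounding whitespace. A rough plain-JDK sketch of that behaviour follows. It is not Lucene's code: `WordListSketch.readWordSet` is a hypothetical name, a `Set<String>` stands in for `CharArraySet`, a comment line is assumed to be any line that starts with the given prefix, and skipping blank lines is a choice made here for illustration.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.UncheckedIOException;
import java.util.LinkedHashSet;
import java.util.Set;

/** Plain-JDK illustration of word-list loading; hypothetical, not part of Lucene. */
final class WordListSketch {

    /**
     * Reads one word per line, ignoring lines that start with the comment
     * prefix (if one is given) and trimming leading and trailing whitespace.
     */
    static Set<String> readWordSet(Reader reader, String comment) {
        Set<String> result = new LinkedHashSet<>();
        try (BufferedReader br = new BufferedReader(reader)) {
            String line;
            while ((line = br.readLine()) != null) {
                if (comment != null && line.startsWith(comment)) {
                    continue; // assumed definition of a comment line
                }
                String word = line.trim();
                if (!word.isEmpty()) { // sketch choice: drop blank lines
                    result.add(word);
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return result;
    }
}
```

Passing a `StringReader` over a small word list with `"#"` as the comment prefix returns only the non-comment words, trimmed. The real methods listed above additionally take either a `Version` or an existing `CharArraySet` to fill.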
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
WordlistLoader.getSnowballWordSet(Reader reader,
CharArraySet result)
Reads stopwords from a stopword list in Snowball format.
|
static CharArraySet |
WordlistLoader.getWordSet(Reader reader,
CharArraySet result)
Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
static CharArraySet |
WordlistLoader.getWordSet(Reader reader,
String comment,
CharArraySet result)
Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
static CharArraySet |
CharArraySet.unmodifiableSet(CharArraySet set)
Returns an unmodifiable CharArraySet. |
Constructor and Description |
---|
ElisionFilter(TokenStream input,
CharArraySet articles)
Constructs an elision filter with the given set of articles.
|
StopwordAnalyzerBase(Version version,
CharArraySet stopwords)
Creates a new instance initialized with the given stopword set
|
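The `ElisionFilter` constructor above takes a set of articles, such as `FrenchAnalyzer.DEFAULT_ARTICLES`. As an illustration of what such a set is used for, the sketch below strips an elided article from the front of a single token, so that "l'avion" becomes "avion". This is a simplified plain-JDK sketch, not Lucene's implementation: `ElisionSketch.stripElision` is a hypothetical name, the token is a `String` rather than a term attribute, and the lower-casing of the prefix is an assumption.

```java
import java.util.Locale;
import java.util.Set;

/** Plain-JDK illustration of elision removal; hypothetical, not part of Lucene. */
final class ElisionSketch {

    /**
     * If the text before the first apostrophe is a known article,
     * returns the remainder of the token; otherwise returns the token unchanged.
     */
    static String stripElision(String token, Set<String> articles) {
        int idx = -1;
        for (int i = 0; i < token.length(); i++) {
            char c = token.charAt(i);
            // Accept both the ASCII apostrophe and the typographic one (U+2019).
            if (c == '\'' || c == '\u2019') {
                idx = i;
                break;
            }
        }
        if (idx > 0) {
            String prefix = token.substring(0, idx).toLowerCase(Locale.ROOT);
            if (articles.contains(prefix)) {
                return token.substring(idx + 1);
            }
        }
        return token;
    }
}
```

With an article set containing "l" and "d", the token "l'avion" becomes "avion", while "aujourd'hui" is left alone because "aujourd" is not in the set.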
Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.