Package | Description |
---|---|
org.apache.lucene.analysis | API and code to convert text into indexable/searchable tokens. |
org.apache.lucene.analysis.ar | Analyzer for Arabic. |
org.apache.lucene.analysis.bg | Analyzer for Bulgarian. |
org.apache.lucene.analysis.br | Analyzer for Brazilian Portuguese. |
org.apache.lucene.analysis.ca | Analyzer for Catalan. |
org.apache.lucene.analysis.cjk | Analyzer for Chinese, Japanese, and Korean, which indexes bigrams (overlapping groups of two adjacent Han characters). |
org.apache.lucene.analysis.cn.smart | Analyzer for Simplified Chinese, which indexes words. |
org.apache.lucene.analysis.compound | A filter that decomposes compound words, found in many Germanic languages, into their word parts. |
org.apache.lucene.analysis.cz | Analyzer for Czech. |
org.apache.lucene.analysis.da | Analyzer for Danish. |
org.apache.lucene.analysis.de | Analyzer for German. |
org.apache.lucene.analysis.el | Analyzer for Greek. |
org.apache.lucene.analysis.en | Analyzer for English. |
org.apache.lucene.analysis.es | Analyzer for Spanish. |
org.apache.lucene.analysis.eu | Analyzer for Basque. |
org.apache.lucene.analysis.fa | Analyzer for Persian. |
org.apache.lucene.analysis.fi | Analyzer for Finnish. |
org.apache.lucene.analysis.fr | Analyzer for French. |
org.apache.lucene.analysis.ga | Analysis for Irish. |
org.apache.lucene.analysis.gl | Analyzer for Galician. |
org.apache.lucene.analysis.hi | Analyzer for Hindi. |
org.apache.lucene.analysis.hu | Analyzer for Hungarian. |
org.apache.lucene.analysis.hunspell | Stemming TokenFilter using a Java implementation of the Hunspell stemming algorithm. |
org.apache.lucene.analysis.hy | Analyzer for Armenian. |
org.apache.lucene.analysis.id | Analyzer for Indonesian. |
org.apache.lucene.analysis.in | Analysis components for Indian languages. |
org.apache.lucene.analysis.it | Analyzer for Italian. |
org.apache.lucene.analysis.ja | Analyzer for Japanese. |
org.apache.lucene.analysis.lv | Analyzer for Latvian. |
org.apache.lucene.analysis.miscellaneous | Miscellaneous TokenStreams. |
org.apache.lucene.analysis.nl | Analyzer for Dutch. |
org.apache.lucene.analysis.no | Analyzer for Norwegian. |
org.apache.lucene.analysis.pl | Analyzer for Polish. |
org.apache.lucene.analysis.pt | Analyzer for Portuguese. |
org.apache.lucene.analysis.query | Automatically filter high-frequency stopwords. |
org.apache.lucene.analysis.reverse | Filter to reverse token text. |
org.apache.lucene.analysis.ro | Analyzer for Romanian. |
org.apache.lucene.analysis.ru | Analyzer for Russian. |
org.apache.lucene.analysis.shingle | Word n-gram filters. |
org.apache.lucene.analysis.snowball | TokenFilter and Analyzer implementations that use Snowball stemmers. |
org.apache.lucene.analysis.standard | Standards-based analyzers implemented with JFlex. |
org.apache.lucene.analysis.sv | Analyzer for Swedish. |
org.apache.lucene.analysis.th | Analyzer for Thai. |
org.apache.lucene.analysis.tr | Analyzer for Turkish. |
org.apache.lucene.index | Code to maintain and access indices. |
org.apache.lucene.queryParser | A simple query parser implemented with JavaCC. |
org.apache.lucene.queryParser.analyzing | QueryParser that passes Fuzzy-, Prefix-, Range-, and WildcardQuerys through the given analyzer. |
org.apache.lucene.queryParser.complexPhrase | QueryParser which permits complex phrase query syntax, e.g. "(john jon jonathan~) peters*". |
org.apache.lucene.queryParser.ext | Extendable QueryParser provides a simple and flexible extension mechanism by overloading query field names. |
org.apache.lucene.util | Some utility classes. |
Modifier and Type | Field and Description |
---|---|
protected Version |
StopwordAnalyzerBase.matchVersion |
Modifier and Type | Method and Description |
---|---|
static <V> CharArrayMap<V> |
CharArrayMap.copy(Version matchVersion,
Map<?,? extends V> map)
Returns a copy of the given map as a
CharArrayMap . |
static CharArraySet |
CharArraySet.copy(Version matchVersion,
Set<?> set)
Returns a copy of the given set as a
CharArraySet . |
static boolean |
StopFilter.getEnablePositionIncrementsVersionDefault(Version matchVersion)
Deprecated.
use
StopFilter.StopFilter(Version, TokenStream, Set) instead |
static CharArraySet |
WordlistLoader.getSnowballWordSet(Reader reader,
Version matchVersion)
Reads stopwords from a stopword list in Snowball format.
|
static CharArraySet |
WordlistLoader.getWordSet(Reader reader,
String comment,
Version matchVersion)
Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
static CharArraySet |
WordlistLoader.getWordSet(Reader reader,
Version matchVersion)
Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
protected static CharArraySet |
StopwordAnalyzerBase.loadStopwordSet(File stopwords,
Version matchVersion)
Creates a CharArraySet from a file.
|
protected static CharArraySet |
StopwordAnalyzerBase.loadStopwordSet(Reader stopwords,
Version matchVersion)
Creates a CharArraySet from a Reader.
|
static Set<Object> |
StopFilter.makeStopSet(Version matchVersion,
List<?> stopWords)
Builds a Set from a list of stop words,
appropriate for passing into the StopFilter constructor.
|
static Set<Object> |
StopFilter.makeStopSet(Version matchVersion,
List<?> stopWords,
boolean ignoreCase)
Creates a stopword set from the given stopword list.
|
static Set<Object> |
StopFilter.makeStopSet(Version matchVersion,
String... stopWords)
Builds a Set from an array of stop words,
appropriate for passing into the StopFilter constructor.
|
static Set<Object> |
StopFilter.makeStopSet(Version matchVersion,
String[] stopWords,
boolean ignoreCase)
Creates a stopword set from the given stopword array.
|
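
The `WordlistLoader.getWordSet` overloads above read one entry per line, trim surrounding whitespace, and (in the three-argument form) leave out comment lines. The following is a minimal plain-Java sketch of that documented behavior, not Lucene's implementation. `WordListSketch` and `readWordSet` are hypothetical names, the result is a `java.util.Set<String>` rather than a `CharArraySet`, and two details are assumptions of the sketch: a comment line is taken to be one that starts with the comment prefix, and blank lines are skipped.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical helper, not part of Lucene: mimics the documented behavior
// of loading a word list, using only java.util collections.
class WordListSketch {

    // Adds every non-comment line to the set, with leading and trailing
    // whitespace removed. Assumptions of this sketch: a comment line starts
    // with the given prefix, and blank lines are skipped.
    static Set<String> readWordSet(Reader reader, String comment) {
        Set<String> words = new LinkedHashSet<>();
        try (BufferedReader br = new BufferedReader(reader)) {
            String line;
            while ((line = br.readLine()) != null) {
                if (comment != null && line.startsWith(comment)) {
                    continue; // comment line
                }
                String word = line.trim();
                if (!word.isEmpty()) {
                    words.add(word);
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return words;
    }

    public static void main(String[] args) {
        String list = "# English stop words\n  the \nand\n\nof\n";
        System.out.println(readWordSet(new StringReader(list), "#"));
    }
}
```

Passing `null` as the comment prefix makes the sketch behave like the two-argument overload, which treats every line as an entry.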
Constructor and Description |
---|
CharArrayMap(Version matchVersion,
int startSize,
boolean ignoreCase)
Creates a map with enough capacity to hold startSize terms.
|
CharArrayMap(Version matchVersion,
Map<?,? extends V> c,
boolean ignoreCase)
Creates a map from the mappings in another map.
|
CharArraySet(Version matchVersion,
Collection<?> c,
boolean ignoreCase)
Creates a set from a Collection of objects.
|
CharArraySet(Version matchVersion,
int startSize,
boolean ignoreCase)
Creates a set with enough capacity to hold startSize terms.
|
CharTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader input)
Creates a new
CharTokenizer instance |
CharTokenizer(Version matchVersion,
AttributeSource source,
Reader input)
Creates a new
CharTokenizer instance |
CharTokenizer(Version matchVersion,
Reader input)
Creates a new
CharTokenizer instance |
LetterTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader in)
Construct a new LetterTokenizer using a given
AttributeSource.AttributeFactory . |
LetterTokenizer(Version matchVersion,
AttributeSource source,
Reader in)
Construct a new LetterTokenizer using a given
AttributeSource . |
LetterTokenizer(Version matchVersion,
Reader in)
Construct a new LetterTokenizer.
|
LowerCaseFilter(Version matchVersion,
TokenStream in)
Create a new LowerCaseFilter, that normalizes token text to lower case.
|
LowerCaseTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader in)
Construct a new LowerCaseTokenizer using a given
AttributeSource.AttributeFactory . |
LowerCaseTokenizer(Version matchVersion,
AttributeSource source,
Reader in)
Construct a new LowerCaseTokenizer using a given
AttributeSource . |
LowerCaseTokenizer(Version matchVersion,
Reader in)
Construct a new LowerCaseTokenizer.
|
SimpleAnalyzer(Version matchVersion)
Creates a new
SimpleAnalyzer |
StopAnalyzer(Version matchVersion)
Builds an analyzer which removes words in
StopAnalyzer.ENGLISH_STOP_WORDS_SET . |
StopAnalyzer(Version matchVersion,
File stopwordsFile)
Builds an analyzer with the stop words from the given file.
|
StopAnalyzer(Version matchVersion,
Reader stopwords)
Builds an analyzer with the stop words from the given reader.
|
StopAnalyzer(Version matchVersion,
Set<?> stopWords)
Builds an analyzer with the stop words from the given set.
|
StopFilter(Version matchVersion,
TokenStream in,
Set<?> stopWords)
Constructs a filter which removes words from the input TokenStream that are
named in the Set.
|
StopFilter(Version matchVersion,
TokenStream input,
Set<?> stopWords,
boolean ignoreCase)
Deprecated.
Use
StopFilter.StopFilter(Version, TokenStream, Set) instead |
StopwordAnalyzerBase(Version version)
Creates a new Analyzer with an empty stopword set
|
StopwordAnalyzerBase(Version version,
Set<?> stopwords)
Creates a new instance initialized with the given stopword set
|
WhitespaceAnalyzer(Version matchVersion)
Creates a new
WhitespaceAnalyzer |
WhitespaceTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader in)
Construct a new WhitespaceTokenizer using a given
AttributeSource.AttributeFactory . |
WhitespaceTokenizer(Version matchVersion,
AttributeSource source,
Reader in)
Construct a new WhitespaceTokenizer using a given
AttributeSource . |
WhitespaceTokenizer(Version matchVersion,
Reader in)
Construct a new WhitespaceTokenizer; matchVersion is the Lucene version to match.
|
Constructor and Description |
---|
ArabicAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
ArabicAnalyzer.DEFAULT_STOPWORD_FILE . |
ArabicAnalyzer(Version matchVersion,
File stopwords)
Deprecated.
use
ArabicAnalyzer.ArabicAnalyzer(Version, Set) instead |
ArabicAnalyzer(Version matchVersion,
Hashtable<?,?> stopwords)
Deprecated.
use
ArabicAnalyzer.ArabicAnalyzer(Version, Set) instead |
ArabicAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words
|
ArabicAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
ArabicAnalyzer(Version matchVersion,
String... stopwords)
Deprecated.
use
ArabicAnalyzer.ArabicAnalyzer(Version, Set) instead |
ArabicLetterTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader in)
Deprecated.
Construct a new ArabicLetterTokenizer using a given
AttributeSource.AttributeFactory ; matchVersion is the Lucene version to match. |
ArabicLetterTokenizer(Version matchVersion,
AttributeSource source,
Reader in)
Deprecated.
Construct a new ArabicLetterTokenizer using a given
AttributeSource . |
ArabicLetterTokenizer(Version matchVersion,
Reader in)
Deprecated.
Construct a new ArabicLetterTokenizer.
|
Constructor and Description |
---|
BulgarianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
BulgarianAnalyzer.DEFAULT_STOPWORD_FILE . |
BulgarianAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
BulgarianAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Constructor and Description |
---|
BrazilianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words (
BrazilianAnalyzer.getDefaultStopSet() ). |
BrazilianAnalyzer(Version matchVersion,
File stopwords)
Deprecated.
|
BrazilianAnalyzer(Version matchVersion,
Map<?,?> stopwords)
Deprecated.
|
BrazilianAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words
|
BrazilianAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words and stemming exclusion words
|
BrazilianAnalyzer(Version matchVersion,
String... stopwords)
Deprecated.
|
Constructor and Description |
---|
CatalanAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
CatalanAnalyzer.DEFAULT_STOPWORD_FILE . |
CatalanAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
CatalanAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words.
|
Constructor and Description |
---|
CJKAnalyzer(Version matchVersion)
Builds an analyzer which removes words in
CJKAnalyzer.getDefaultStopSet() . |
CJKAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words
|
CJKAnalyzer(Version matchVersion,
String... stopWords)
Deprecated.
use
CJKAnalyzer.CJKAnalyzer(Version, Set) instead |
Constructor and Description |
---|
SmartChineseAnalyzer(Version matchVersion)
Create a new SmartChineseAnalyzer, using the default stopword list.
|
SmartChineseAnalyzer(Version matchVersion,
boolean useDefaultStopWords)
Create a new SmartChineseAnalyzer, optionally using the default stopword list.
|
SmartChineseAnalyzer(Version matchVersion,
Set stopWords)
Create a new SmartChineseAnalyzer, using the provided
Set of stopwords. |
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
CompoundWordTokenFilterBase.makeDictionary(Version matchVersion,
String[] dictionary)
Deprecated.
Only available for backwards compatibility.
|
Constructor and Description |
---|
CompoundWordTokenFilterBase(Version matchVersion,
TokenStream input,
Set<?> dictionary) |
CompoundWordTokenFilterBase(Version matchVersion,
TokenStream input,
Set<?> dictionary,
boolean onlyLongestMatch) |
CompoundWordTokenFilterBase(Version matchVersion,
TokenStream input,
Set<?> dictionary,
int minWordSize,
int minSubwordSize,
int maxSubwordSize,
boolean onlyLongestMatch) |
CompoundWordTokenFilterBase(Version matchVersion,
TokenStream input,
String[] dictionary) |
CompoundWordTokenFilterBase(Version matchVersion,
TokenStream input,
String[] dictionary,
boolean onlyLongestMatch) |
CompoundWordTokenFilterBase(Version matchVersion,
TokenStream input,
String[] dictionary,
int minWordSize,
int minSubwordSize,
int maxSubwordSize,
boolean onlyLongestMatch) |
DictionaryCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
Set<?> dictionary)
Creates a new
DictionaryCompoundWordTokenFilter |
DictionaryCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
Set<?> dictionary,
int minWordSize,
int minSubwordSize,
int maxSubwordSize,
boolean onlyLongestMatch)
Creates a new
DictionaryCompoundWordTokenFilter |
DictionaryCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
String[] dictionary)
Deprecated.
Use the constructors taking
Set |
DictionaryCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
String[] dictionary,
int minWordSize,
int minSubwordSize,
int maxSubwordSize,
boolean onlyLongestMatch)
Deprecated.
Use the constructors taking
Set |
HyphenationCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
HyphenationTree hyphenator)
Create a HyphenationCompoundWordTokenFilter with no dictionary.
|
HyphenationCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
HyphenationTree hyphenator,
int minWordSize,
int minSubwordSize,
int maxSubwordSize)
Create a HyphenationCompoundWordTokenFilter with no dictionary.
|
HyphenationCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
HyphenationTree hyphenator,
Set<?> dictionary)
Creates a new
HyphenationCompoundWordTokenFilter instance. |
HyphenationCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
HyphenationTree hyphenator,
Set<?> dictionary,
int minWordSize,
int minSubwordSize,
int maxSubwordSize,
boolean onlyLongestMatch)
Creates a new
HyphenationCompoundWordTokenFilter instance. |
HyphenationCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
HyphenationTree hyphenator,
String[] dictionary)
Deprecated.
Use the constructors taking
Set |
HyphenationCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
HyphenationTree hyphenator,
String[] dictionary,
int minWordSize,
int minSubwordSize,
int maxSubwordSize,
boolean onlyLongestMatch)
Deprecated.
Use the constructors taking
Set |
Constructor and Description |
---|
CzechAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words (
CzechAnalyzer.getDefaultStopSet() ). |
CzechAnalyzer(Version matchVersion,
File stopwords)
Deprecated.
use
CzechAnalyzer.CzechAnalyzer(Version, Set) instead |
CzechAnalyzer(Version matchVersion,
HashSet<?> stopwords)
Deprecated.
use
CzechAnalyzer.CzechAnalyzer(Version, Set) instead |
CzechAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
CzechAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionTable)
Builds an analyzer with the given stop words and a set of words to be
excluded from the
CzechStemFilter . |
CzechAnalyzer(Version matchVersion,
String... stopwords)
Deprecated.
use
CzechAnalyzer.CzechAnalyzer(Version, Set) instead |
Constructor and Description |
---|
DanishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
DanishAnalyzer.DEFAULT_STOPWORD_FILE . |
DanishAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
DanishAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words.
|
Constructor and Description |
---|
GermanAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
GermanAnalyzer.getDefaultStopSet() . |
GermanAnalyzer(Version matchVersion,
File stopwords)
Deprecated.
|
GermanAnalyzer(Version matchVersion,
Map<?,?> stopwords)
Deprecated.
|
GermanAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words
|
GermanAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words
|
GermanAnalyzer(Version matchVersion,
String... stopwords)
Deprecated.
|
Constructor and Description |
---|
GreekAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words.
|
GreekAnalyzer(Version matchVersion,
Map<?,?> stopwords)
Deprecated.
use
GreekAnalyzer.GreekAnalyzer(Version, Set) instead |
GreekAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
GreekAnalyzer(Version matchVersion,
String... stopwords)
Deprecated.
use
GreekAnalyzer.GreekAnalyzer(Version, Set) instead |
GreekLowerCaseFilter(Version matchVersion,
TokenStream in)
Create a GreekLowerCaseFilter that normalizes Greek token text.
|
Constructor and Description |
---|
EnglishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
EnglishAnalyzer.getDefaultStopSet() . |
EnglishAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
EnglishAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words.
|
EnglishPossessiveFilter(Version version,
TokenStream input) |
Constructor and Description |
---|
SpanishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
SpanishAnalyzer.DEFAULT_STOPWORD_FILE . |
SpanishAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
SpanishAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words.
|
Constructor and Description |
---|
BasqueAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
BasqueAnalyzer.DEFAULT_STOPWORD_FILE . |
BasqueAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
BasqueAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words.
|
Constructor and Description |
---|
PersianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
PersianAnalyzer.DEFAULT_STOPWORD_FILE . |
PersianAnalyzer(Version matchVersion,
File stopwords)
Deprecated.
use
PersianAnalyzer.PersianAnalyzer(Version, Set) instead |
PersianAnalyzer(Version matchVersion,
Hashtable<?,?> stopwords)
Deprecated.
use
PersianAnalyzer.PersianAnalyzer(Version, Set) instead |
PersianAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words
|
PersianAnalyzer(Version matchVersion,
String... stopwords)
Deprecated.
use
PersianAnalyzer.PersianAnalyzer(Version, Set) instead |
Constructor and Description |
---|
FinnishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
FinnishAnalyzer.DEFAULT_STOPWORD_FILE . |
FinnishAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
FinnishAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Method and Description |
---|---|
void |
ElisionFilter.setArticles(Version matchVersion,
Set<?> articles)
Deprecated.
|
Constructor and Description |
---|
ElisionFilter(Version matchVersion,
TokenStream input)
Constructs an elision filter with standard stop words
|
ElisionFilter(Version matchVersion,
TokenStream input,
Set<?> articles)
Constructs an elision filter with a Set of stop words
|
FrenchAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words (
FrenchAnalyzer.getDefaultStopSet() ). |
FrenchAnalyzer(Version matchVersion,
File stopwords)
Deprecated.
use
FrenchAnalyzer.FrenchAnalyzer(Version, Set) instead |
FrenchAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words
|
FrenchAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclutionSet)
Builds an analyzer with the given stop words
|
FrenchAnalyzer(Version matchVersion,
String... stopwords)
Deprecated.
use
FrenchAnalyzer.FrenchAnalyzer(Version, Set) instead |
Constructor and Description |
---|
IrishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
IrishAnalyzer.DEFAULT_STOPWORD_FILE . |
IrishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
IrishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words.
|
Constructor and Description |
---|
GalicianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
GalicianAnalyzer.DEFAULT_STOPWORD_FILE . |
GalicianAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
GalicianAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words.
|
Constructor and Description |
---|
HindiAnalyzer(Version version)
Builds an analyzer with the default stop words:
HindiAnalyzer.DEFAULT_STOPWORD_FILE . |
HindiAnalyzer(Version version,
Set<?> stopwords)
Builds an analyzer with the given stop words
|
HindiAnalyzer(Version version,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words
|
Constructor and Description |
---|
HungarianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
HungarianAnalyzer.DEFAULT_STOPWORD_FILE . |
HungarianAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
HungarianAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words.
|
Modifier and Type | Method and Description |
---|---|
Version |
HunspellDictionary.getVersion() |
Constructor and Description |
---|
HunspellDictionary(InputStream affix,
InputStream dictionary,
Version version)
Creates a new HunspellDictionary containing the information read from the provided InputStreams to hunspell affix
and dictionary files.
|
HunspellDictionary(InputStream affix,
InputStream dictionary,
Version version,
boolean ignoreCase)
Creates a new HunspellDictionary containing the information read from the provided InputStreams to hunspell affix
and dictionary files.
|
HunspellDictionary(InputStream affix,
List<InputStream> dictionaries,
Version version,
boolean ignoreCase)
Creates a new HunspellDictionary containing the information read from the provided InputStreams to hunspell affix
and dictionary files.
|
Constructor and Description |
---|
ArmenianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
ArmenianAnalyzer.DEFAULT_STOPWORD_FILE . |
ArmenianAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
ArmenianAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words.
|
Constructor and Description |
---|
IndonesianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
IndonesianAnalyzer.DEFAULT_STOPWORD_FILE . |
IndonesianAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words
|
IndonesianAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
IndicTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader input)
Deprecated.
|
IndicTokenizer(Version matchVersion,
AttributeSource source,
Reader input)
Deprecated.
|
IndicTokenizer(Version matchVersion,
Reader input)
Deprecated.
|
Constructor and Description |
---|
ItalianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
ItalianAnalyzer.DEFAULT_STOPWORD_FILE . |
ItalianAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
ItalianAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words.
|
Constructor and Description |
---|
JapaneseAnalyzer(Version matchVersion) |
JapaneseAnalyzer(Version matchVersion,
UserDictionary userDict,
JapaneseTokenizer.Mode mode,
CharArraySet stopwords,
Set<String> stoptags) |
Constructor and Description |
---|
LatvianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
LatvianAnalyzer.DEFAULT_STOPWORD_FILE . |
LatvianAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
LatvianAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words.
|
Constructor and Description |
---|
PatternAnalyzer(Version matchVersion,
Pattern pattern,
boolean toLowerCase,
Set<?> stopWords)
Constructs a new instance with the given parameters.
|
StemmerOverrideFilter(Version matchVersion,
TokenStream input,
Map<?,String> dictionary)
Create a new StemmerOverrideFilter, performing dictionary-based stemming
with the provided
dictionary . |
Constructor and Description |
---|
DutchAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words (
DutchAnalyzer.getDefaultStopSet() )
and a few default entries for the stem exclusion table. |
DutchAnalyzer(Version matchVersion,
File stopwords)
Deprecated.
use
DutchAnalyzer.DutchAnalyzer(Version, Set) instead |
DutchAnalyzer(Version matchVersion,
HashSet<?> stopwords)
Deprecated.
use
DutchAnalyzer.DutchAnalyzer(Version, Set) instead |
DutchAnalyzer(Version matchVersion,
Set<?> stopwords) |
DutchAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionTable) |
DutchAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionTable,
CharArrayMap<String> stemOverrideDict) |
DutchAnalyzer(Version matchVersion,
String... stopwords)
Deprecated.
use
DutchAnalyzer.DutchAnalyzer(Version, Set) instead |
Constructor and Description |
---|
NorwegianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
NorwegianAnalyzer.DEFAULT_STOPWORD_FILE . |
NorwegianAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
NorwegianAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words.
|
Constructor and Description |
---|
PolishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
PolishAnalyzer.DEFAULT_STOPWORD_FILE . |
PolishAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
PolishAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words.
|
Constructor and Description |
---|
PortugueseAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
PortugueseAnalyzer.DEFAULT_STOPWORD_FILE . |
PortugueseAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
PortugueseAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words.
|
Constructor and Description |
---|
QueryAutoStopWordAnalyzer(Version matchVersion,
Analyzer delegate)
Deprecated.
Stopwords should be calculated at instantiation using one of the other constructors
|
QueryAutoStopWordAnalyzer(Version matchVersion,
Analyzer delegate,
IndexReader indexReader)
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for all
indexed fields from terms with a document frequency percentage greater than
QueryAutoStopWordAnalyzer.defaultMaxDocFreqPercent |
QueryAutoStopWordAnalyzer(Version matchVersion,
Analyzer delegate,
IndexReader indexReader,
Collection<String> fields,
float maxPercentDocs)
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for the
given selection of fields from terms with a document frequency percentage
greater than the given maxPercentDocs
|
QueryAutoStopWordAnalyzer(Version matchVersion,
Analyzer delegate,
IndexReader indexReader,
Collection<String> fields,
int maxDocFreq)
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for the
given selection of fields from terms with a document frequency greater than
the given maxDocFreq
|
QueryAutoStopWordAnalyzer(Version matchVersion,
Analyzer delegate,
IndexReader indexReader,
float maxPercentDocs)
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for all
indexed fields from terms with a document frequency percentage greater than
the given maxPercentDocs
|
QueryAutoStopWordAnalyzer(Version matchVersion,
Analyzer delegate,
IndexReader indexReader,
int maxDocFreq)
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for all
indexed fields from terms with a document frequency greater than the given
maxDocFreq
|
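
The `QueryAutoStopWordAnalyzer` constructors above derive stop words from document frequency, either as an absolute count (`maxDocFreq`) or as a share of all documents (`maxPercentDocs`). The selection rule can be sketched in plain Java as follows. This is an illustration only, not Lucene's code: `DocFreqStopWordsSketch` is a hypothetical class, the term statistics come from a plain `Map` instead of an `IndexReader`, and `maxPercentDocs` is assumed to be a fraction between 0.0 and 1.0.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration, not part of Lucene: picks stop words from
// per-term document frequencies held in a plain map.
class DocFreqStopWordsSketch {

    // Returns the terms whose document frequency is greater than maxDocFreq.
    static Set<String> stopWords(Map<String, Integer> docFreqs, int maxDocFreq) {
        Set<String> result = new TreeSet<>();
        for (Map.Entry<String, Integer> e : docFreqs.entrySet()) {
            if (e.getValue() > maxDocFreq) {
                result.add(e.getKey());
            }
        }
        return result;
    }

    // Percentage variant. Assumption: maxPercentDocs is a fraction in
    // [0.0, 1.0], converted to an absolute threshold by truncation.
    static Set<String> stopWords(Map<String, Integer> docFreqs, int numDocs, float maxPercentDocs) {
        return stopWords(docFreqs, (int) (numDocs * maxPercentDocs));
    }

    public static void main(String[] args) {
        Map<String, Integer> docFreqs = new HashMap<>();
        docFreqs.put("the", 95);
        docFreqs.put("of", 60);
        docFreqs.put("lucene", 12);
        // With 100 documents and a 0.5 threshold, terms in more than 50 documents qualify.
        System.out.println(stopWords(docFreqs, 100, 0.5f));
    }
}
```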
Modifier and Type | Method and Description |
---|---|
static void |
ReverseStringFilter.reverse(Version matchVersion,
char[] buffer)
Reverses the given input buffer in-place
|
static void |
ReverseStringFilter.reverse(Version matchVersion,
char[] buffer,
int len)
Partially reverses the given input buffer in-place from offset 0
up to the given length.
|
static void |
ReverseStringFilter.reverse(Version matchVersion,
char[] buffer,
int start,
int len)
Partially reverses the given input buffer in-place from the given offset
up to the given length.
|
static String |
ReverseStringFilter.reverse(Version matchVersion,
String input)
Reverses the given input string
|
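
The `reverse` overloads above differ only in which part of the buffer they touch: the whole buffer, the first `len` characters, or `len` characters starting at `start`. A minimal plain-Java sketch of that in-place reversal is shown below. `ReverseSketch` is a hypothetical class, it takes no `Version` argument, and it swaps individual `char` values without any special handling of surrogate pairs, which is a simplification of this sketch.

```java
// Hypothetical illustration, not part of Lucene: in-place reversal of a
// char buffer, mirroring the shape of the overloads listed above.
class ReverseSketch {

    // Reverses len chars of buffer in place, beginning at index start.
    // Simplification: chars are swapped one by one, so surrogate pairs
    // are not kept in order.
    static void reverse(char[] buffer, int start, int len) {
        int i = start;
        int j = start + len - 1;
        while (i < j) {
            char tmp = buffer[i];
            buffer[i] = buffer[j];
            buffer[j] = tmp;
            i++;
            j--;
        }
    }

    // Reverses the first len chars of buffer in place.
    static void reverse(char[] buffer, int len) {
        reverse(buffer, 0, len);
    }

    // Returns a reversed copy of the input string.
    static String reverse(String input) {
        char[] chars = input.toCharArray();
        reverse(chars, 0, chars.length);
        return new String(chars);
    }

    public static void main(String[] args) {
        char[] buffer = "abcdef".toCharArray();
        reverse(buffer, 1, 3); // only "bcd" is reversed
        System.out.println(new String(buffer));
        System.out.println(reverse("lucene"));
    }
}
```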
Constructor and Description |
---|
ReverseStringFilter(Version matchVersion,
TokenStream in)
Create a new ReverseStringFilter that reverses all tokens in the
supplied
TokenStream . |
ReverseStringFilter(Version matchVersion,
TokenStream in,
char marker)
Create a new ReverseStringFilter that reverses and marks all tokens in the
supplied
TokenStream . |
Constructor and Description |
---|
RomanianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
RomanianAnalyzer.DEFAULT_STOPWORD_FILE . |
RomanianAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
RomanianAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words.
|
Constructor and Description |
---|
RussianAnalyzer(Version matchVersion) |
RussianAnalyzer(Version matchVersion,
Map<?,?> stopwords)
Deprecated.
use
RussianAnalyzer.RussianAnalyzer(Version, Set) instead |
RussianAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words
|
RussianAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words and a set of terms excluded from stemming.
|
RussianAnalyzer(Version matchVersion,
String... stopwords)
Deprecated.
Use
RussianAnalyzer.RussianAnalyzer(Version, Set) instead. |
RussianLetterTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader in)
Deprecated.
Construct a new RussianLetterTokenizer using a given
AttributeSource.AttributeFactory . The matchVersion argument is the
Lucene version to match. |
RussianLetterTokenizer(Version matchVersion,
AttributeSource source,
Reader in)
Deprecated.
Construct a new RussianLetterTokenizer using a given
AttributeSource . |
RussianLetterTokenizer(Version matchVersion,
Reader in)
Deprecated.
Construct a new RussianLetterTokenizer. The matchVersion argument is the
Lucene version to match.
|
Constructor and Description |
---|
ShingleAnalyzerWrapper(Version matchVersion)
Wraps
StandardAnalyzer . |
ShingleAnalyzerWrapper(Version matchVersion,
int minShingleSize,
int maxShingleSize)
Wraps
StandardAnalyzer . |
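The minShingleSize and maxShingleSize parameters bound the size of the word n-grams ("shingles") that are built from adjacent tokens. The following is a rough sketch of that idea only: ShingleSketch is a hypothetical class, it works on an already tokenized list rather than a TokenStream, and it does not emit the single tokens that the real filter can also output.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical illustration of shingle (word n-gram) generation; not Lucene code. */
final class ShingleSketch {

    /** Joins runs of minSize..maxSize adjacent tokens with a single space. */
    static List<String> shingles(List<String> tokens, int minSize, int maxSize) {
        List<String> out = new ArrayList<String>();
        for (int start = 0; start < tokens.size(); start++) {
            for (int size = minSize; size <= maxSize && start + size <= tokens.size(); size++) {
                StringBuilder shingle = new StringBuilder();
                for (int k = 0; k < size; k++) {
                    if (k > 0) {
                        shingle.append(' ');
                    }
                    shingle.append(tokens.get(start + k));
                }
                out.add(shingle.toString());
            }
        }
        return out;
    }
}
```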
Constructor and Description |
---|
SnowballAnalyzer(Version matchVersion,
String name)
Deprecated.
Builds the named analyzer with no stop words.
|
SnowballAnalyzer(Version matchVersion,
String name,
Set<?> stopWords)
Deprecated.
Builds the named analyzer with the given stop words.
|
SnowballAnalyzer(Version matchVersion,
String name,
String[] stopWords)
Deprecated.
|
Constructor and Description |
---|
ClassicAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words (
ClassicAnalyzer.STOP_WORDS_SET ). |
ClassicAnalyzer(Version matchVersion,
File stopwords)
Deprecated.
Use
ClassicAnalyzer.ClassicAnalyzer(Version, Reader) instead. |
ClassicAnalyzer(Version matchVersion,
Reader stopwords)
Builds an analyzer with the stop words from the given reader.
|
ClassicAnalyzer(Version matchVersion,
Set<?> stopWords)
Builds an analyzer with the given stop words.
|
ClassicTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader input)
Creates a new ClassicTokenizer with a given
AttributeSource.AttributeFactory |
ClassicTokenizer(Version matchVersion,
AttributeSource source,
Reader input)
Creates a new ClassicTokenizer with a given
AttributeSource . |
ClassicTokenizer(Version matchVersion,
Reader input)
Creates a new instance of the
ClassicTokenizer . |
StandardAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words (
StandardAnalyzer.STOP_WORDS_SET ). |
StandardAnalyzer(Version matchVersion,
File stopwords)
Deprecated.
Use
StandardAnalyzer.StandardAnalyzer(Version, Reader) instead. |
StandardAnalyzer(Version matchVersion,
Reader stopwords)
Builds an analyzer with the stop words from the given reader.
|
StandardAnalyzer(Version matchVersion,
Set<?> stopWords)
Builds an analyzer with the given stop words.
|
StandardFilter(Version matchVersion,
TokenStream in) |
StandardTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader input)
Creates a new StandardTokenizer with a given
AttributeSource.AttributeFactory |
StandardTokenizer(Version matchVersion,
AttributeSource source,
Reader input)
Creates a new StandardTokenizer with a given
AttributeSource . |
StandardTokenizer(Version matchVersion,
Reader input)
Creates a new instance of the
StandardTokenizer . |
UAX29URLEmailAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words (
UAX29URLEmailAnalyzer.STOP_WORDS_SET ). |
UAX29URLEmailAnalyzer(Version matchVersion,
Reader stopwords)
Builds an analyzer with the stop words from the given reader.
|
UAX29URLEmailAnalyzer(Version matchVersion,
Set<?> stopWords)
Builds an analyzer with the given stop words.
|
UAX29URLEmailTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader input)
Creates a new UAX29URLEmailTokenizer with a given
AttributeSource.AttributeFactory |
UAX29URLEmailTokenizer(Version matchVersion,
AttributeSource source,
Reader input)
Creates a new UAX29URLEmailTokenizer with a given
AttributeSource . |
UAX29URLEmailTokenizer(Version matchVersion,
Reader input)
Creates a new instance of the UAX29URLEmailTokenizer.
|
Constructor and Description |
---|
SwedishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
SwedishAnalyzer.DEFAULT_STOPWORD_FILE . |
SwedishAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
SwedishAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words and a set of terms excluded from stemming.
|
Constructor and Description |
---|
ThaiAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words.
|
ThaiAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
ThaiWordFilter(Version matchVersion,
TokenStream input)
Creates a new ThaiWordFilter with the specified match version.
|
Constructor and Description |
---|
TurkishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
TurkishAnalyzer.DEFAULT_STOPWORD_FILE . |
TurkishAnalyzer(Version matchVersion,
Set<?> stopwords)
Builds an analyzer with the given stop words.
|
TurkishAnalyzer(Version matchVersion,
Set<?> stopwords,
Set<?> stemExclusionSet)
Builds an analyzer with the given stop words and a set of terms excluded from stemming.
|
Modifier and Type | Method and Description |
---|---|
void |
MultiPassIndexSplitter.split(Version version,
IndexReader input,
Directory[] outputs,
boolean seq)
Split source index into multiple parts.
|
Constructor and Description |
---|
IndexUpgrader(Directory dir,
Version matchVersion)
Creates an index upgrader on the given directory, using an
IndexWriter configured with the given
matchVersion . |
IndexUpgrader(Directory dir,
Version matchVersion,
PrintStream infoStream,
boolean deletePriorCommits)
Creates an index upgrader on the given directory, using an
IndexWriter configured with the given
matchVersion . |
IndexWriterConfig(Version matchVersion,
Analyzer analyzer)
|
PersistentSnapshotDeletionPolicy(IndexDeletionPolicy primary,
Directory dir,
IndexWriterConfig.OpenMode mode,
Version matchVersion)
PersistentSnapshotDeletionPolicy wraps another
IndexDeletionPolicy to enable flexible snapshotting. |
PKIndexSplitter(Version version,
Directory input,
Directory dir1,
Directory dir2,
Filter docsInFirstIndex)
Split an index based on a
Filter . |
PKIndexSplitter(Version version,
Directory input,
Directory dir1,
Directory dir2,
Term midTerm)
Split an index based on a given primary key term
and a 'middle' term.
|
RandomIndexWriter(Random r,
Directory dir,
Version v,
Analyzer a)
Creates a RandomIndexWriter with a random config.
|
Modifier and Type | Method and Description |
---|---|
static Query |
MultiFieldQueryParser.parse(Version matchVersion,
String[] queries,
String[] fields,
Analyzer analyzer)
Parses a query which searches on the fields specified.
|
static Query |
MultiFieldQueryParser.parse(Version matchVersion,
String[] queries,
String[] fields,
BooleanClause.Occur[] flags,
Analyzer analyzer)
Parses a query, searching on the fields specified.
|
static Query |
MultiFieldQueryParser.parse(Version matchVersion,
String query,
String[] fields,
BooleanClause.Occur[] flags,
Analyzer analyzer)
Parses a query, searching on the fields specified.
|
Constructor and Description |
---|
MultiFieldQueryParser(Version matchVersion,
String[] fields,
Analyzer analyzer)
Creates a MultiFieldQueryParser.
|
MultiFieldQueryParser(Version matchVersion,
String[] fields,
Analyzer analyzer,
Map<String,Float> boosts)
Creates a MultiFieldQueryParser.
|
QueryParser(Version matchVersion,
String f,
Analyzer a)
Constructs a query parser.
|
Constructor and Description |
---|
AnalyzingQueryParser(Version matchVersion,
String field,
Analyzer analyzer)
Constructs a query parser.
|
Constructor and Description |
---|
ComplexPhraseQueryParser(Version matchVersion,
String f,
Analyzer a) |
Constructor and Description |
---|
ExtendableQueryParser(Version matchVersion,
String f,
Analyzer a)
Creates a new
ExtendableQueryParser instance |
ExtendableQueryParser(Version matchVersion,
String f,
Analyzer a,
Extensions ext)
Creates a new
ExtendableQueryParser instance |
Modifier and Type | Field and Description |
---|---|
static Version |
LuceneTestCase.TEST_VERSION_CURRENT
Use this constant when creating Analyzers and any other version-dependent components.
|
Modifier and Type | Method and Description |
---|---|
static Version |
Version.valueOf(String name)
Returns the enum constant of this type with the specified name.
|
static Version[] |
Version.values()
Returns an array containing the constants of this enum type, in
the order they are declared.
|
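Both methods are the standard ones the Java compiler generates for every enum. As a small sketch of how they behave, the example below uses a hypothetical SketchVersion enum; its constant names are illustrative and are not Lucene's actual Version constants.

```java
/** Hypothetical illustration of enum lookup by name; not Lucene code. */
final class VersionLookupSketch {

    /** Stand-in for a version enum; the constant names are made up for this example. */
    enum SketchVersion { V_1, V_2, V_3 }

    /**
     * Looks up a constant by its exact name. Enum valueOf throws
     * IllegalArgumentException for an unknown name, so this helper maps
     * that case to null. A null name still throws NullPointerException.
     */
    static SketchVersion parseOrNull(String name) {
        try {
            return SketchVersion.valueOf(name);
        } catch (IllegalArgumentException unknownName) {
            return null;
        }
    }

    /** Returns the most recently declared constant, relying on values() order. */
    static SketchVersion latest() {
        SketchVersion[] all = SketchVersion.values();
        return all[all.length - 1];
    }
}
```

Because values() returns the constants in declaration order, the last element is the newest version only if constants are declared oldest first, which is an assumption of this sketch.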
Modifier and Type | Method and Description |
---|---|
static CharacterUtils |
CharacterUtils.getInstance(Version matchVersion)
Returns a
CharacterUtils implementation according to the given
Version instance. |
static IndexWriterConfig |
LuceneTestCase.newIndexWriterConfig(Random r,
Version v,
Analyzer a)
Creates a new index writer config with random defaults, using the specified Random instance.
|
static IndexWriterConfig |
LuceneTestCase.newIndexWriterConfig(Version v,
Analyzer a)
Creates a new index writer config with random defaults.
|
boolean |
Version.onOrAfter(Version other) |
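Version.onOrAfter has no description in the table above. A plausible reading, sketched below with a hypothetical enum, is that it compares declaration order so that callers can gate behavior on the requested compatibility version. The SketchVersion constants and the normalize method are assumptions made for this illustration and do not describe any real Lucene component.

```java
import java.util.Locale;

/** Hypothetical illustration of version-gated behavior; not Lucene code. */
final class OnOrAfterSketch {

    /** Stand-in version enum, declared oldest first. */
    enum SketchVersion {
        V_1, V_2, V_3;

        /** True if this version is the same as, or declared after, the other one. */
        boolean onOrAfter(SketchVersion other) {
            return compareTo(other) >= 0;
        }
    }

    /**
     * Made-up component behavior: lowercasing was "introduced" in V_2, so
     * callers that ask for V_1 compatibility keep the old pass-through result.
     */
    static String normalize(SketchVersion matchVersion, String term) {
        if (matchVersion.onOrAfter(SketchVersion.V_2)) {
            return term.toLowerCase(Locale.ROOT);
        }
        return term;
    }
}
```

This pattern is why so many constructors on this page take a matchVersion argument: the caller states which release's behavior it expects, and the component branches on that value.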