A
- AA - org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizer.Foldings
- AbstractEncoder - Class in org.apache.lucene.analysis.payloads
-
Base class for payload encoders.
- AbstractEncoder() - Constructor for class org.apache.lucene.analysis.payloads.AbstractEncoder
- AbstractWordsFileFilterFactory - Class in org.apache.lucene.analysis.en
-
Abstract parent class for analysis factories that accept a stopwords file as input.
- AbstractWordsFileFilterFactory() - Constructor for class org.apache.lucene.analysis.en.AbstractWordsFileFilterFactory
-
Default ctor for compatibility with SPI
- AbstractWordsFileFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.en.AbstractWordsFileFilterFactory
-
Initialize this factory via a set of key-value pairs.
- accept() - Method in class org.apache.lucene.analysis.core.TypeTokenFilter
-
By default accept the token if its type is not a stop type.
- accept() - Method in class org.apache.lucene.analysis.miscellaneous.CodepointCountFilter
- accept() - Method in class org.apache.lucene.analysis.miscellaneous.DateRecognizerFilter
- accept() - Method in class org.apache.lucene.analysis.miscellaneous.DropIfFlaggedFilter
- accept() - Method in class org.apache.lucene.analysis.miscellaneous.KeepWordFilter
- accept() - Method in class org.apache.lucene.analysis.miscellaneous.LengthFilter
- ACRONYM - Static variable in class org.apache.lucene.analysis.classic.ClassicTokenizer
- ACRONYM_DEP - Static variable in class org.apache.lucene.analysis.classic.ClassicTokenizer
- ACRONYM_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- add(CharSequence, CharSequence) - Method in class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter.Builder
-
Adds an input string and its stemmer override output to this builder.
- add(String, String) - Method in class org.apache.lucene.analysis.charfilter.NormalizeCharMap.Builder
-
Records a replacement to be applied to the input stream.
- add(CharsRef, CharsRef, boolean) - Method in class org.apache.lucene.analysis.synonym.SynonymMap.Builder
-
Add a phrase->phrase synonym mapping.
- addCharFilter(Class<? extends CharFilterFactory>, String...) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Adds the given char filter.
- addCharFilter(Class<? extends CharFilterFactory>, Map<String, String>) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Adds the given char filter.
- addCharFilter(String, String...) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Adds the given char filter.
- addCharFilter(String, Map<String, String>) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Adds the given char filter.
- addClass(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Add a character class to the tree.
- addClass(String) - Method in interface org.apache.lucene.analysis.compound.hyphenation.PatternConsumer
-
Add a character class.
- addException(String, ArrayList<Object>) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Add an exception to the tree.
- addException(String, ArrayList<Object>) - Method in interface org.apache.lucene.analysis.compound.hyphenation.PatternConsumer
-
Add a hyphenation exception.
- addOffCorrectMap(int, int) - Method in class org.apache.lucene.analysis.charfilter.BaseCharFilter
-
Adds an offset correction mapping at the given output stream offset.
- addPattern(String, String) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Add a pattern to the tree.
- addPattern(String, String) - Method in interface org.apache.lucene.analysis.compound.hyphenation.PatternConsumer
-
Add hyphenation patterns.
- addTermAndVector(TermAndVector) - Method in class org.apache.lucene.analysis.synonym.word2vec.Word2VecModel
- addTokenFilter(Class<? extends TokenFilterFactory>, String...) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Adds the given token filter.
- addTokenFilter(Class<? extends TokenFilterFactory>, String...) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.ConditionBuilder
-
Adds the given token filter.
- addTokenFilter(Class<? extends TokenFilterFactory>, Map<String, String>) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Adds the given token filter.
- addTokenFilter(Class<? extends TokenFilterFactory>, Map<String, String>) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.ConditionBuilder
-
Adds the given token filter.
- addTokenFilter(String, String...) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Adds the given token filter.
- addTokenFilter(String, String...) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.ConditionBuilder
-
Adds the given token filter.
- addTokenFilter(String, Map<String, String>) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Adds the given token filter.
- addTokenFilter(String, Map<String, String>) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.ConditionBuilder
-
Adds the given token filter.
- AE - org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizer.Foldings
- AffixedWord - Class in org.apache.lucene.analysis.hunspell
-
An object representing the analysis result of a simple (non-compound) word
- AffixedWord.Affix - Class in org.apache.lucene.analysis.hunspell
-
An object representing a prefix or a suffix applied to a word stem
- ALEF - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- ALEF - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- ALEF_HAMZA_ABOVE - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- ALEF_HAMZA_BELOW - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- ALEF_MADDA - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- ALL_FOLDINGS - Static variable in class org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizer
- alloc(int) - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
-
This is to implement memory allocation in the array.
- alloc(int) - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
- ALPHA - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated.
- ALPHA - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterIterator
- ALPHANUM - Static variable in class org.apache.lucene.analysis.classic.ClassicTokenizer
- ALPHANUM - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
-
Alpha/numeric token type
- ALPHANUM - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated.
- ALPHANUM - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterIterator
- ALPHANUM_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Among - Class in org.tartarus.snowball
-
Internal class used by Snowball stemmers
- Among(String, int, int) - Constructor for class org.tartarus.snowball.Among
- Among(String, int, int, String, MethodHandles.Lookup) - Constructor for class org.tartarus.snowball.Among
- analyze(String, CharsRefBuilder) - Method in class org.apache.lucene.analysis.synonym.SynonymMap.Parser
-
Sugar: analyzes the text with the analyzer and separates by SynonymMap.WORD_SEPARATOR.
- analyzeSimpleWord(String) - Method in class org.apache.lucene.analysis.hunspell.Hunspell
- AO - org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizer.Foldings
- APOSTROPHE - Static variable in class org.apache.lucene.analysis.classic.ClassicTokenizer
- APOSTROPHE_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- ApostropheFilter - Class in org.apache.lucene.analysis.tr
-
Strips all characters after an apostrophe (including the apostrophe itself).
- ApostropheFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.tr.ApostropheFilter
- ApostropheFilterFactory - Class in org.apache.lucene.analysis.tr
-
Factory for ApostropheFilter.
- ApostropheFilterFactory() - Constructor for class org.apache.lucene.analysis.tr.ApostropheFilterFactory
-
Default ctor for compatibility with SPI
- ApostropheFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.tr.ApostropheFilterFactory
- append(char) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- append(CharSequence) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- append(CharSequence, int, int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- apply(char[], int) - Method in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
- ArabicAnalyzer - Class in org.apache.lucene.analysis.ar
-
Analyzer for Arabic.
- ArabicAnalyzer() - Constructor for class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Builds an analyzer with the default stop words: ArabicAnalyzer.DEFAULT_STOPWORD_FILE.
- ArabicAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Builds an analyzer with the given stop words
- ArabicAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Builds an analyzer with the given stop words.
- ArabicNormalizationFilter - Class in org.apache.lucene.analysis.ar
-
A TokenFilter that applies ArabicNormalizer to normalize the orthography.
- ArabicNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ar.ArabicNormalizationFilter
- ArabicNormalizationFilterFactory - Class in org.apache.lucene.analysis.ar
-
Factory for ArabicNormalizationFilter.
- ArabicNormalizationFilterFactory() - Constructor for class org.apache.lucene.analysis.ar.ArabicNormalizationFilterFactory
-
Default ctor for compatibility with SPI
- ArabicNormalizationFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ar.ArabicNormalizationFilterFactory
-
Creates a new ArabicNormalizationFilterFactory
- ArabicNormalizer - Class in org.apache.lucene.analysis.ar
-
Normalizer for Arabic.
- ArabicNormalizer() - Constructor for class org.apache.lucene.analysis.ar.ArabicNormalizer
- ArabicStemFilter - Class in org.apache.lucene.analysis.ar
-
A TokenFilter that applies ArabicStemmer to stem Arabic words.
- ArabicStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ar.ArabicStemFilter
- ArabicStemFilterFactory - Class in org.apache.lucene.analysis.ar
-
Factory for ArabicStemFilter.
- ArabicStemFilterFactory() - Constructor for class org.apache.lucene.analysis.ar.ArabicStemFilterFactory
-
Default ctor for compatibility with SPI
- ArabicStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ar.ArabicStemFilterFactory
-
Creates a new ArabicStemFilterFactory
- ArabicStemmer - Class in org.apache.lucene.analysis.ar
-
Stemmer for Arabic.
- ArabicStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- ArabicStemmer() - Constructor for class org.apache.lucene.analysis.ar.ArabicStemmer
- ArabicStemmer() - Constructor for class org.tartarus.snowball.ext.ArabicStemmer
- ArmenianAnalyzer - Class in org.apache.lucene.analysis.hy
-
Analyzer for Armenian.
- ArmenianAnalyzer() - Constructor for class org.apache.lucene.analysis.hy.ArmenianAnalyzer
-
Builds an analyzer with the default stop words: ArmenianAnalyzer.DEFAULT_STOPWORD_FILE.
- ArmenianAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.hy.ArmenianAnalyzer
-
Builds an analyzer with the given stop words.
- ArmenianAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.hy.ArmenianAnalyzer
-
Builds an analyzer with the given stop words.
- ArmenianStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- ArmenianStemmer() - Constructor for class org.tartarus.snowball.ext.ArmenianStemmer
- ASCIIFoldingFilter - Class in org.apache.lucene.analysis.miscellaneous
-
This class converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, if one exists.
- ASCIIFoldingFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.ASCIIFoldingFilter
- ASCIIFoldingFilter(TokenStream, boolean) - Constructor for class org.apache.lucene.analysis.miscellaneous.ASCIIFoldingFilter
-
Create a new ASCIIFoldingFilter.
- ASCIIFoldingFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for ASCIIFoldingFilter.
- ASCIIFoldingFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.ASCIIFoldingFilterFactory
-
Default ctor for compatibility with SPI
- ASCIIFoldingFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.ASCIIFoldingFilterFactory
-
Creates a new ASCIIFoldingFilterFactory
- assign_to(StringBuilder) - Method in class org.tartarus.snowball.SnowballProgram
- attributeFactory(String) - Method in class org.apache.lucene.collation.CollationKeyAnalyzer
- AVOID_BAD_URL - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
B
- balance() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Balance the tree for best search performance
- BaseCharFilter - Class in org.apache.lucene.analysis.charfilter
-
Base utility class for implementing a CharFilter.
- BaseCharFilter(Reader) - Constructor for class org.apache.lucene.analysis.charfilter.BaseCharFilter
- BasqueAnalyzer - Class in org.apache.lucene.analysis.eu
-
Analyzer for Basque.
- BasqueAnalyzer() - Constructor for class org.apache.lucene.analysis.eu.BasqueAnalyzer
-
Builds an analyzer with the default stop words: BasqueAnalyzer.DEFAULT_STOPWORD_FILE.
- BasqueAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.eu.BasqueAnalyzer
-
Builds an analyzer with the given stop words.
- BasqueAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.eu.BasqueAnalyzer
-
Builds an analyzer with the given stop words.
- BasqueStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- BasqueStemmer() - Constructor for class org.tartarus.snowball.ext.BasqueStemmer
- BEH - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- BengaliAnalyzer - Class in org.apache.lucene.analysis.bn
-
Analyzer for Bengali.
- BengaliAnalyzer() - Constructor for class org.apache.lucene.analysis.bn.BengaliAnalyzer
-
Builds an analyzer with the default stop words: BengaliAnalyzer.DEFAULT_STOPWORD_FILE.
- BengaliAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.bn.BengaliAnalyzer
-
Builds an analyzer with the given stop words
- BengaliAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.bn.BengaliAnalyzer
-
Builds an analyzer with the given stop words
- BengaliNormalizationFilter - Class in org.apache.lucene.analysis.bn
-
A TokenFilter that applies BengaliNormalizer to normalize the orthography.
- BengaliNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.bn.BengaliNormalizationFilter
- BengaliNormalizationFilterFactory - Class in org.apache.lucene.analysis.bn
-
Factory for BengaliNormalizationFilter.
- BengaliNormalizationFilterFactory() - Constructor for class org.apache.lucene.analysis.bn.BengaliNormalizationFilterFactory
-
Default ctor for compatibility with SPI
- BengaliNormalizationFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.bn.BengaliNormalizationFilterFactory
- BengaliNormalizer - Class in org.apache.lucene.analysis.bn
-
Normalizer for Bengali.
- BengaliNormalizer() - Constructor for class org.apache.lucene.analysis.bn.BengaliNormalizer
- BengaliStemFilter - Class in org.apache.lucene.analysis.bn
-
A TokenFilter that applies BengaliStemmer to stem Bengali words.
- BengaliStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.bn.BengaliStemFilter
- BengaliStemFilterFactory - Class in org.apache.lucene.analysis.bn
-
Factory for BengaliStemFilter.
- BengaliStemFilterFactory() - Constructor for class org.apache.lucene.analysis.bn.BengaliStemFilterFactory
-
Default ctor for compatibility with SPI
- BengaliStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.bn.BengaliStemFilterFactory
- BengaliStemmer - Class in org.apache.lucene.analysis.bn
-
Stemmer for Bengali.
- BengaliStemmer() - Constructor for class org.apache.lucene.analysis.bn.BengaliStemmer
- BLOCK_SIZE - Static variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- BOKMAAL - Static variable in class org.apache.lucene.analysis.no.NorwegianLightStemmer
-
Constant to remove Bokmål-specific endings
- BOLD - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- BOLD_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- BOLD_ITALICS - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- BOLD_ITALICS_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- boost - Variable in class org.apache.lucene.analysis.synonym.word2vec.TermAndBoost
-
the boost
- BOTH - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
Output both the untokenized token and the splits
- bra - Variable in class org.tartarus.snowball.SnowballProgram
- BrazilianAnalyzer - Class in org.apache.lucene.analysis.br
-
Analyzer for the Brazilian Portuguese language.
- BrazilianAnalyzer() - Constructor for class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Builds an analyzer with the default stop words (BrazilianAnalyzer.getDefaultStopSet()).
- BrazilianAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Builds an analyzer with the given stop words
- BrazilianAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Builds an analyzer with the given stop words and stemming exclusion words
- BrazilianStemFilter - Class in org.apache.lucene.analysis.br
-
A TokenFilter that applies BrazilianStemmer.
- BrazilianStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.br.BrazilianStemFilter
-
Creates a new BrazilianStemFilter
- BrazilianStemFilterFactory - Class in org.apache.lucene.analysis.br
-
Factory for BrazilianStemFilter.
- BrazilianStemFilterFactory() - Constructor for class org.apache.lucene.analysis.br.BrazilianStemFilterFactory
-
Default ctor for compatibility with SPI
- BrazilianStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.br.BrazilianStemFilterFactory
-
Creates a new BrazilianStemFilterFactory
- BrazilianStemmer - Class in org.apache.lucene.analysis.br
-
A stemmer for Brazilian Portuguese words.
- BrazilianStemmer() - Constructor for class org.apache.lucene.analysis.br.BrazilianStemmer
- buf - Variable in class org.apache.lucene.analysis.util.OpenStringBuilder
- buffer - Variable in class org.apache.lucene.analysis.util.SegmentingTokenizerBase
- BUFFERMAX - Static variable in class org.apache.lucene.analysis.util.SegmentingTokenizerBase
- build() - Method in class org.apache.lucene.analysis.charfilter.NormalizeCharMap.Builder
-
Builds the NormalizeCharMap; call this once you are done calling NormalizeCharMap.Builder.add(java.lang.String, java.lang.String).
- build() - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Builds the analyzer.
- build() - Method in class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter.Builder
-
Returns a StemmerOverrideFilter.StemmerOverrideMap to be used with the StemmerOverrideFilter
- build() - Method in class org.apache.lucene.analysis.synonym.SynonymMap.Builder
-
Builds a SynonymMap and returns it.
- builder() - Static method in class org.apache.lucene.analysis.custom.CustomAnalyzer
-
Returns a builder for custom analyzers that loads all resources from Lucene's classloader.
- builder() - Method in interface org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter.BytesRefBuilderTermAttribute
-
Returns the builder from which the term is derived.
- builder() - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter.BytesRefBuilderTermAttributeImpl
- builder(Path) - Static method in class org.apache.lucene.analysis.custom.CustomAnalyzer
-
Returns a builder for custom analyzers that loads all resources from the given file system base directory.
- builder(ResourceLoader) - Static method in class org.apache.lucene.analysis.custom.CustomAnalyzer
-
Returns a builder for custom analyzers that loads all resources using the given ResourceLoader.
- Builder() - Constructor for class org.apache.lucene.analysis.charfilter.NormalizeCharMap.Builder
- Builder() - Constructor for class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter.Builder
-
Creates a new StemmerOverrideFilter.Builder with ignoreCase set to false
- Builder() - Constructor for class org.apache.lucene.analysis.synonym.SynonymMap.Builder
-
Default constructor, passes dedup=true.
- Builder(boolean) - Constructor for class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter.Builder
-
Creates a new StemmerOverrideFilter.Builder
- Builder(boolean) - Constructor for class org.apache.lucene.analysis.synonym.SynonymMap.Builder
-
If dedup is true then identical rules (same input, same output) will be added only once.
- BulgarianAnalyzer - Class in org.apache.lucene.analysis.bg
-
Analyzer for Bulgarian.
- BulgarianAnalyzer() - Constructor for class org.apache.lucene.analysis.bg.BulgarianAnalyzer
-
Builds an analyzer with the default stop words: BulgarianAnalyzer.DEFAULT_STOPWORD_FILE.
- BulgarianAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.bg.BulgarianAnalyzer
-
Builds an analyzer with the given stop words.
- BulgarianAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.bg.BulgarianAnalyzer
-
Builds an analyzer with the given stop words and a stem exclusion set.
- BulgarianStemFilter - Class in org.apache.lucene.analysis.bg
-
A TokenFilter that applies BulgarianStemmer to stem Bulgarian words.
- BulgarianStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.bg.BulgarianStemFilter
- BulgarianStemFilterFactory - Class in org.apache.lucene.analysis.bg
-
Factory for BulgarianStemFilter.
- BulgarianStemFilterFactory() - Constructor for class org.apache.lucene.analysis.bg.BulgarianStemFilterFactory
-
Default ctor for compatibility with SPI
- BulgarianStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.bg.BulgarianStemFilterFactory
-
Creates a new BulgarianStemFilterFactory
- BulgarianStemmer - Class in org.apache.lucene.analysis.bg
-
Light Stemmer for Bulgarian.
- BulgarianStemmer() - Constructor for class org.apache.lucene.analysis.bg.BulgarianStemmer
- BytesRefBuilderTermAttributeImpl() - Constructor for class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter.BytesRefBuilderTermAttributeImpl
-
Sole constructor no-op
- ByteVector - Class in org.apache.lucene.analysis.compound.hyphenation
-
This class implements a simple byte vector with access to the underlying array.
- ByteVector() - Constructor for class org.apache.lucene.analysis.compound.hyphenation.ByteVector
- ByteVector(byte[]) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.ByteVector
- ByteVector(byte[], int) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.ByteVector
- ByteVector(int) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.ByteVector
C
- canStemToOriginal(AffixedWord) - Method in class org.apache.lucene.analysis.hunspell.WordFormGenerator
-
A sanity-check that the word form generated by affixation in WordFormGenerator.getAllWordForms(String, String, Runnable) is indeed accepted by the spell-checker and analyzed to be the form of the original dictionary entry.
- capacity() - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
-
returns current capacity of array
- capacity() - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
-
returns current capacity of array
- capacity() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- CapitalizationFilter - Class in org.apache.lucene.analysis.miscellaneous
-
A filter to apply normal capitalization rules to Tokens.
- CapitalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.CapitalizationFilter
-
Creates a CapitalizationFilter with the default parameters.
- CapitalizationFilter(TokenStream, boolean, CharArraySet, boolean, Collection<char[]>, int, int, int) - Constructor for class org.apache.lucene.analysis.miscellaneous.CapitalizationFilter
-
Creates a CapitalizationFilter with the specified parameters.
- CapitalizationFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for CapitalizationFilter.
- CapitalizationFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.CapitalizationFilterFactory
-
Default ctor for compatibility with SPI
- CapitalizationFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.CapitalizationFilterFactory
-
Creates a new CapitalizationFilterFactory
- CatalanAnalyzer - Class in org.apache.lucene.analysis.ca
-
Analyzer for Catalan.
- CatalanAnalyzer() - Constructor for class org.apache.lucene.analysis.ca.CatalanAnalyzer
-
Builds an analyzer with the default stop words: CatalanAnalyzer.DEFAULT_STOPWORD_FILE.
- CatalanAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.ca.CatalanAnalyzer
-
Builds an analyzer with the given stop words.
- CatalanAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.ca.CatalanAnalyzer
-
Builds an analyzer with the given stop words.
- CatalanStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- CatalanStemmer() - Constructor for class org.tartarus.snowball.ext.CatalanStemmer
- CATEGORY - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- CATEGORY_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- CATENATE_ALL - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated. Causes all subword parts to be catenated:
- CATENATE_ALL - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
-
Causes all subword parts to be catenated:
- CATENATE_NUMBERS - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated. Causes maximum runs of number parts to be catenated:
- CATENATE_NUMBERS - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
-
Causes maximum runs of number parts to be catenated:
- CATENATE_WORDS - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated. Causes maximum runs of word parts to be catenated:
- CATENATE_WORDS - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
-
Causes maximum runs of word parts to be catenated:
- characters(char[], int, int) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- CharArrayIterator - Class in org.apache.lucene.analysis.util
-
A CharacterIterator used internally for use with BreakIterator
- CharArrayIterator() - Constructor for class org.apache.lucene.analysis.util.CharArrayIterator
- charAt(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- charset - Variable in class org.apache.lucene.analysis.payloads.IdentityEncoder
- CharTokenizer - Class in org.apache.lucene.analysis.util
-
An abstract base class for simple, character-oriented tokenizers.
- CharTokenizer() - Constructor for class org.apache.lucene.analysis.util.CharTokenizer
-
Creates a new CharTokenizer instance
- CharTokenizer(AttributeFactory) - Constructor for class org.apache.lucene.analysis.util.CharTokenizer
-
Creates a new CharTokenizer instance
- CharTokenizer(AttributeFactory, int) - Constructor for class org.apache.lucene.analysis.util.CharTokenizer
-
Creates a new CharTokenizer instance
- CharVector - Class in org.apache.lucene.analysis.compound.hyphenation
-
This class implements a simple char vector with access to the underlying array.
- CharVector() - Constructor for class org.apache.lucene.analysis.compound.hyphenation.CharVector
- CharVector(char[]) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.CharVector
- CharVector(char[], int) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.CharVector
- CharVector(int) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.CharVector
- CITATION - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- CITATION_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- CJ - Static variable in class org.apache.lucene.analysis.classic.ClassicTokenizer
- CJ_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- CJKAnalyzer - Class in org.apache.lucene.analysis.cjk
-
An Analyzer that tokenizes text with StandardTokenizer, normalizes content with CJKWidthFilter, folds case with LowerCaseFilter, forms bigrams of CJK with CJKBigramFilter, and filters stopwords with StopFilter
- CJKAnalyzer() - Constructor for class org.apache.lucene.analysis.cjk.CJKAnalyzer
-
Builds an analyzer which removes words in CJKAnalyzer.getDefaultStopSet().
- CJKAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.cjk.CJKAnalyzer
-
Builds an analyzer with the given stop words
- CJKBigramFilter - Class in org.apache.lucene.analysis.cjk
-
Forms bigrams of CJK terms that are generated from StandardTokenizer or ICUTokenizer.
- CJKBigramFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.cjk.CJKBigramFilter
- CJKBigramFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.cjk.CJKBigramFilter
- CJKBigramFilter(TokenStream, int, boolean) - Constructor for class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
Create a new CJKBigramFilter, specifying which writing systems should be bigrammed, and whether or not unigrams should also be output.
- CJKBigramFilterFactory - Class in org.apache.lucene.analysis.cjk
-
Factory for CJKBigramFilter.
- CJKBigramFilterFactory() - Constructor for class org.apache.lucene.analysis.cjk.CJKBigramFilterFactory
-
Default ctor for compatibility with SPI
- CJKBigramFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.cjk.CJKBigramFilterFactory
-
Creates a new CJKBigramFilterFactory
- CJKWidthCharFilter - Class in org.apache.lucene.analysis.cjk
-
A
CharFilter
that normalizes CJK width differences: Folds fullwidth ASCII variants into the equivalent basic latin Folds halfwidth Katakana variants into the equivalent kana - CJKWidthCharFilter(Reader) - Constructor for class org.apache.lucene.analysis.cjk.CJKWidthCharFilter
-
Default constructor that takes a
Reader
. - CJKWidthCharFilterFactory - Class in org.apache.lucene.analysis.cjk
-
Factory for CJKWidthCharFilter.
- CJKWidthCharFilterFactory() - Constructor for class org.apache.lucene.analysis.cjk.CJKWidthCharFilterFactory
-
Default ctor for compatibility with SPI
- CJKWidthCharFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.cjk.CJKWidthCharFilterFactory
-
Creates a new CJKWidthCharFilterFactory
- CJKWidthFilter - Class in org.apache.lucene.analysis.cjk
-
A TokenFilter that normalizes CJK width differences: folds fullwidth ASCII variants into the equivalent Basic Latin, and folds halfwidth Katakana variants into the equivalent kana.
- CJKWidthFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.cjk.CJKWidthFilter
- CJKWidthFilterFactory - Class in org.apache.lucene.analysis.cjk
-
Factory for CJKWidthFilter.
- CJKWidthFilterFactory() - Constructor for class org.apache.lucene.analysis.cjk.CJKWidthFilterFactory
-
Default ctor for compatibility with SPI
- CJKWidthFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.cjk.CJKWidthFilterFactory
-
Creates a new CJKWidthFilterFactory
- ClassicAnalyzer - Class in org.apache.lucene.analysis.classic
-
Filters ClassicTokenizer with ClassicFilter, LowerCaseFilter and StopFilter, using a list of English stop words.
- ClassicAnalyzer() - Constructor for class org.apache.lucene.analysis.classic.ClassicAnalyzer
-
Builds an analyzer with the default stop words (ClassicAnalyzer.STOP_WORDS_SET).
- ClassicAnalyzer(Reader) - Constructor for class org.apache.lucene.analysis.classic.ClassicAnalyzer
-
Builds an analyzer with the stop words from the given reader.
- ClassicAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.classic.ClassicAnalyzer
-
Builds an analyzer with the given stop words.
- ClassicFilter - Class in org.apache.lucene.analysis.classic
-
Normalizes tokens extracted with ClassicTokenizer.
- ClassicFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.classic.ClassicFilter
-
Constructs a ClassicFilter over the given input TokenStream.
- ClassicFilterFactory - Class in org.apache.lucene.analysis.classic
-
Factory for ClassicFilter.
- ClassicFilterFactory() - Constructor for class org.apache.lucene.analysis.classic.ClassicFilterFactory
-
Default ctor for compatibility with SPI
- ClassicFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.classic.ClassicFilterFactory
-
Creates a new ClassicFilterFactory
- ClassicTokenizer - Class in org.apache.lucene.analysis.classic
-
A grammar-based tokenizer constructed with JFlex
- ClassicTokenizer() - Constructor for class org.apache.lucene.analysis.classic.ClassicTokenizer
-
Creates a new instance of the ClassicTokenizer.
- ClassicTokenizer(AttributeFactory) - Constructor for class org.apache.lucene.analysis.classic.ClassicTokenizer
-
Creates a new ClassicTokenizer with a given AttributeFactory.
- ClassicTokenizerFactory - Class in org.apache.lucene.analysis.classic
-
Factory for ClassicTokenizer.
- ClassicTokenizerFactory() - Constructor for class org.apache.lucene.analysis.classic.ClassicTokenizerFactory
-
Default ctor for compatibility with SPI
- ClassicTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.classic.ClassicTokenizerFactory
-
Creates a new ClassicTokenizerFactory
- classmap - Variable in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
This map stores the character classes
- clear() - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
-
Reset Vector but don't resize or clear elements
- clear() - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter.BytesRefBuilderTermAttributeImpl
- clone() - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
- clone() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- clone() - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter.BytesRefBuilderTermAttributeImpl
- clone() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- close() - Method in class org.apache.lucene.analysis.charfilter.HTMLStripCharFilter
- close() - Method in class org.apache.lucene.analysis.classic.ClassicTokenizer
- close() - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
- close() - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter
- close() - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenatingTokenStream
- close() - Method in class org.apache.lucene.analysis.miscellaneous.ConditionalTokenFilter
- close() - Method in class org.apache.lucene.analysis.pattern.PatternTokenizer
- close() - Method in class org.apache.lucene.analysis.synonym.word2vec.Dl4jModelReader
- close() - Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- CodepointCountFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Removes words that are too long or too short from the stream.
- CodepointCountFilter(TokenStream, int, int) - Constructor for class org.apache.lucene.analysis.miscellaneous.CodepointCountFilter
-
Create a new CodepointCountFilter.
- CodepointCountFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for CodepointCountFilter.
- CodepointCountFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.CodepointCountFilterFactory
-
Default ctor for compatibility with SPI
- CodepointCountFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.CodepointCountFilterFactory
-
Creates a new CodepointCountFilterFactory
- CollatedTermAttributeImpl - Class in org.apache.lucene.collation.tokenattributes
-
Extension of CharTermAttributeImpl that encodes the term text as a binary Unicode collation key instead of as UTF-8 bytes.
- CollatedTermAttributeImpl(Collator) - Constructor for class org.apache.lucene.collation.tokenattributes.CollatedTermAttributeImpl
-
Create a new CollatedTermAttributeImpl
- CollationAttributeFactory - Class in org.apache.lucene.collation
-
Converts each token into its CollationKey, and then encodes the bytes as an index term.
- CollationAttributeFactory(Collator) - Constructor for class org.apache.lucene.collation.CollationAttributeFactory
-
Create a CollationAttributeFactory, using TokenStream.DEFAULT_TOKEN_ATTRIBUTE_FACTORY as the factory for all other attributes.
- CollationAttributeFactory(AttributeFactory, Collator) - Constructor for class org.apache.lucene.collation.CollationAttributeFactory
-
Create a CollationAttributeFactory, using the supplied Attribute Factory as the factory for all other attributes.
- CollationDocValuesField - Class in org.apache.lucene.collation
-
Indexes collation keys as a single-valued SortedDocValuesField.
- CollationDocValuesField(String, Collator) - Constructor for class org.apache.lucene.collation.CollationDocValuesField
-
Create a new CollationDocValuesField.
- CollationKeyAnalyzer - Class in org.apache.lucene.collation
-
Configures KeywordTokenizer with CollationAttributeFactory.
- CollationKeyAnalyzer(Collator) - Constructor for class org.apache.lucene.collation.CollationKeyAnalyzer
-
Create a new CollationKeyAnalyzer, using the specified collator.
- CommonGramsFilter - Class in org.apache.lucene.analysis.commongrams
-
Construct bigrams for frequently occurring terms while indexing.
- CommonGramsFilter(TokenStream, CharArraySet) - Constructor for class org.apache.lucene.analysis.commongrams.CommonGramsFilter
-
Construct a token stream filtering the given input using a Set of common words to create bigrams.
- CommonGramsFilterFactory - Class in org.apache.lucene.analysis.commongrams
-
Constructs a CommonGramsFilter.
- CommonGramsFilterFactory() - Constructor for class org.apache.lucene.analysis.commongrams.CommonGramsFilterFactory
-
Default ctor for compatibility with SPI
- CommonGramsFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.commongrams.CommonGramsFilterFactory
-
Creates a new CommonGramsFilterFactory
- CommonGramsQueryFilter - Class in org.apache.lucene.analysis.commongrams
-
Wrap a CommonGramsFilter optimizing phrase queries by only returning single words when they are not a member of a bigram.
- CommonGramsQueryFilter(CommonGramsFilter) - Constructor for class org.apache.lucene.analysis.commongrams.CommonGramsQueryFilter
-
Constructs a new CommonGramsQueryFilter based on the provided CommonGramsFilter.
- CommonGramsQueryFilterFactory - Class in org.apache.lucene.analysis.commongrams
-
Constructs a CommonGramsQueryFilter.
- CommonGramsQueryFilterFactory() - Constructor for class org.apache.lucene.analysis.commongrams.CommonGramsQueryFilterFactory
-
Default ctor for compatibility with SPI
- CommonGramsQueryFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.commongrams.CommonGramsQueryFilterFactory
-
Creates a new CommonGramsQueryFilterFactory
- COMPANY - Static variable in class org.apache.lucene.analysis.classic.ClassicTokenizer
- COMPANY_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- CompoundToken(int, int) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase.CompoundToken
-
Construct the compound token based on a slice of the current CompoundWordTokenFilterBase.termAtt.
- CompoundWordTokenFilterBase - Class in org.apache.lucene.analysis.compound
-
Base class for decomposition token filters.
- CompoundWordTokenFilterBase(TokenStream, CharArraySet) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- CompoundWordTokenFilterBase(TokenStream, CharArraySet, boolean) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- CompoundWordTokenFilterBase(TokenStream, CharArraySet, int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- CompoundWordTokenFilterBase.CompoundToken - Class in org.apache.lucene.analysis.compound
-
Helper class to hold decompounded token information
- compress(List<String>) - Method in class org.apache.lucene.analysis.hunspell.Hunspell
-
Given a list of words, try to produce a smaller set of dictionary entries (with some flags) that would generate these words.
- compress(List<String>, Set<String>, Runnable) - Method in class org.apache.lucene.analysis.hunspell.WordFormGenerator
-
Given a list of words, try to produce a smaller set of dictionary entries (with some flags) that would generate these words.
- ConcatenateGraphFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Concatenates/Joins every incoming token with a separator into one output token for every path through the token stream (which is a graph).
- ConcatenateGraphFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter
-
Creates a token stream to convert input to a token stream of accepted strings by its token stream graph.
- ConcatenateGraphFilter(TokenStream, boolean, boolean, int) - Constructor for class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter
- ConcatenateGraphFilter(TokenStream, Character, boolean, int) - Constructor for class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter
-
Creates a token stream to convert input to a token stream of accepted strings by its token stream graph.
- ConcatenateGraphFilter.BytesRefBuilderTermAttribute - Interface in org.apache.lucene.analysis.miscellaneous
-
Attribute providing access to the term builder and UTF-16 conversion
- ConcatenateGraphFilter.BytesRefBuilderTermAttributeImpl - Class in org.apache.lucene.analysis.miscellaneous
-
Implementation of ConcatenateGraphFilter.BytesRefBuilderTermAttribute.
- ConcatenateGraphFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for ConcatenateGraphFilter.
- ConcatenateGraphFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilterFactory
-
Default ctor for compatibility with SPI
- ConcatenateGraphFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilterFactory
- ConcatenatingTokenStream - Class in org.apache.lucene.analysis.miscellaneous
-
A TokenStream that takes an array of input TokenStreams as sources, and concatenates them together.
- ConcatenatingTokenStream(TokenStream...) - Constructor for class org.apache.lucene.analysis.miscellaneous.ConcatenatingTokenStream
-
Create a new ConcatenatingTokenStream from a set of inputs
- ConditionalTokenFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Allows skipping TokenFilters based on the current set of attributes.
- ConditionalTokenFilter(TokenStream, Function<TokenStream, TokenStream>) - Constructor for class org.apache.lucene.analysis.miscellaneous.ConditionalTokenFilter
-
Create a new ConditionalTokenFilter
- ConditionalTokenFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Abstract parent class for analysis factories that create ConditionalTokenFilter instances.
- ConditionalTokenFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.ConditionalTokenFilterFactory
-
Default ctor for compatibility with SPI
- ConditionalTokenFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.ConditionalTokenFilterFactory
- CONSUME_ALL_TOKENS_KEY - Static variable in class org.apache.lucene.analysis.miscellaneous.LimitTokenCountFilterFactory
- CONSUME_ALL_TOKENS_KEY - Static variable in class org.apache.lucene.analysis.miscellaneous.LimitTokenOffsetFilterFactory
- CONSUME_ALL_TOKENS_KEY - Static variable in class org.apache.lucene.analysis.miscellaneous.LimitTokenPositionFilterFactory
- consumeAllTokens() - Method in class org.apache.lucene.analysis.sinks.TeeSinkTokenFilter
-
TeeSinkTokenFilter passes all tokens to the added sinks when it is itself consumed.
- copy() - Method in class org.apache.lucene.analysis.synonym.word2vec.Word2VecModel
- copy_from(SnowballProgram) - Method in class org.tartarus.snowball.SnowballProgram
- copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter.BytesRefBuilderTermAttributeImpl
- correct(int) - Method in class org.apache.lucene.analysis.charfilter.BaseCharFilter
-
Retrieve the corrected offset.
- correct(int) - Method in class org.apache.lucene.analysis.fa.PersianCharFilter
- correct(int) - Method in class org.apache.lucene.analysis.pattern.PatternReplaceCharFilter
- create(Reader) - Method in class org.apache.lucene.analysis.charfilter.HTMLStripCharFilterFactory
- create(Reader) - Method in class org.apache.lucene.analysis.charfilter.MappingCharFilterFactory
- create(Reader) - Method in class org.apache.lucene.analysis.cjk.CJKWidthCharFilterFactory
- create(Reader) - Method in class org.apache.lucene.analysis.fa.PersianCharFilterFactory
- create(Reader) - Method in class org.apache.lucene.analysis.pattern.PatternReplaceCharFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ar.ArabicNormalizationFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ar.ArabicStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.bg.BulgarianStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.bn.BengaliNormalizationFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.bn.BengaliStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.boost.DelimitedBoostTokenFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.br.BrazilianStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.cjk.CJKBigramFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.cjk.CJKWidthFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ckb.SoraniNormalizationFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ckb.SoraniStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.classic.ClassicFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.commongrams.CommonGramsFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.commongrams.CommonGramsQueryFilterFactory
-
Create a CommonGramsFilter and wrap it with a CommonGramsQueryFilter
- create(TokenStream) - Method in class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.core.DecimalDigitFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.core.FlattenGraphFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.core.LowerCaseFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.core.StopFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.core.TypeTokenFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.core.UpperCaseFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.cz.CzechStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.de.GermanLightStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.de.GermanMinimalStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.de.GermanNormalizationFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.de.GermanStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.el.GreekLowerCaseFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.el.GreekStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.en.EnglishMinimalStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.en.EnglishPossessiveFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.en.KStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.en.PorterStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.es.SpanishLightStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.es.SpanishMinimalStemFilterFactory
-
Deprecated.
- create(TokenStream) - Method in class org.apache.lucene.analysis.es.SpanishPluralStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.fa.PersianNormalizationFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.fa.PersianStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.fi.FinnishLightStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.fr.FrenchLightStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.fr.FrenchMinimalStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ga.IrishLowerCaseFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.gl.GalicianMinimalStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.gl.GalicianStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.hi.HindiNormalizationFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.hi.HindiStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.hu.HungarianLightStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.hunspell.HunspellStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.id.IndonesianStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.in.IndicNormalizationFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.it.ItalianLightStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.lv.LatvianStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.minhash.MinHashFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.ASCIIFoldingFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.CapitalizationFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.CodepointCountFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.ConditionalTokenFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.DateRecognizerFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.DelimitedTermFrequencyTokenFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.DropIfFlaggedFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.FingerprintFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.FixBrokenOffsetsFilterFactory
-
Deprecated.
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.HyphenatedWordsFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.KeepWordFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.KeywordMarkerFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.KeywordRepeatFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.LengthFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.LimitTokenCountFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.LimitTokenOffsetFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.LimitTokenPositionFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.RemoveDuplicatesTokenFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.ScandinavianFoldingFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizationFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.TrimFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.TruncateTokenFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.TypeAsSynonymFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilterFactory
-
Deprecated.
- create(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ngram.EdgeNGramFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ngram.NGramFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.no.NorwegianLightStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.no.NorwegianMinimalStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.no.NorwegianNormalizationFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.pattern.PatternCaptureGroupFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.pattern.PatternReplaceFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.pattern.PatternTypingFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.payloads.DelimitedPayloadTokenFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.payloads.NumericPayloadTokenFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.payloads.TokenOffsetPayloadTokenFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.payloads.TypeAsPayloadTokenFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.pt.PortugueseLightStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.pt.PortugueseMinimalStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.pt.PortugueseStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.reverse.ReverseStringFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ru.RussianLightStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.shingle.FixedShingleFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.shingle.ShingleFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.snowball.SnowballPorterFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.sr.SerbianNormalizationFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.sv.SwedishLightStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.sv.SwedishMinimalStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.synonym.SynonymFilterFactory
-
Deprecated.
- create(TokenStream) - Method in class org.apache.lucene.analysis.synonym.SynonymGraphFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.synonym.word2vec.Word2VecSynonymFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.te.TeluguNormalizationFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.te.TeluguStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.tr.ApostropheFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.tr.TurkishLowerCaseFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.util.ElisionFilterFactory
- create(TokenStream, Function<TokenStream, TokenStream>) - Method in class org.apache.lucene.analysis.miscellaneous.ConditionalTokenFilterFactory
-
Modify the incoming TokenStream with a ConditionalTokenFilter.
- create(TokenStream, Function<TokenStream, TokenStream>) - Method in class org.apache.lucene.analysis.miscellaneous.ProtectedTermFilterFactory
- create(AttributeFactory) - Method in class org.apache.lucene.analysis.classic.ClassicTokenizerFactory
- create(AttributeFactory) - Method in class org.apache.lucene.analysis.core.KeywordTokenizerFactory
- create(AttributeFactory) - Method in class org.apache.lucene.analysis.core.LetterTokenizerFactory
- create(AttributeFactory) - Method in class org.apache.lucene.analysis.core.WhitespaceTokenizerFactory
- create(AttributeFactory) - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerFactory
- create(AttributeFactory) - Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizerFactory
- create(AttributeFactory) - Method in class org.apache.lucene.analysis.ngram.NGramTokenizerFactory
- create(AttributeFactory) - Method in class org.apache.lucene.analysis.path.PathHierarchyTokenizerFactory
- create(AttributeFactory) - Method in class org.apache.lucene.analysis.pattern.PatternTokenizerFactory
-
Split the input using configured pattern
- create(AttributeFactory) - Method in class org.apache.lucene.analysis.pattern.SimplePatternSplitTokenizerFactory
- create(AttributeFactory) - Method in class org.apache.lucene.analysis.pattern.SimplePatternTokenizerFactory
- create(AttributeFactory) - Method in class org.apache.lucene.analysis.th.ThaiTokenizerFactory
- create(AttributeFactory) - Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizerFactory
- createComponents(String) - Method in class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader.
- createComponents(String) - Method in class org.apache.lucene.analysis.bg.BulgarianAnalyzer
-
Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader.
- createComponents(String) - Method in class org.apache.lucene.analysis.bn.BengaliAnalyzer
-
Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader.
- createComponents(String) - Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader.
- createComponents(String) - Method in class org.apache.lucene.analysis.ca.CatalanAnalyzer
-
Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader.
- createComponents(String) - Method in class org.apache.lucene.analysis.cjk.CJKAnalyzer
- createComponents(String) - Method in class org.apache.lucene.analysis.ckb.SoraniAnalyzer
-
Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader.
- createComponents(String) - Method in class org.apache.lucene.analysis.classic.ClassicAnalyzer
- createComponents(String) - Method in class org.apache.lucene.analysis.core.KeywordAnalyzer
- createComponents(String) - Method in class org.apache.lucene.analysis.core.SimpleAnalyzer
- createComponents(String) - Method in class org.apache.lucene.analysis.core.StopAnalyzer
-
Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader.
- createComponents(String) - Method in class org.apache.lucene.analysis.core.UnicodeWhitespaceAnalyzer
- createComponents(String) - Method in class org.apache.lucene.analysis.core.WhitespaceAnalyzer
- createComponents(String) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer
- createComponents(String) - Method in class org.apache.lucene.analysis.cz.CzechAnalyzer
-
Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader.
- createComponents(String) - Method in class org.apache.lucene.analysis.da.DanishAnalyzer
-
Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader.
- createComponents(String) - Method in class org.apache.lucene.analysis.de.GermanAnalyzer
-
Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader.
- createComponents(String) - Method in class org.apache.lucene.analysis.el.GreekAnalyzer
-
Creates Analyzer.TokenStreamComponents used to tokenize all the text in the provided Reader.
- createComponents(String) - Method in class org.apache.lucene.analysis.email.UAX29URLEmailAnalyzer
- createComponents(String) - Method in class org.apache.lucene.analysis.en.EnglishAnalyzer
-
Creates a
Analyzer.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String) - Method in class org.apache.lucene.analysis.es.SpanishAnalyzer
-
Creates a
Analyzer.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String) - Method in class org.apache.lucene.analysis.et.EstonianAnalyzer
-
Creates a
Analyzer.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String) - Method in class org.apache.lucene.analysis.eu.BasqueAnalyzer
-
Creates a
Analyzer.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String) - Method in class org.apache.lucene.analysis.fa.PersianAnalyzer
-
Creates
Analyzer.TokenStreamComponents
used to tokenize all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.fi.FinnishAnalyzer
-
Creates an
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Creates
Analyzer.TokenStreamComponents
used to tokenize all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.ga.IrishAnalyzer
-
Creates an
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.gl.GalicianAnalyzer
-
Creates an
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.hi.HindiAnalyzer
-
Creates
Analyzer.TokenStreamComponents
used to tokenize all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.hu.HungarianAnalyzer
-
Creates an
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.hy.ArmenianAnalyzer
-
Creates an
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.id.IndonesianAnalyzer
-
Creates
Analyzer.TokenStreamComponents
used to tokenize all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.it.ItalianAnalyzer
-
Creates an
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.lt.LithuanianAnalyzer
-
Creates an
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.lv.LatvianAnalyzer
-
Creates an
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.ne.NepaliAnalyzer
-
Creates
Analyzer.TokenStreamComponents
used to tokenize all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Returns a (possibly reused)
TokenStream
which tokenizes all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.no.NorwegianAnalyzer
-
Creates an
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
Creates an
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
Creates an
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.ru.RussianAnalyzer
-
Creates
Analyzer.TokenStreamComponents
used to tokenize all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.sr.SerbianAnalyzer
-
Creates an
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
Creates an
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.ta.TamilAnalyzer
-
Creates
Analyzer.TokenStreamComponents
used to tokenize all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.te.TeluguAnalyzer
-
Creates
Analyzer.TokenStreamComponents
used to tokenize all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.th.ThaiAnalyzer
-
Creates
Analyzer.TokenStreamComponents
used to tokenize all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.analysis.tr.TurkishAnalyzer
-
Creates an
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
. - createComponents(String) - Method in class org.apache.lucene.collation.CollationKeyAnalyzer
- createDefaultWords() - Method in class org.apache.lucene.analysis.commongrams.CommonGramsFilterFactory
- createDefaultWords() - Method in class org.apache.lucene.analysis.core.StopFilterFactory
- createDefaultWords() - Method in class org.apache.lucene.analysis.en.AbstractWordsFileFilterFactory
-
Default word set implementation.
- createDefaultWords() - Method in class org.apache.lucene.analysis.miscellaneous.KeepWordFilterFactory
- createInstance() - Method in class org.apache.lucene.collation.CollationAttributeFactory
- CSVUtil - Class in org.apache.lucene.analysis.util
-
Utility class for parsing CSV text
- current() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- cursor - Variable in class org.tartarus.snowball.SnowballProgram
- CustomAnalyzer - Class in org.apache.lucene.analysis.custom
-
A general-purpose Analyzer that can be created with a builder-style API.
- CustomAnalyzer.Builder - Class in org.apache.lucene.analysis.custom
-
Builder for
CustomAnalyzer
. - CustomAnalyzer.ConditionBuilder - Class in org.apache.lucene.analysis.custom
-
Factory class for a
ConditionalTokenFilter
- CzechAnalyzer - Class in org.apache.lucene.analysis.cz
-
Analyzer
for Czech language. - CzechAnalyzer() - Constructor for class org.apache.lucene.analysis.cz.CzechAnalyzer
-
Builds an analyzer with the default stop words (
CzechAnalyzer.getDefaultStopSet()
). - CzechAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.cz.CzechAnalyzer
-
Builds an analyzer with the given stop words.
- CzechAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.cz.CzechAnalyzer
-
Builds an analyzer with the given stop words and a set of words to be excluded from the
CzechStemFilter
. - CzechStemFilter - Class in org.apache.lucene.analysis.cz
-
A
TokenFilter
that applies CzechStemmer
to stem Czech words. - CzechStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.cz.CzechStemFilter
- CzechStemFilterFactory - Class in org.apache.lucene.analysis.cz
-
Factory for
CzechStemFilter
. - CzechStemFilterFactory() - Constructor for class org.apache.lucene.analysis.cz.CzechStemFilterFactory
-
Default ctor for compatibility with SPI
- CzechStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.cz.CzechStemFilterFactory
-
Creates a new CzechStemFilterFactory
- CzechStemmer - Class in org.apache.lucene.analysis.cz
-
Light Stemmer for Czech.
- CzechStemmer() - Constructor for class org.apache.lucene.analysis.cz.CzechStemmer
D
- DAMMA - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- DAMMATAN - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- DanishAnalyzer - Class in org.apache.lucene.analysis.da
-
Analyzer
for Danish. - DanishAnalyzer() - Constructor for class org.apache.lucene.analysis.da.DanishAnalyzer
-
Builds an analyzer with the default stop words:
DanishAnalyzer.DEFAULT_STOPWORD_FILE
. - DanishAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.da.DanishAnalyzer
-
Builds an analyzer with the given stop words.
- DanishAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.da.DanishAnalyzer
-
Builds an analyzer with the given stop words.
- DanishStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- DanishStemmer() - Constructor for class org.tartarus.snowball.ext.DanishStemmer
- DATE_PATTERN - Static variable in class org.apache.lucene.analysis.miscellaneous.DateRecognizerFilterFactory
- DATE_TYPE - Static variable in class org.apache.lucene.analysis.miscellaneous.DateRecognizerFilter
- DateRecognizerFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Filters all tokens that cannot be parsed to a date, using the provided
DateFormat
. - DateRecognizerFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.DateRecognizerFilter
- DateRecognizerFilter(TokenStream, DateFormat) - Constructor for class org.apache.lucene.analysis.miscellaneous.DateRecognizerFilter
- DateRecognizerFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for
DateRecognizerFilter
. - DateRecognizerFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.DateRecognizerFilterFactory
-
Default ctor for compatibility with SPI
- DateRecognizerFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.DateRecognizerFilterFactory
-
Creates a new DateRecognizerFilterFactory
- DBBI_AVAILABLE - Static variable in class org.apache.lucene.analysis.th.ThaiTokenizer
-
True if the JRE supports a working dictionary-based breakiterator for Thai.
- DecimalDigitFilter - Class in org.apache.lucene.analysis.core
-
Folds all Unicode digits in
[:General_Category=Decimal_Number:]
to Basic Latin digits (0-9
). - DecimalDigitFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.core.DecimalDigitFilter
-
Creates a new DecimalDigitFilter over
input
- DecimalDigitFilterFactory - Class in org.apache.lucene.analysis.core
-
Factory for
DecimalDigitFilter
. - DecimalDigitFilterFactory() - Constructor for class org.apache.lucene.analysis.core.DecimalDigitFilterFactory
-
Default ctor for compatibility with SPI
- DecimalDigitFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.core.DecimalDigitFilterFactory
-
Creates a new DecimalDigitFilterFactory
- decodeFloat(byte[]) - Static method in class org.apache.lucene.analysis.payloads.PayloadHelper
- decodeFloat(byte[], int) - Static method in class org.apache.lucene.analysis.payloads.PayloadHelper
-
Decode the payload that was encoded using
PayloadHelper.encodeFloat(float)
. - decodeInt(byte[], int) - Static method in class org.apache.lucene.analysis.payloads.PayloadHelper
- decompose() - Method in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
Decomposes the current
CompoundWordTokenFilterBase.termAtt
and places CompoundWordTokenFilterBase.CompoundToken
instances in the CompoundWordTokenFilterBase.tokens
list. - decompose() - Method in class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilter
- decompose() - Method in class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
- DEFAULT_ARTICLES - Static variable in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Default set of articles for ElisionFilter
- DEFAULT_BUCKET_COUNT - Static variable in class org.apache.lucene.analysis.minhash.MinHashFilter
- DEFAULT_BUFFER_SIZE - Static variable in class org.apache.lucene.analysis.core.KeywordTokenizer
-
Default read buffer size
- DEFAULT_DELIMITER - Static variable in class org.apache.lucene.analysis.boost.DelimitedBoostTokenFilterFactory
- DEFAULT_DELIMITER - Static variable in class org.apache.lucene.analysis.miscellaneous.DelimitedTermFrequencyTokenFilter
- DEFAULT_DELIMITER - Static variable in class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- DEFAULT_DELIMITER - Static variable in class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- DEFAULT_DELIMITER - Static variable in class org.apache.lucene.analysis.payloads.DelimitedPayloadTokenFilter
- DEFAULT_FILLER_TOKEN - Static variable in class org.apache.lucene.analysis.shingle.ShingleFilter
-
filler token for when positionIncrement is more than 1
- DEFAULT_HASH_COUNT - Static variable in class org.apache.lucene.analysis.minhash.MinHashFilter
- DEFAULT_HASH_SET_SIZE - Static variable in class org.apache.lucene.analysis.minhash.MinHashFilter
- DEFAULT_MAX_GRAM_SIZE - Static variable in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
- DEFAULT_MAX_GRAPH_EXPANSIONS - Static variable in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter
- DEFAULT_MAX_NGRAM_SIZE - Static variable in class org.apache.lucene.analysis.ngram.NGramTokenizer
- DEFAULT_MAX_OUTPUT_TOKEN_SIZE - Static variable in class org.apache.lucene.analysis.miscellaneous.FingerprintFilter
- DEFAULT_MAX_SHINGLE_SIZE - Static variable in class org.apache.lucene.analysis.shingle.ShingleFilter
-
default maximum shingle size is 2.
- DEFAULT_MAX_SUBWORD_SIZE - Static variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
The default for maximal length of subwords that get propagated to the output of this filter
- DEFAULT_MAX_SYNONYMS_PER_TERM - Static variable in class org.apache.lucene.analysis.synonym.word2vec.Word2VecSynonymFilterFactory
- DEFAULT_MAX_TOKEN_LENGTH - Static variable in class org.apache.lucene.analysis.classic.ClassicAnalyzer
-
Default maximum allowed token length
- DEFAULT_MAX_TOKEN_LENGTH - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailAnalyzer
-
Default maximum allowed token length
- DEFAULT_MAX_TOKEN_LENGTH - Static variable in class org.apache.lucene.analysis.miscellaneous.CapitalizationFilter
- DEFAULT_MAX_WORD_COUNT - Static variable in class org.apache.lucene.analysis.miscellaneous.CapitalizationFilter
- DEFAULT_MAX_WORD_LEN - Static variable in class org.apache.lucene.analysis.util.CharTokenizer
- DEFAULT_MIN_ACCEPTED_SIMILARITY - Static variable in class org.apache.lucene.analysis.synonym.word2vec.Word2VecSynonymFilterFactory
- DEFAULT_MIN_GRAM_SIZE - Static variable in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
- DEFAULT_MIN_NGRAM_SIZE - Static variable in class org.apache.lucene.analysis.ngram.NGramTokenizer
- DEFAULT_MIN_SHINGLE_SIZE - Static variable in class org.apache.lucene.analysis.shingle.ShingleFilter
-
default minimum shingle size is 2.
- DEFAULT_MIN_SUBWORD_SIZE - Static variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
The default for minimal length of subwords that get propagated to the output of this filter
- DEFAULT_MIN_WORD_SIZE - Static variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
The default for minimal word length that gets decomposed
- DEFAULT_PRESERVE_ORIGINAL - Static variable in class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
- DEFAULT_PRESERVE_ORIGINAL - Static variable in class org.apache.lucene.analysis.ngram.NGramTokenFilter
- DEFAULT_PRESERVE_POSITION_INCREMENTS - Static variable in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter
- DEFAULT_PRESERVE_SEP - Static variable in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter
- DEFAULT_SEPARATOR - Static variable in class org.apache.lucene.analysis.miscellaneous.FingerprintFilter
- DEFAULT_SKIP - Static variable in class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- DEFAULT_SKIP - Static variable in class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
File containing default Arabic stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.bg.BulgarianAnalyzer
-
File containing default Bulgarian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.bn.BengaliAnalyzer
-
File containing default Bengali stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
File containing default Brazilian Portuguese stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.ca.CatalanAnalyzer
-
File containing default Catalan stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.cjk.CJKAnalyzer
-
File containing default CJK stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.ckb.SoraniAnalyzer
-
File containing default Kurdish stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.cz.CzechAnalyzer
-
File containing default Czech stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.da.DanishAnalyzer
-
File containing default Danish stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.de.GermanAnalyzer
-
File containing default German stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.el.GreekAnalyzer
-
File containing default Greek stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.es.SpanishAnalyzer
-
File containing default Spanish stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.et.EstonianAnalyzer
-
File containing default Estonian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.eu.BasqueAnalyzer
-
File containing default Basque stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.fa.PersianAnalyzer
-
File containing default Persian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.fi.FinnishAnalyzer
-
File containing default Finnish stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
File containing default French stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.ga.IrishAnalyzer
-
File containing default Irish stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.gl.GalicianAnalyzer
-
File containing default Galician stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.hi.HindiAnalyzer
-
File containing default Hindi stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.hu.HungarianAnalyzer
-
File containing default Hungarian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.hy.ArmenianAnalyzer
-
File containing default Armenian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.id.IndonesianAnalyzer
-
File containing default Indonesian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.it.ItalianAnalyzer
-
File containing default Italian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.lt.LithuanianAnalyzer
-
File containing default Lithuanian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.lv.LatvianAnalyzer
-
File containing default Latvian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.ne.NepaliAnalyzer
-
File containing default Nepali stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
File containing default Dutch stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.no.NorwegianAnalyzer
-
File containing default Norwegian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
File containing default Portuguese stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
File containing default Romanian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.ru.RussianAnalyzer
-
File containing default Russian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.sr.SerbianAnalyzer
-
File containing default Serbian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
File containing default Swedish stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.ta.TamilAnalyzer
-
File containing default Tamil stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.te.TeluguAnalyzer
-
File containing default Telugu stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.th.ThaiAnalyzer
-
File containing default Thai stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.tr.TurkishAnalyzer
-
File containing default Turkish stopwords.
- DEFAULT_TOKEN_SEPARATOR - Static variable in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter
- DEFAULT_TOKEN_SEPARATOR - Static variable in class org.apache.lucene.analysis.shingle.ShingleFilter
-
The default string to use when joining adjacent tokens to form a shingle
- DEFAULT_TOKEN_TYPE - Static variable in class org.apache.lucene.analysis.shingle.ShingleFilter
-
default token type attribute value is "shingle"
- DEFAULT_WORD_DELIM_TABLE - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterIterator
- defaultMaxDocFreqPercent - Static variable in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
- delete(char[], int, int) - Static method in class org.apache.lucene.analysis.util.StemmerUtil
-
Delete a character in-place
- deleteN(char[], int, int, int) - Static method in class org.apache.lucene.analysis.util.StemmerUtil
-
Delete n characters in-place
- DelimitedBoostTokenFilter - Class in org.apache.lucene.analysis.boost
-
Characters before the delimiter are the "token", those after are the boost.
- DelimitedBoostTokenFilter(TokenStream, char) - Constructor for class org.apache.lucene.analysis.boost.DelimitedBoostTokenFilter
- DelimitedBoostTokenFilterFactory - Class in org.apache.lucene.analysis.boost
-
Factory for
DelimitedBoostTokenFilter
. - DelimitedBoostTokenFilterFactory() - Constructor for class org.apache.lucene.analysis.boost.DelimitedBoostTokenFilterFactory
-
Default ctor for compatibility with SPI
- DelimitedBoostTokenFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.boost.DelimitedBoostTokenFilterFactory
-
Creates a new DelimitedBoostTokenFilterFactory
- DelimitedPayloadTokenFilter - Class in org.apache.lucene.analysis.payloads
-
Characters before the delimiter are the "token", those after are the payload.
- DelimitedPayloadTokenFilter(TokenStream, char, PayloadEncoder) - Constructor for class org.apache.lucene.analysis.payloads.DelimitedPayloadTokenFilter
- DelimitedPayloadTokenFilterFactory - Class in org.apache.lucene.analysis.payloads
-
Factory for
DelimitedPayloadTokenFilter
. - DelimitedPayloadTokenFilterFactory() - Constructor for class org.apache.lucene.analysis.payloads.DelimitedPayloadTokenFilterFactory
-
Default ctor for compatibility with SPI
- DelimitedPayloadTokenFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.payloads.DelimitedPayloadTokenFilterFactory
-
Creates a new DelimitedPayloadTokenFilterFactory
- DelimitedTermFrequencyTokenFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Characters before the delimiter are the "token", the textual integer after is the term frequency.
- DelimitedTermFrequencyTokenFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.DelimitedTermFrequencyTokenFilter
- DelimitedTermFrequencyTokenFilter(TokenStream, char) - Constructor for class org.apache.lucene.analysis.miscellaneous.DelimitedTermFrequencyTokenFilter
- DelimitedTermFrequencyTokenFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for
DelimitedTermFrequencyTokenFilter
. - DelimitedTermFrequencyTokenFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.DelimitedTermFrequencyTokenFilterFactory
-
Default ctor for compatibility with SPI
- DelimitedTermFrequencyTokenFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.DelimitedTermFrequencyTokenFilterFactory
-
Creates a new DelimitedTermFrequencyTokenFilterFactory
- DELIMITER_ATTR - Static variable in class org.apache.lucene.analysis.boost.DelimitedBoostTokenFilterFactory
- DELIMITER_ATTR - Static variable in class org.apache.lucene.analysis.miscellaneous.DelimitedTermFrequencyTokenFilterFactory
- DELIMITER_ATTR - Static variable in class org.apache.lucene.analysis.payloads.DelimitedPayloadTokenFilterFactory
- DictEntries - Interface in org.apache.lucene.analysis.hunspell
-
An object representing homonym dictionary entries.
- DictEntry - Class in org.apache.lucene.analysis.hunspell
-
An object representing *.dic file entry with its word, flags and morphological data.
- dictionary - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- Dictionary - Class in org.apache.lucene.analysis.hunspell
-
In-memory structure for the dictionary (.dic) and affix (.aff) data of a hunspell dictionary.
- Dictionary(InputStream, List<InputStream>, boolean, SortingStrategy) - Constructor for class org.apache.lucene.analysis.hunspell.Dictionary
-
Creates a new Dictionary containing the information read from the provided InputStreams to hunspell affix and dictionary files.
- Dictionary(Directory, String, InputStream, InputStream) - Constructor for class org.apache.lucene.analysis.hunspell.Dictionary
-
Creates a new Dictionary containing the information read from the provided InputStreams to hunspell affix and dictionary files.
- Dictionary(Directory, String, InputStream, List<InputStream>, boolean) - Constructor for class org.apache.lucene.analysis.hunspell.Dictionary
-
Creates a new Dictionary containing the information read from the provided InputStreams to hunspell affix and dictionary files.
- DictionaryCompoundWordTokenFilter - Class in org.apache.lucene.analysis.compound
-
A
TokenFilter
that decomposes compound words found in many Germanic languages. - DictionaryCompoundWordTokenFilter(TokenStream, CharArraySet) - Constructor for class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilter
-
Creates a new
DictionaryCompoundWordTokenFilter
- DictionaryCompoundWordTokenFilter(TokenStream, CharArraySet, int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilter
-
Creates a new
DictionaryCompoundWordTokenFilter
- DictionaryCompoundWordTokenFilterFactory - Class in org.apache.lucene.analysis.compound
-
Factory for
DictionaryCompoundWordTokenFilter
. - DictionaryCompoundWordTokenFilterFactory() - Constructor for class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilterFactory
-
Default ctor for compatibility with SPI
- DictionaryCompoundWordTokenFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilterFactory
-
Creates a new DictionaryCompoundWordTokenFilterFactory
- DIGIT - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated.
- dimension() - Method in class org.apache.lucene.analysis.synonym.word2vec.Word2VecModel
- Dl4jModelReader - Class in org.apache.lucene.analysis.synonym.word2vec
-
Dl4jModelReader reads the file generated by the Deeplearning4j library and provides a Word2VecModel with normalized vectors.
- Dl4jModelReader(InputStream) - Constructor for class org.apache.lucene.analysis.synonym.word2vec.Dl4jModelReader
- doInform(ResourceLoader) - Method in class org.apache.lucene.analysis.miscellaneous.ConditionalTokenFilterFactory
-
Initialises this component with the corresponding
ResourceLoader
- doInform(ResourceLoader) - Method in class org.apache.lucene.analysis.miscellaneous.ProtectedTermFilterFactory
- DONE - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterIterator
-
Indicates the end of iteration
- DOTLESS_YEH - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- DOUBLE_TYPE - Static variable in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
when we emit a bigram, it's then marked as this type
- DropIfFlaggedFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Allows Tokens with a given combination of flags to be dropped.
- DropIfFlaggedFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.miscellaneous.DropIfFlaggedFilter
-
Construct a token stream filtering the given input.
- DropIfFlaggedFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Provides a filter that will drop tokens matching a set of flags.
- DropIfFlaggedFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.DropIfFlaggedFilterFactory
-
Default ctor for compatibility with SPI
- DropIfFlaggedFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.DropIfFlaggedFilterFactory
-
Initialize this factory via a set of key-value pairs.
- DutchAnalyzer - Class in org.apache.lucene.analysis.nl
-
Analyzer
for Dutch language. - DutchAnalyzer() - Constructor for class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Builds an analyzer with the default stop words (
DutchAnalyzer.getDefaultStopSet()
) and a few default entries for the stem exclusion table. - DutchAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.nl.DutchAnalyzer
- DutchAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.nl.DutchAnalyzer
- DutchAnalyzer(CharArraySet, CharArraySet, CharArrayMap<String>) - Constructor for class org.apache.lucene.analysis.nl.DutchAnalyzer
- DutchStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- DutchStemmer() - Constructor for class org.tartarus.snowball.ext.DutchStemmer
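The DecimalDigitFilter entry in this section describes folding every character in [:General_Category=Decimal_Number:] to a Basic Latin digit (0-9). The following is a minimal plain-Java sketch of that folding idea, assuming only the JDK's Character class. It is not Lucene's implementation and does not operate on a TokenStream; the class name DigitFoldingSketch and the method foldDigits are hypothetical names chosen for illustration.

```java
// Illustrative sketch only: hypothetical helper, not part of Lucene.
final class DigitFoldingSketch {
    private DigitFoldingSketch() {}

    /**
     * Returns a copy of text in which every code point whose Unicode general
     * category is Decimal_Number (Nd) is replaced by the matching ASCII digit.
     * All other code points are copied unchanged.
     */
    static String foldDigits(String text) {
        StringBuilder out = new StringBuilder(text.length());
        int i = 0;
        while (i < text.length()) {
            int cp = text.codePointAt(i);
            if (Character.getType(cp) == Character.DECIMAL_DIGIT_NUMBER) {
                // Character.digit yields the numeric value 0..9 for Nd code points.
                out.append((char) ('0' + Character.digit(cp, 10)));
            } else {
                out.appendCodePoint(cp);
            }
            // Advance by one or two chars so supplementary code points are handled.
            i += Character.charCount(cp);
        }
        return out.toString();
    }
}
```

Calling DigitFoldingSketch.foldDigits on a string of Arabic-Indic digits (U+0663, U+0664, U+0665) should give "345", while text that already uses ASCII digits passes through unchanged. In Lucene itself the filter is applied to tokens inside an analysis chain, so this sketch only mirrors the per-character mapping.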
E
- EdgeNGramFilterFactory - Class in org.apache.lucene.analysis.ngram
-
Creates new instances of
EdgeNGramTokenFilter
. - EdgeNGramFilterFactory() - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramFilterFactory
-
Default ctor for compatibility with SPI
- EdgeNGramFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramFilterFactory
-
Creates a new EdgeNGramFilterFactory
- EdgeNGramTokenFilter - Class in org.apache.lucene.analysis.ngram
-
Tokenizes the given token into n-grams of given size(s).
- EdgeNGramTokenFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
-
Creates an EdgeNGramTokenFilter that produces edge n-grams of the given size.
- EdgeNGramTokenFilter(TokenStream, int, int, boolean) - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
-
Creates an EdgeNGramTokenFilter that, for a given input term, produces all edge n-grams with lengths >= minGram and <= maxGram.
- EdgeNGramTokenizer - Class in org.apache.lucene.analysis.ngram
-
Tokenizes the input from an edge into n-grams of given size(s).
- EdgeNGramTokenizer(int, int) - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
-
Creates EdgeNGramTokenizer that can generate n-grams in the sizes of the given range
- EdgeNGramTokenizer(AttributeFactory, int, int) - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
-
Creates EdgeNGramTokenizer that can generate n-grams in the sizes of the given range
- EdgeNGramTokenizerFactory - Class in org.apache.lucene.analysis.ngram
-
Creates new instances of
EdgeNGramTokenizer
. - EdgeNGramTokenizerFactory() - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramTokenizerFactory
-
Default ctor for compatibility with SPI
- EdgeNGramTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramTokenizerFactory
-
Creates a new EdgeNGramTokenizerFactory
- ElisionFilter - Class in org.apache.lucene.analysis.util
-
Removes elisions from a TokenStream.
- ElisionFilter(TokenStream, CharArraySet) - Constructor for class org.apache.lucene.analysis.util.ElisionFilter
-
Constructs an elision filter with a Set of stop words
- ElisionFilterFactory - Class in org.apache.lucene.analysis.util
-
Factory for ElisionFilter.
- ElisionFilterFactory() - Constructor for class org.apache.lucene.analysis.util.ElisionFilterFactory
-
Default ctor for compatibility with SPI
- ElisionFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.util.ElisionFilterFactory
-
Creates a new ElisionFilterFactory
- EMAIL - Static variable in class org.apache.lucene.analysis.classic.ClassicTokenizer
- EMAIL - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
-
Email token type
- EMAIL_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- EMAIL_TYPE - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Email token type
- EMOJI - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
-
Emoji token type.
- EMOJI_TYPE - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Emoji token type
- EmptyTokenStream - Class in org.apache.lucene.analysis.miscellaneous
-
An always exhausted token stream.
- EmptyTokenStream() - Constructor for class org.apache.lucene.analysis.miscellaneous.EmptyTokenStream
- encode(char[]) - Method in class org.apache.lucene.analysis.payloads.AbstractEncoder
- encode(char[]) - Method in interface org.apache.lucene.analysis.payloads.PayloadEncoder
- encode(char[], int, int) - Method in class org.apache.lucene.analysis.payloads.FloatEncoder
- encode(char[], int, int) - Method in class org.apache.lucene.analysis.payloads.IdentityEncoder
- encode(char[], int, int) - Method in class org.apache.lucene.analysis.payloads.IntegerEncoder
- encode(char[], int, int) - Method in interface org.apache.lucene.analysis.payloads.PayloadEncoder
-
Convert a char array to a BytesRef
- encodeFloat(float) - Static method in class org.apache.lucene.analysis.payloads.PayloadHelper
- encodeFloat(float, byte[], int) - Static method in class org.apache.lucene.analysis.payloads.PayloadHelper
- encodeInt(int) - Static method in class org.apache.lucene.analysis.payloads.PayloadHelper
- encodeInt(int, byte[], int) - Static method in class org.apache.lucene.analysis.payloads.PayloadHelper
- ENCODER_ATTR - Static variable in class org.apache.lucene.analysis.payloads.DelimitedPayloadTokenFilterFactory
- end() - Method in class org.apache.lucene.analysis.classic.ClassicTokenizer
- end() - Method in class org.apache.lucene.analysis.core.FlattenGraphFilter
- end() - Method in class org.apache.lucene.analysis.core.KeywordTokenizer
- end() - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
- end() - Method in class org.apache.lucene.analysis.minhash.MinHashFilter
- end() - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter
- end() - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenatingTokenStream
- end() - Method in class org.apache.lucene.analysis.miscellaneous.ConditionalTokenFilter
- end() - Method in class org.apache.lucene.analysis.miscellaneous.FingerprintFilter
- end() - Method in class org.apache.lucene.analysis.miscellaneous.FixBrokenOffsetsFilter
-
Deprecated.
- end() - Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
- end() - Method in class org.apache.lucene.analysis.ngram.NGramTokenFilter
- end() - Method in class org.apache.lucene.analysis.ngram.NGramTokenizer
- end() - Method in class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- end() - Method in class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- end() - Method in class org.apache.lucene.analysis.pattern.PatternTokenizer
- end() - Method in class org.apache.lucene.analysis.pattern.SimplePatternSplitTokenizer
- end() - Method in class org.apache.lucene.analysis.pattern.SimplePatternTokenizer
- end() - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- end() - Method in class org.apache.lucene.analysis.sinks.TeeSinkTokenFilter
- end() - Method in class org.apache.lucene.analysis.sinks.TeeSinkTokenFilter.SinkTokenStream
- end() - Method in class org.apache.lucene.analysis.util.CharTokenizer
- end() - Method in class org.apache.lucene.analysis.util.SegmentingTokenizerBase
- end() - Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- endElement(String, String, String) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- endOffset - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase.CompoundToken
- endsWith(char[], int, char[]) - Static method in class org.apache.lucene.analysis.util.StemmerUtil
-
Returns true if the character array ends with the suffix.
- endsWith(char[], int, String) - Static method in class org.apache.lucene.analysis.util.StemmerUtil
-
Returns true if the character array ends with the suffix.
- endwhen() - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.ConditionBuilder
-
Close the branch and return to the main analysis chain
- ENGLISH_STOP_WORDS_SET - Static variable in class org.apache.lucene.analysis.en.EnglishAnalyzer
-
An unmodifiable set containing some common English words that are not usually useful for searching.
- EnglishAnalyzer - Class in org.apache.lucene.analysis.en
-
Analyzer for English.
- EnglishAnalyzer() - Constructor for class org.apache.lucene.analysis.en.EnglishAnalyzer
-
Builds an analyzer with the default stop words: EnglishAnalyzer.getDefaultStopSet().
- EnglishAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.en.EnglishAnalyzer
-
Builds an analyzer with the given stop words.
- EnglishAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.en.EnglishAnalyzer
-
Builds an analyzer with the given stop words.
- EnglishMinimalStemFilter - Class in org.apache.lucene.analysis.en
-
A TokenFilter that applies EnglishMinimalStemmer to stem English words.
- EnglishMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.en.EnglishMinimalStemFilter
- EnglishMinimalStemFilterFactory - Class in org.apache.lucene.analysis.en
-
Factory for EnglishMinimalStemFilter.
- EnglishMinimalStemFilterFactory() - Constructor for class org.apache.lucene.analysis.en.EnglishMinimalStemFilterFactory
-
Default ctor for compatibility with SPI
- EnglishMinimalStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.en.EnglishMinimalStemFilterFactory
-
Creates a new EnglishMinimalStemFilterFactory
- EnglishMinimalStemmer - Class in org.apache.lucene.analysis.en
-
Minimal plural stemmer for English.
- EnglishMinimalStemmer() - Constructor for class org.apache.lucene.analysis.en.EnglishMinimalStemmer
- EnglishPossessiveFilter - Class in org.apache.lucene.analysis.en
-
TokenFilter that removes possessives (trailing 's) from words.
- EnglishPossessiveFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.en.EnglishPossessiveFilter
- EnglishPossessiveFilterFactory - Class in org.apache.lucene.analysis.en
-
Factory for EnglishPossessiveFilter.
- EnglishPossessiveFilterFactory() - Constructor for class org.apache.lucene.analysis.en.EnglishPossessiveFilterFactory
-
Default ctor for compatibility with SPI
- EnglishPossessiveFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.en.EnglishPossessiveFilterFactory
-
Creates a new EnglishPossessiveFilterFactory
- EnglishStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- EnglishStemmer() - Constructor for class org.tartarus.snowball.ext.EnglishStemmer
- EntrySuggestion - Class in org.apache.lucene.analysis.hunspell
-
Suggestion to add/edit dictionary entries to generate a given list of words created by WordFormGenerator.compress(java.util.List<java.lang.String>, java.util.Set<java.lang.String>, java.lang.Runnable).
- eq - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Pointer to equal branch and to data when this node is a string terminator.
- eq_s(CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
- eq_s_b(CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
- equals(Object) - Method in class org.apache.lucene.analysis.hunspell.AffixedWord.Affix
- equals(Object) - Method in class org.apache.lucene.analysis.hunspell.AffixedWord
- equals(Object) - Method in class org.apache.lucene.analysis.hunspell.DictEntry
- equals(Object) - Method in class org.tartarus.snowball.ext.ArabicStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.ArmenianStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.BasqueStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.CatalanStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.DanishStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.DutchStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.EnglishStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.EstonianStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.FinnishStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.FrenchStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.German2Stemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.GermanStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.GreekStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.HindiStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.HungarianStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.IndonesianStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.IrishStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.ItalianStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.KpStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.LithuanianStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.LovinsStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.NepaliStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.NorwegianStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.PorterStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.PortugueseStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.RomanianStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.RussianStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.SerbianStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.SpanishStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.SwedishStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.TamilStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.TurkishStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.YiddishStemmer
- EstonianAnalyzer - Class in org.apache.lucene.analysis.et
-
Analyzer for Estonian.
- EstonianAnalyzer() - Constructor for class org.apache.lucene.analysis.et.EstonianAnalyzer
-
Builds an analyzer with the default stop words: EstonianAnalyzer.DEFAULT_STOPWORD_FILE.
- EstonianAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.et.EstonianAnalyzer
-
Builds an analyzer with the given stop words.
- EstonianAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.et.EstonianAnalyzer
-
Builds an analyzer with the given stop words.
- EstonianStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- EstonianStemmer() - Constructor for class org.tartarus.snowball.ext.EstonianStemmer
- EVERYTHING_POSSIBLE - Static variable in interface org.apache.lucene.analysis.hunspell.FragmentChecker
- exceptions - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSetExceptions
- exceptions - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSuffixExceptions
- EXTERNAL_LINK - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- EXTERNAL_LINK_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- EXTERNAL_LINK_URL - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- EXTERNAL_LINK_URL_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
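The EdgeNGramTokenFilter entries above describe producing, for a given input term, all edge n-grams with lengths >= minGram and <= maxGram. A minimal plain-Java sketch of that idea follows. It does not use Lucene: the class name `EdgeNGramSketch` and the method `edgeNGrams` are hypothetical, lengths are counted in code points as an assumption, and the boolean constructor argument listed in the index is not modelled.

```java
import java.util.ArrayList;
import java.util.List;

// Illustration only: a hypothetical helper, not Lucene's EdgeNGramTokenFilter.
final class EdgeNGramSketch {
    private EdgeNGramSketch() {}

    /**
     * Returns the leading-edge prefixes of term whose lengths, counted in
     * code points (an assumption of this sketch), lie in [minGram, maxGram].
     */
    static List<String> edgeNGrams(String term, int minGram, int maxGram) {
        if (minGram < 1 || minGram > maxGram) {
            throw new IllegalArgumentException("require 1 <= minGram <= maxGram");
        }
        int length = term.codePointCount(0, term.length());
        List<String> grams = new ArrayList<>();
        // Stop at the term length: a term shorter than minGram yields nothing.
        for (int n = minGram; n <= Math.min(maxGram, length); n++) {
            int end = term.offsetByCodePoints(0, n);
            grams.add(term.substring(0, end));
        }
        return grams;
    }
}
```

For example, `EdgeNGramSketch.edgeNGrams("lucene", 1, 3)` returns the list `[l, lu, luc]`, and a two-character term with `minGram = 3` returns an empty list.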
F
- FARSI_YEH - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- FATHA - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- FATHATAN - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- FEH - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- FilesystemResourceLoader - Class in org.apache.lucene.analysis.util
-
Simple ResourceLoader that opens resource files from the local file system, optionally resolving against a base directory.
- FilesystemResourceLoader(Path, ClassLoader) - Constructor for class org.apache.lucene.analysis.util.FilesystemResourceLoader
-
Creates a resource loader that resolves resources against the given base directory.
- FilesystemResourceLoader(Path, Module) - Constructor for class org.apache.lucene.analysis.util.FilesystemResourceLoader
-
Creates a resource loader that resolves resources against the given base directory.
- FilesystemResourceLoader(Path, ResourceLoader) - Constructor for class org.apache.lucene.analysis.util.FilesystemResourceLoader
-
Creates a resource loader that resolves resources against the given base directory.
- FILTER_ARG_SEPARATOR - Static variable in class org.apache.lucene.analysis.miscellaneous.ProtectedTermFilterFactory
- FILTER_NAME_ID_SEPARATOR - Static variable in class org.apache.lucene.analysis.miscellaneous.ProtectedTermFilterFactory
- find(char[], int) - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- find(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- find_among(Among[]) - Method in class org.tartarus.snowball.SnowballProgram
- find_among_b(Among[]) - Method in class org.tartarus.snowball.SnowballProgram
- findClass(String, Class<T>) - Method in class org.apache.lucene.analysis.util.FilesystemResourceLoader
- findPattern(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
- FingerprintFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Filter outputs a single token which is a concatenation of the sorted and de-duplicated set of input tokens.
- FingerprintFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.FingerprintFilter
-
Create a new FingerprintFilter with default settings
- FingerprintFilter(TokenStream, int, char) - Constructor for class org.apache.lucene.analysis.miscellaneous.FingerprintFilter
-
Create a new FingerprintFilter with control over all settings
- FingerprintFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for FingerprintFilter.
- FingerprintFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.FingerprintFilterFactory
-
Default ctor for compatibility with SPI
- FingerprintFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.FingerprintFilterFactory
-
Creates a new FingerprintFilterFactory
- FinnishAnalyzer - Class in org.apache.lucene.analysis.fi
-
Analyzer for Finnish.
- FinnishAnalyzer() - Constructor for class org.apache.lucene.analysis.fi.FinnishAnalyzer
-
Builds an analyzer with the default stop words: FinnishAnalyzer.DEFAULT_STOPWORD_FILE.
- FinnishAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.fi.FinnishAnalyzer
-
Builds an analyzer with the given stop words.
- FinnishAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.fi.FinnishAnalyzer
-
Builds an analyzer with the given stop words.
- FinnishLightStemFilter - Class in org.apache.lucene.analysis.fi
-
A TokenFilter that applies FinnishLightStemmer to stem Finnish words.
- FinnishLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.fi.FinnishLightStemFilter
- FinnishLightStemFilterFactory - Class in org.apache.lucene.analysis.fi
-
Factory for FinnishLightStemFilter.
- FinnishLightStemFilterFactory() - Constructor for class org.apache.lucene.analysis.fi.FinnishLightStemFilterFactory
-
Default ctor for compatibility with SPI
- FinnishLightStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.fi.FinnishLightStemFilterFactory
-
Creates a new FinnishLightStemFilterFactory
- FinnishLightStemmer - Class in org.apache.lucene.analysis.fi
-
Light Stemmer for Finnish.
- FinnishLightStemmer() - Constructor for class org.apache.lucene.analysis.fi.FinnishLightStemmer
- FinnishStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- FinnishStemmer() - Constructor for class org.tartarus.snowball.ext.FinnishStemmer
- first() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- FixBrokenOffsetsFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Deprecated. Fix the token filters that create broken offsets in the first place.
- FixBrokenOffsetsFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.FixBrokenOffsetsFilter
-
Deprecated.
- FixBrokenOffsetsFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Deprecated.
- FixBrokenOffsetsFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.FixBrokenOffsetsFilterFactory
-
Deprecated. Default ctor for compatibility with SPI
- FixBrokenOffsetsFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.FixBrokenOffsetsFilterFactory
-
Deprecated. Sole constructor
- FixedShingleFilter - Class in org.apache.lucene.analysis.shingle
-
A FixedShingleFilter constructs shingles (token n-grams) from a token stream.
- FixedShingleFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.shingle.FixedShingleFilter
-
Creates a FixedShingleFilter over an input token stream
- FixedShingleFilter(TokenStream, int, String, String) - Constructor for class org.apache.lucene.analysis.shingle.FixedShingleFilter
-
Creates a FixedShingleFilter over an input token stream
- FixedShingleFilterFactory - Class in org.apache.lucene.analysis.shingle
-
Factory for FixedShingleFilter
- FixedShingleFilterFactory() - Constructor for class org.apache.lucene.analysis.shingle.FixedShingleFilterFactory
-
Default ctor for compatibility with SPI
- FixedShingleFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.shingle.FixedShingleFilterFactory
- flagsToString(int) - Static method in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
-
Returns string representation of configuration flags
- FlattenGraphFilter - Class in org.apache.lucene.analysis.core
-
Converts an incoming graph token stream, such as one from SynonymGraphFilter, into a flat form so that all nodes form a single linear chain with no side paths.
- FlattenGraphFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.core.FlattenGraphFilter
- FlattenGraphFilterFactory - Class in org.apache.lucene.analysis.core
-
Factory for FlattenGraphFilter.
- FlattenGraphFilterFactory() - Constructor for class org.apache.lucene.analysis.core.FlattenGraphFilterFactory
-
Default ctor for compatibility with SPI
- FlattenGraphFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.core.FlattenGraphFilterFactory
-
Creates a new FlattenGraphFilterFactory
- FloatEncoder - Class in org.apache.lucene.analysis.payloads
-
Encode a character array Float as a BytesRef.
- FloatEncoder() - Constructor for class org.apache.lucene.analysis.payloads.FloatEncoder
- flush() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- foldToASCII(char[], int) - Method in class org.apache.lucene.analysis.miscellaneous.ASCIIFoldingFilter
-
Converts characters above ASCII to their ASCII equivalents.
- foldToASCII(char[], int, char[], int, int) - Static method in class org.apache.lucene.analysis.miscellaneous.ASCIIFoldingFilter
-
Converts characters above ASCII to their ASCII equivalents.
- FORCE_FIRST_LETTER - Static variable in class org.apache.lucene.analysis.miscellaneous.CapitalizationFilterFactory
- FORMAT_SNOWBALL - Static variable in class org.apache.lucene.analysis.en.AbstractWordsFileFilterFactory
- FORMAT_WORDSET - Static variable in class org.apache.lucene.analysis.en.AbstractWordsFileFilterFactory
- FragmentChecker - Interface in org.apache.lucene.analysis.hunspell
-
An oracle for quickly checking that a specific part of a word can never be a valid word.
- freeBefore(int) - Method in class org.apache.lucene.analysis.util.RollingCharBuffer
-
Call this to notify us that no chars before this absolute position are needed anymore.
- freenode - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- FrenchAnalyzer - Class in org.apache.lucene.analysis.fr
-
Analyzer for French language.
- FrenchAnalyzer() - Constructor for class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Builds an analyzer with the default stop words (FrenchAnalyzer.getDefaultStopSet()).
- FrenchAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Builds an analyzer with the given stop words
- FrenchAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Builds an analyzer with the given stop words
- FrenchLightStemFilter - Class in org.apache.lucene.analysis.fr
-
A TokenFilter that applies FrenchLightStemmer to stem French words.
- FrenchLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.fr.FrenchLightStemFilter
- FrenchLightStemFilterFactory - Class in org.apache.lucene.analysis.fr
-
Factory for FrenchLightStemFilter.
- FrenchLightStemFilterFactory() - Constructor for class org.apache.lucene.analysis.fr.FrenchLightStemFilterFactory
-
Default ctor for compatibility with SPI
- FrenchLightStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.fr.FrenchLightStemFilterFactory
-
Creates a new FrenchLightStemFilterFactory
- FrenchLightStemmer - Class in org.apache.lucene.analysis.fr
-
Light Stemmer for French.
- FrenchLightStemmer() - Constructor for class org.apache.lucene.analysis.fr.FrenchLightStemmer
- FrenchMinimalStemFilter - Class in org.apache.lucene.analysis.fr
-
A TokenFilter that applies FrenchMinimalStemmer to stem French words.
- FrenchMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.fr.FrenchMinimalStemFilter
- FrenchMinimalStemFilterFactory - Class in org.apache.lucene.analysis.fr
-
Factory for FrenchMinimalStemFilter.
- FrenchMinimalStemFilterFactory() - Constructor for class org.apache.lucene.analysis.fr.FrenchMinimalStemFilterFactory
-
Default ctor for compatibility with SPI
- FrenchMinimalStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.fr.FrenchMinimalStemFilterFactory
-
Creates a new FrenchMinimalStemFilterFactory
- FrenchMinimalStemmer - Class in org.apache.lucene.analysis.fr
-
Light Stemmer for French.
- FrenchMinimalStemmer() - Constructor for class org.apache.lucene.analysis.fr.FrenchMinimalStemmer
- FrenchStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- FrenchStemmer() - Constructor for class org.tartarus.snowball.ext.FrenchStemmer
- fromAllSimpleWords(int, Dictionary, Runnable) - Static method in class org.apache.lucene.analysis.hunspell.NGramFragmentChecker
-
Iterate the whole dictionary, derive all word forms (using WordFormGenerator), vary the case to get all words acceptable by the spellchecker, and create a fragment checker based on their n-grams.
- fromSeparatorCharPredicate(IntPredicate) - Static method in class org.apache.lucene.analysis.util.CharTokenizer
-
Creates a new instance of CharTokenizer using a custom predicate, supplied as method reference or lambda expression.
- fromSeparatorCharPredicate(AttributeFactory, IntPredicate) - Static method in class org.apache.lucene.analysis.util.CharTokenizer
-
Creates a new instance of CharTokenizer with the supplied attribute factory using a custom predicate, supplied as method reference or lambda expression.
- fromTokenCharPredicate(IntPredicate) - Static method in class org.apache.lucene.analysis.util.CharTokenizer
-
Creates a new instance of CharTokenizer using a custom predicate, supplied as method reference or lambda expression.
- fromTokenCharPredicate(AttributeFactory, IntPredicate) - Static method in class org.apache.lucene.analysis.util.CharTokenizer
-
Creates a new instance of CharTokenizer with the supplied attribute factory using a custom predicate, supplied as method reference or lambda expression.
- fromWords(int, Collection<? extends CharSequence>) - Static method in class org.apache.lucene.analysis.hunspell.NGramFragmentChecker
-
Create a fragment checker for n-grams found in the given words.
- fst - Variable in class org.apache.lucene.analysis.synonym.SynonymMap
-
map<input word, list<ord>>
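The FingerprintFilter entry above describes emitting a single token that concatenates the sorted, de-duplicated set of input tokens. The following plain-Java sketch illustrates that behaviour on a list of strings. It does not use Lucene: `FingerprintSketch` and `fingerprint` are hypothetical names, natural `String` ordering is an assumption of the sketch, and the `int` constructor argument listed in the index is not modelled.

```java
import java.util.List;
import java.util.TreeSet;

// Illustration only: a hypothetical helper, not Lucene's FingerprintFilter.
final class FingerprintSketch {
    private FingerprintSketch() {}

    /** Joins the sorted, de-duplicated tokens with the given separator. */
    static String fingerprint(List<String> tokens, char separator) {
        // TreeSet both removes duplicates and keeps natural String order.
        TreeSet<String> unique = new TreeSet<>(tokens);
        return String.join(String.valueOf(separator), unique);
    }
}
```

For example, `FingerprintSketch.fingerprint(List.of("the", "quick", "the", "brown"), ' ')` returns `"brown quick the"`.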
G
- GalicianAnalyzer - Class in org.apache.lucene.analysis.gl
-
Analyzer for Galician.
- GalicianAnalyzer() - Constructor for class org.apache.lucene.analysis.gl.GalicianAnalyzer
-
Builds an analyzer with the default stop words: GalicianAnalyzer.DEFAULT_STOPWORD_FILE.
- GalicianAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.gl.GalicianAnalyzer
-
Builds an analyzer with the given stop words.
- GalicianAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.gl.GalicianAnalyzer
-
Builds an analyzer with the given stop words.
- GalicianMinimalStemFilter - Class in org.apache.lucene.analysis.gl
-
A TokenFilter that applies GalicianMinimalStemmer to stem Galician words.
- GalicianMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.gl.GalicianMinimalStemFilter
- GalicianMinimalStemFilterFactory - Class in org.apache.lucene.analysis.gl
-
Factory for GalicianMinimalStemFilter.
- GalicianMinimalStemFilterFactory() - Constructor for class org.apache.lucene.analysis.gl.GalicianMinimalStemFilterFactory
-
Default ctor for compatibility with SPI
- GalicianMinimalStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.gl.GalicianMinimalStemFilterFactory
-
Creates a new GalicianMinimalStemFilterFactory
- GalicianMinimalStemmer - Class in org.apache.lucene.analysis.gl
-
Minimal Stemmer for Galician
- GalicianMinimalStemmer() - Constructor for class org.apache.lucene.analysis.gl.GalicianMinimalStemmer
- GalicianStemFilter - Class in org.apache.lucene.analysis.gl
-
A TokenFilter that applies GalicianStemmer to stem Galician words.
- GalicianStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.gl.GalicianStemFilter
- GalicianStemFilterFactory - Class in org.apache.lucene.analysis.gl
-
Factory for GalicianStemFilter.
- GalicianStemFilterFactory() - Constructor for class org.apache.lucene.analysis.gl.GalicianStemFilterFactory
-
Default ctor for compatibility with SPI
- GalicianStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.gl.GalicianStemFilterFactory
-
Creates a new GalicianStemFilterFactory
- GalicianStemmer - Class in org.apache.lucene.analysis.gl
-
Galician stemmer implementing "Regras do lematizador para o galego".
- GalicianStemmer() - Constructor for class org.apache.lucene.analysis.gl.GalicianStemmer
- GENERATE_NUMBER_PARTS - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated. Causes number subwords to be generated:
- GENERATE_NUMBER_PARTS - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
-
Causes number subwords to be generated:
- GENERATE_WORD_PARTS - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated. Causes parts of words to be generated:
- GENERATE_WORD_PARTS - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
-
Causes parts of words to be generated:
- generateAllSimpleWords(Consumer<AffixedWord>, Runnable) - Method in class org.apache.lucene.analysis.hunspell.WordFormGenerator
-
Traverse the whole dictionary and derive all word forms via affixation (as in WordFormGenerator.getAllWordForms(String, String, Runnable)) for each of the entries.
- German2Stemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- German2Stemmer() - Constructor for class org.tartarus.snowball.ext.German2Stemmer
- GermanAnalyzer - Class in org.apache.lucene.analysis.de
-
Analyzer for German language.
- GermanAnalyzer() - Constructor for class org.apache.lucene.analysis.de.GermanAnalyzer
-
Builds an analyzer with the default stop words: GermanAnalyzer.getDefaultStopSet().
- GermanAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.de.GermanAnalyzer
-
Builds an analyzer with the given stop words
- GermanAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.de.GermanAnalyzer
-
Builds an analyzer with the given stop words
- GermanLightStemFilter - Class in org.apache.lucene.analysis.de
-
A TokenFilter that applies GermanLightStemmer to stem German words.
- GermanLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.de.GermanLightStemFilter
- GermanLightStemFilterFactory - Class in org.apache.lucene.analysis.de
-
Factory for GermanLightStemFilter.
- GermanLightStemFilterFactory() - Constructor for class org.apache.lucene.analysis.de.GermanLightStemFilterFactory
-
Default ctor for compatibility with SPI
- GermanLightStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.de.GermanLightStemFilterFactory
-
Creates a new GermanLightStemFilterFactory
- GermanLightStemmer - Class in org.apache.lucene.analysis.de
-
Light Stemmer for German.
- GermanLightStemmer() - Constructor for class org.apache.lucene.analysis.de.GermanLightStemmer
- GermanMinimalStemFilter - Class in org.apache.lucene.analysis.de
-
A TokenFilter that applies GermanMinimalStemmer to stem German words.
- GermanMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.de.GermanMinimalStemFilter
- GermanMinimalStemFilterFactory - Class in org.apache.lucene.analysis.de
-
Factory for GermanMinimalStemFilter.
- GermanMinimalStemFilterFactory() - Constructor for class org.apache.lucene.analysis.de.GermanMinimalStemFilterFactory
-
Default ctor for compatibility with SPI
- GermanMinimalStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.de.GermanMinimalStemFilterFactory
-
Creates a new GermanMinimalStemFilterFactory
- GermanMinimalStemmer - Class in org.apache.lucene.analysis.de
-
Minimal Stemmer for German.
- GermanMinimalStemmer() - Constructor for class org.apache.lucene.analysis.de.GermanMinimalStemmer
- GermanNormalizationFilter - Class in org.apache.lucene.analysis.de
-
Normalizes German characters according to the heuristics of the German2 snowball algorithm.
- GermanNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.de.GermanNormalizationFilter
- GermanNormalizationFilterFactory - Class in org.apache.lucene.analysis.de
-
Factory for GermanNormalizationFilter.
- GermanNormalizationFilterFactory() - Constructor for class org.apache.lucene.analysis.de.GermanNormalizationFilterFactory
-
Default ctor for compatibility with SPI
- GermanNormalizationFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.de.GermanNormalizationFilterFactory
-
Creates a new GermanNormalizationFilterFactory
- GermanStemFilter - Class in org.apache.lucene.analysis.de
-
A TokenFilter that stems German words.
- GermanStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.de.GermanStemFilter
-
Creates a GermanStemFilter instance
- GermanStemFilterFactory - Class in org.apache.lucene.analysis.de
-
Factory for GermanStemFilter.
- GermanStemFilterFactory() - Constructor for class org.apache.lucene.analysis.de.GermanStemFilterFactory
-
Default ctor for compatibility with SPI
- GermanStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.de.GermanStemFilterFactory
-
Creates a new GermanStemFilterFactory
- GermanStemmer - Class in org.apache.lucene.analysis.de
-
A stemmer for German words.
- GermanStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- GermanStemmer() - Constructor for class org.apache.lucene.analysis.de.GermanStemmer
- GermanStemmer() - Constructor for class org.tartarus.snowball.ext.GermanStemmer
- get(char[], int, FST.Arc<BytesRef>, FST.BytesReader) - Method in class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter.StemmerOverrideMap
-
Returns the value mapped to the given key or null if the key is not in the FST dictionary.
- get(int) - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
- get(int) - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
- get(int) - Method in class org.apache.lucene.analysis.util.RollingCharBuffer
- get(int, int) - Method in class org.apache.lucene.analysis.util.RollingCharBuffer
- getAllWordForms(String) - Method in class org.apache.lucene.analysis.hunspell.Hunspell
-
Generate all word forms for all dictionary entries with the given root word.
- getAllWordForms(String, Runnable) - Method in class org.apache.lucene.analysis.hunspell.WordFormGenerator
-
Generate all word forms for all dictionary entries with the given root word.
- getAllWordForms(String, String, Runnable) - Method in class org.apache.lucene.analysis.hunspell.WordFormGenerator
-
Generate all word forms for the given root pretending it has the given flags (in the same format as the dictionary uses).
- getArray() - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
- getArray() - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
- getArray() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- getBeginIndex() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- getBytesReader() - Method in class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter.StemmerOverrideMap
-
Returns a FST.BytesReader to pass to the StemmerOverrideFilter.StemmerOverrideMap.get(char[], int, FST.Arc, FST.BytesReader) method.
- getBytesRef() - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter.BytesRefBuilderTermAttributeImpl
- getBytesRef() - Method in class org.apache.lucene.collation.tokenattributes.CollatedTermAttributeImpl
- getCharacter(Map<String, String>, String, Character) - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilterFactory
- getCharFilterFactories() - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer
-
Returns the list of char filters that are used in this analyzer.
- getCommonWords() - Method in class org.apache.lucene.analysis.commongrams.CommonGramsFilterFactory
- getCurrent() - Method in class org.tartarus.snowball.SnowballProgram
-
Get the current string.
- getCurrentBuffer() - Method in class org.tartarus.snowball.SnowballProgram
-
Get the current buffer containing the stem.
- getCurrentBufferLength() - Method in class org.tartarus.snowball.SnowballProgram
-
Get the valid length of the character array in SnowballProgram.getCurrentBuffer().
- getDataFormat(String) - Method in class org.apache.lucene.analysis.miscellaneous.DateRecognizerFilterFactory
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.bg.BulgarianAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.bn.BengaliAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ca.CatalanAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.cjk.CJKAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ckb.SoraniAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.cz.CzechAnalyzer
-
Returns a set of default Czech-stopwords
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.da.DanishAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.de.GermanAnalyzer
-
Returns a set of default German-stopwords
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.el.GreekAnalyzer
-
Returns a set of default Greek-stopwords
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.en.EnglishAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.es.SpanishAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.et.EstonianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.eu.BasqueAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.fa.PersianAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.fi.FinnishAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ga.IrishAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.gl.GalicianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.hi.HindiAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.hu.HungarianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.hy.ArmenianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.id.IndonesianAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.it.ItalianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.lt.LithuanianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.lv.LatvianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ne.NepaliAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.no.NorwegianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ru.RussianAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.sr.SerbianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ta.TamilAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.te.TeluguAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.th.ThaiAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.tr.TurkishAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDictEntry() - Method in class org.apache.lucene.analysis.hunspell.AffixedWord
- getEndIndex() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- getEntriesToAdd() - Method in class org.apache.lucene.analysis.hunspell.EntrySuggestion
- getEntriesToEdit() - Method in class org.apache.lucene.analysis.hunspell.EntrySuggestion
- getExceptionWord(ArrayList<?>) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- getExtraGeneratedWords() - Method in class org.apache.lucene.analysis.hunspell.EntrySuggestion
- getFillerToken() - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- getFlag() - Method in class org.apache.lucene.analysis.hunspell.AffixedWord.Affix
- getFlags() - Method in class org.apache.lucene.analysis.hunspell.DictEntry
- getFlags() - Method in class org.apache.lucene.analysis.pattern.PatternTypingFilter.PatternTypingRule
- getFormat() - Method in class org.apache.lucene.analysis.en.AbstractWordsFileFilterFactory
- getHyphenationPoints() - Method in class org.apache.lucene.analysis.compound.hyphenation.Hyphenation
- getHyphenationTree(String) - Static method in class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Create a hyphenator tree
- getHyphenationTree(InputSource) - Static method in class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Create a hyphenator tree
- getIgnoreCase() - Method in class org.apache.lucene.analysis.hunspell.Dictionary
-
Returns true if this dictionary was constructed with the ignoreCase option
- getIndex() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- getInterletterValues(String) - Static method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- getLastCumulativeDiff() - Method in class org.apache.lucene.analysis.charfilter.BaseCharFilter
- getLength() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- getMaxLookaheadUsed() - Method in class org.apache.lucene.analysis.core.FlattenGraphFilter
-
For testing
- getMaxShingleSize() - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
The max shingle (token ngram) size
- getMaxTokenLength() - Method in class org.apache.lucene.analysis.classic.ClassicAnalyzer
- getMaxTokenLength() - Method in class org.apache.lucene.analysis.classic.ClassicTokenizer
- getMaxTokenLength() - Method in class org.apache.lucene.analysis.email.UAX29URLEmailAnalyzer
- getMaxTokenLength() - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
- getMinShingleSize() - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
The min shingle (token ngram) size
- getMorphologicalData() - Method in class org.apache.lucene.analysis.hunspell.DictEntry
- getMorphologicalData(int) - Method in interface org.apache.lucene.analysis.hunspell.DictEntries
-
Same as get(entryIndex).getMorphologicalData()
- getMorphologicalValues(int, String) - Method in interface org.apache.lucene.analysis.hunspell.DictEntries
-
Same as get(entryIndex).getMorphologicalValues(key)
- getMorphologicalValues(String) - Method in class org.apache.lucene.analysis.hunspell.DictEntry
- getNextToken() - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Resumes scanning until the next regular expression is matched, the end of input is encountered or an I/O-Error occurs.
- getOffsetGap(String) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer
- getPartialResult() - Method in exception org.apache.lucene.analysis.hunspell.SuggestionTimeoutException
- getPattern() - Method in class org.apache.lucene.analysis.pattern.PatternTypingFilter.PatternTypingRule
- getPattern(String) - Static method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- getPositionIncrementGap(String) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer
- getPrefixes() - Method in class org.apache.lucene.analysis.hunspell.AffixedWord
- getProtectedTerms() - Method in class org.apache.lucene.analysis.miscellaneous.ProtectedTermFilterFactory
- getRoots(String) - Method in class org.apache.lucene.analysis.hunspell.Hunspell
-
Find all roots that could result in the given word after case conversion and adding affixes.
- getStart() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- getStem() - Method in class org.apache.lucene.analysis.hunspell.DictEntry
- getStopTypes() - Method in class org.apache.lucene.analysis.core.TypeTokenFilterFactory
- getStopWords() - Method in class org.apache.lucene.analysis.core.StopFilterFactory
- getStopWords() - Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Provides information on which stop words have been identified for all fields
- getStopWords(String) - Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Provides information on which stop words have been identified for a field
- getSuffixes() - Method in class org.apache.lucene.analysis.hunspell.AffixedWord
- getSynonymProvider(ResourceLoader, String, Word2VecSynonymProviderFactory.Word2VecSupportedFormats) - Static method in class org.apache.lucene.analysis.synonym.word2vec.Word2VecSynonymProviderFactory
- getSynonyms(BytesRef, int, float) - Method in class org.apache.lucene.analysis.synonym.word2vec.Word2VecSynonymProvider
- getText() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- getText(CharTermAttribute) - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Fills CharTermAttribute with the current token text.
- getTokenFilterFactories() - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer
-
Returns the list of token filters that are used in this analyzer.
- getTokenizerFactory() - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer
-
Returns the tokenizer that is used in this analyzer.
- getTokenSeparator() - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- getType(int) - Static method in class org.apache.lucene.analysis.miscellaneous.WordDelimiterIterator
-
Computes the type of the given character
- getTypeTemplate() - Method in class org.apache.lucene.analysis.pattern.PatternTypingFilter.PatternTypingRule
- getValue() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
- getValues(int) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
- getWord() - Method in class org.apache.lucene.analysis.hunspell.AffixedWord
- getWordFiles() - Method in class org.apache.lucene.analysis.en.AbstractWordsFileFilterFactory
- getWords() - Method in class org.apache.lucene.analysis.en.AbstractWordsFileFilterFactory
- getWrappedAnalyzer(String) - Method in class org.apache.lucene.analysis.miscellaneous.LimitTokenCountAnalyzer
- getWrappedAnalyzer(String) - Method in class org.apache.lucene.analysis.miscellaneous.PerFieldAnalyzerWrapper
- getWrappedAnalyzer(String) - Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
- getWrappedAnalyzer(String) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- GRAM_TYPE - Static variable in class org.apache.lucene.analysis.commongrams.CommonGramsFilter
- GreekAnalyzer - Class in org.apache.lucene.analysis.el
-
Analyzer for the Greek language.
- GreekAnalyzer() - Constructor for class org.apache.lucene.analysis.el.GreekAnalyzer
-
Builds an analyzer with the default stop words.
- GreekAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.el.GreekAnalyzer
-
Builds an analyzer with the given stop words.
- GreekLowerCaseFilter - Class in org.apache.lucene.analysis.el
-
Normalizes token text to lower case, removes some Greek diacritics, and standardizes final sigma to sigma.
- GreekLowerCaseFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.el.GreekLowerCaseFilter
-
Create a GreekLowerCaseFilter that normalizes Greek token text.
- GreekLowerCaseFilterFactory - Class in org.apache.lucene.analysis.el
-
Factory for GreekLowerCaseFilter.
- GreekLowerCaseFilterFactory() - Constructor for class org.apache.lucene.analysis.el.GreekLowerCaseFilterFactory
-
Default ctor for compatibility with SPI
- GreekLowerCaseFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.el.GreekLowerCaseFilterFactory
-
Creates a new GreekLowerCaseFilterFactory
- GreekStemFilter - Class in org.apache.lucene.analysis.el
-
A TokenFilter that applies GreekStemmer to stem Greek words.
- GreekStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.el.GreekStemFilter
- GreekStemFilterFactory - Class in org.apache.lucene.analysis.el
-
Factory for GreekStemFilter.
- GreekStemFilterFactory() - Constructor for class org.apache.lucene.analysis.el.GreekStemFilterFactory
-
Default ctor for compatibility with SPI
- GreekStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.el.GreekStemFilterFactory
-
Creates a new GreekStemFilterFactory
- GreekStemmer - Class in org.apache.lucene.analysis.el
-
A stemmer for Greek words, according to: Development of a Stemmer for the Greek Language. Georgios Ntais
- GreekStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- GreekStemmer() - Constructor for class org.apache.lucene.analysis.el.GreekStemmer
- GreekStemmer() - Constructor for class org.tartarus.snowball.ext.GreekStemmer
- group - Variable in class org.apache.lucene.analysis.pattern.PatternTokenizerFactory
- GROUP - Static variable in class org.apache.lucene.analysis.pattern.PatternTokenizerFactory
H
- HAMZA_ABOVE - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- HAN - Static variable in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
bigram flag for Han Ideographs
- HANGUL - Static variable in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
bigram flag for Hangul
- HANGUL - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
-
Hangul token type
- HANGUL_TYPE - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Hangul token type
- HAS_BUGGY_BREAKITERATORS - Static variable in class org.apache.lucene.analysis.util.CharArrayIterator
-
True if this JRE has a buggy BreakIterator implementation
- hashCode() - Method in class org.apache.lucene.analysis.hunspell.AffixedWord.Affix
- hashCode() - Method in class org.apache.lucene.analysis.hunspell.AffixedWord
- hashCode() - Method in class org.apache.lucene.analysis.hunspell.DictEntry
- hashCode() - Method in class org.tartarus.snowball.ext.ArabicStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.ArmenianStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.BasqueStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.CatalanStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.DanishStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.DutchStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.EnglishStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.EstonianStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.FinnishStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.FrenchStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.German2Stemmer
- hashCode() - Method in class org.tartarus.snowball.ext.GermanStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.GreekStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.HindiStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.HungarianStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.IndonesianStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.IrishStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.ItalianStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.KpStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.LithuanianStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.NepaliStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.NorwegianStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.PorterStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.PortugueseStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.RomanianStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.RussianStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.SerbianStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.SpanishStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.SwedishStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.TamilStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.TurkishStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.YiddishStemmer
- hashFactor() - Method in class org.apache.lucene.analysis.hunspell.Dictionary
-
The factor determining the size of the internal hash table used for storing the entries.
- hasImpossibleFragmentAround(CharSequence, int, int) - Method in interface org.apache.lucene.analysis.hunspell.FragmentChecker
-
Check if the given word range intersects any fragment which is impossible in the current language.
- hasImpossibleFragmentAround(CharSequence, int, int) - Method in class org.apache.lucene.analysis.hunspell.NGramFragmentChecker
- hasMoreElements() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
- HEADING - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- HEADING_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- HEH - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- HEH - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- HEH - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- HEH_GOAL - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- HEH_YEH - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- hi - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Pointer to high branch.
- HindiAnalyzer - Class in org.apache.lucene.analysis.hi
-
Analyzer for Hindi.
- HindiAnalyzer() - Constructor for class org.apache.lucene.analysis.hi.HindiAnalyzer
-
Builds an analyzer with the default stop words: HindiAnalyzer.DEFAULT_STOPWORD_FILE.
- HindiAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.hi.HindiAnalyzer
-
Builds an analyzer with the given stop words
- HindiAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.hi.HindiAnalyzer
-
Builds an analyzer with the given stop words
- HindiNormalizationFilter - Class in org.apache.lucene.analysis.hi
-
A TokenFilter that applies HindiNormalizer to normalize the orthography.
- HindiNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.hi.HindiNormalizationFilter
- HindiNormalizationFilterFactory - Class in org.apache.lucene.analysis.hi
-
Factory for HindiNormalizationFilter.
- HindiNormalizationFilterFactory() - Constructor for class org.apache.lucene.analysis.hi.HindiNormalizationFilterFactory
-
Default ctor for compatibility with SPI
- HindiNormalizationFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.hi.HindiNormalizationFilterFactory
-
Creates a new HindiNormalizationFilterFactory
- HindiNormalizer - Class in org.apache.lucene.analysis.hi
-
Normalizer for Hindi.
- HindiNormalizer() - Constructor for class org.apache.lucene.analysis.hi.HindiNormalizer
- HindiStemFilter - Class in org.apache.lucene.analysis.hi
-
A TokenFilter that applies HindiStemmer to stem Hindi words.
- HindiStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.hi.HindiStemFilter
- HindiStemFilterFactory - Class in org.apache.lucene.analysis.hi
-
Factory for HindiStemFilter.
- HindiStemFilterFactory() - Constructor for class org.apache.lucene.analysis.hi.HindiStemFilterFactory
-
Default ctor for compatibility with SPI
- HindiStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.hi.HindiStemFilterFactory
-
Creates a new HindiStemFilterFactory
- HindiStemmer - Class in org.apache.lucene.analysis.hi
-
Light Stemmer for Hindi.
- HindiStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- HindiStemmer() - Constructor for class org.apache.lucene.analysis.hi.HindiStemmer
- HindiStemmer() - Constructor for class org.tartarus.snowball.ext.HindiStemmer
- HIRAGANA - Static variable in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
bigram flag for Hiragana
- HIRAGANA - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
-
Hiragana token type
- HIRAGANA_TYPE - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Hiragana token type
- HOST - Static variable in class org.apache.lucene.analysis.classic.ClassicTokenizer
- HOST_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- hstrcmp(char[], int, char[], int) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
String compare, returns 0 if equal or t is a substring of s
- HTMLStripCharFilter - Class in org.apache.lucene.analysis.charfilter
-
A CharFilter that wraps another Reader and attempts to strip out HTML constructs.
- HTMLStripCharFilter(Reader) - Constructor for class org.apache.lucene.analysis.charfilter.HTMLStripCharFilter
-
Creates a new scanner
- HTMLStripCharFilter(Reader, Set<String>) - Constructor for class org.apache.lucene.analysis.charfilter.HTMLStripCharFilter
-
Creates a new HTMLStripCharFilter over the provided Reader with the specified start and end tags.
- HTMLStripCharFilterFactory - Class in org.apache.lucene.analysis.charfilter
-
Factory for HTMLStripCharFilter.
- HTMLStripCharFilterFactory() - Constructor for class org.apache.lucene.analysis.charfilter.HTMLStripCharFilterFactory
-
Default ctor for compatibility with SPI
- HTMLStripCharFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.charfilter.HTMLStripCharFilterFactory
-
Creates a new HTMLStripCharFilterFactory
- HungarianAnalyzer - Class in org.apache.lucene.analysis.hu
-
Analyzer for Hungarian.
- HungarianAnalyzer() - Constructor for class org.apache.lucene.analysis.hu.HungarianAnalyzer
-
Builds an analyzer with the default stop words: HungarianAnalyzer.DEFAULT_STOPWORD_FILE.
- HungarianAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.hu.HungarianAnalyzer
-
Builds an analyzer with the given stop words.
- HungarianAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.hu.HungarianAnalyzer
-
Builds an analyzer with the given stop words.
- HungarianLightStemFilter - Class in org.apache.lucene.analysis.hu
-
A TokenFilter that applies HungarianLightStemmer to stem Hungarian words.
- HungarianLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.hu.HungarianLightStemFilter
- HungarianLightStemFilterFactory - Class in org.apache.lucene.analysis.hu
-
Factory for HungarianLightStemFilter.
- HungarianLightStemFilterFactory() - Constructor for class org.apache.lucene.analysis.hu.HungarianLightStemFilterFactory
-
Default ctor for compatibility with SPI
- HungarianLightStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.hu.HungarianLightStemFilterFactory
-
Creates a new HungarianLightStemFilterFactory
- HungarianLightStemmer - Class in org.apache.lucene.analysis.hu
-
Light Stemmer for Hungarian.
- HungarianLightStemmer() - Constructor for class org.apache.lucene.analysis.hu.HungarianLightStemmer
- HungarianStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- HungarianStemmer() - Constructor for class org.tartarus.snowball.ext.HungarianStemmer
- Hunspell - Class in org.apache.lucene.analysis.hunspell
-
A spell checker based on Hunspell dictionaries.
- Hunspell(Dictionary) - Constructor for class org.apache.lucene.analysis.hunspell.Hunspell
- Hunspell(Dictionary, TimeoutPolicy, Runnable) - Constructor for class org.apache.lucene.analysis.hunspell.Hunspell
- HunspellStemFilter - Class in org.apache.lucene.analysis.hunspell
-
TokenFilter that uses hunspell affix rules and words to stem tokens.
- HunspellStemFilter(TokenStream, Dictionary) - Constructor for class org.apache.lucene.analysis.hunspell.HunspellStemFilter
-
Create a HunspellStemFilter outputting all possible stems.
- HunspellStemFilter(TokenStream, Dictionary, boolean) - Constructor for class org.apache.lucene.analysis.hunspell.HunspellStemFilter
-
Create a HunspellStemFilter outputting all possible stems.
- HunspellStemFilter(TokenStream, Dictionary, boolean, boolean) - Constructor for class org.apache.lucene.analysis.hunspell.HunspellStemFilter
-
Creates a new HunspellStemFilter that will stem tokens from the given TokenStream using affix rules in the provided Dictionary
- HunspellStemFilterFactory - Class in org.apache.lucene.analysis.hunspell
-
TokenFilterFactory that creates instances of HunspellStemFilter.
- HunspellStemFilterFactory() - Constructor for class org.apache.lucene.analysis.hunspell.HunspellStemFilterFactory
-
Default ctor for compatibility with SPI
- HunspellStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.hunspell.HunspellStemFilterFactory
-
Creates a new HunspellStemFilterFactory
- Hyphen - Class in org.apache.lucene.analysis.compound.hyphenation
-
This class represents a hyphen.
- hyphenate(char[], int, int, int, int) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Hyphenate word and return an array of hyphenation points.
- hyphenate(String, int, int) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Hyphenate word and return a Hyphenation object.
- HyphenatedWordsFilter - Class in org.apache.lucene.analysis.miscellaneous
-
When the plain text is extracted from documents, we will often have many words hyphenated and broken into two lines.
- HyphenatedWordsFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.HyphenatedWordsFilter
-
Creates a new HyphenatedWordsFilter
- HyphenatedWordsFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for HyphenatedWordsFilter.
- HyphenatedWordsFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.HyphenatedWordsFilterFactory
-
Default ctor for compatibility with SPI
- HyphenatedWordsFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.HyphenatedWordsFilterFactory
-
Creates a new HyphenatedWordsFilterFactory
- Hyphenation - Class in org.apache.lucene.analysis.compound.hyphenation
-
This class represents a hyphenated word.
- HyphenationCompoundWordTokenFilter - Class in org.apache.lucene.analysis.compound
-
A TokenFilter that decomposes compound words found in many Germanic languages.
- HyphenationCompoundWordTokenFilter(TokenStream, HyphenationTree) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Create a HyphenationCompoundWordTokenFilter with no dictionary.
- HyphenationCompoundWordTokenFilter(TokenStream, HyphenationTree, int, int, int) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Create a HyphenationCompoundWordTokenFilter with no dictionary.
- HyphenationCompoundWordTokenFilter(TokenStream, HyphenationTree, CharArraySet) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Creates a new HyphenationCompoundWordTokenFilter instance.
- HyphenationCompoundWordTokenFilter(TokenStream, HyphenationTree, CharArraySet, int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Creates a new HyphenationCompoundWordTokenFilter instance.
- HyphenationCompoundWordTokenFilter(TokenStream, HyphenationTree, CharArraySet, int, int, int, boolean, boolean, boolean) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Creates a new HyphenationCompoundWordTokenFilter instance.
- HyphenationCompoundWordTokenFilterFactory - Class in org.apache.lucene.analysis.compound
-
Factory for HyphenationCompoundWordTokenFilter.
- HyphenationCompoundWordTokenFilterFactory() - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilterFactory
-
Default ctor for compatibility with SPI
- HyphenationCompoundWordTokenFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilterFactory
-
Creates a new HyphenationCompoundWordTokenFilterFactory
- HyphenationTree - Class in org.apache.lucene.analysis.compound.hyphenation
-
This tree structure stores the hyphenation patterns in an efficient way for fast lookup.
- HyphenationTree() - Constructor for class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
I
- IdentityEncoder - Class in org.apache.lucene.analysis.payloads
-
Does nothing other than convert the char array to a byte array using the specified encoding.
- IdentityEncoder() - Constructor for class org.apache.lucene.analysis.payloads.IdentityEncoder
- IdentityEncoder(Charset) - Constructor for class org.apache.lucene.analysis.payloads.IdentityEncoder
- IDEOGRAPHIC - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
-
Ideographic token type
- IDEOGRAPHIC_TYPE - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Ideographic token type
- IGNORE_KEYWORDS - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated. Suppresses processing terms with KeywordAttribute.isKeyword()=true.
- IGNORE_KEYWORDS - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
-
Suppresses processing terms with KeywordAttribute.isKeyword()=true.
- in_grouping(char[], int, int) - Method in class org.tartarus.snowball.SnowballProgram
- in_grouping_b(char[], int, int) - Method in class org.tartarus.snowball.SnowballProgram
- incrementToken() - Method in class org.apache.lucene.analysis.ar.ArabicNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ar.ArabicStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.bg.BulgarianStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.bn.BengaliNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.bn.BengaliStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.boost.DelimitedBoostTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.br.BrazilianStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.cjk.CJKBigramFilter
- incrementToken() - Method in class org.apache.lucene.analysis.cjk.CJKWidthFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ckb.SoraniNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ckb.SoraniStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.classic.ClassicFilter
-
Advances to the next token in the stream, returning false at EOS.
- incrementToken() - Method in class org.apache.lucene.analysis.classic.ClassicTokenizer
- incrementToken() - Method in class org.apache.lucene.analysis.commongrams.CommonGramsFilter
-
Inserts bigrams for common words into a token stream.
- incrementToken() - Method in class org.apache.lucene.analysis.commongrams.CommonGramsQueryFilter
-
Output bigrams whenever possible to optimize queries.
- incrementToken() - Method in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- incrementToken() - Method in class org.apache.lucene.analysis.core.DecimalDigitFilter
- incrementToken() - Method in class org.apache.lucene.analysis.core.FlattenGraphFilter
- incrementToken() - Method in class org.apache.lucene.analysis.core.KeywordTokenizer
- incrementToken() - Method in class org.apache.lucene.analysis.core.UpperCaseFilter
- incrementToken() - Method in class org.apache.lucene.analysis.cz.CzechStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.de.GermanLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.de.GermanMinimalStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.de.GermanNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.de.GermanStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.el.GreekLowerCaseFilter
- incrementToken() - Method in class org.apache.lucene.analysis.el.GreekStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
- incrementToken() - Method in class org.apache.lucene.analysis.en.EnglishMinimalStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.en.EnglishPossessiveFilter
- incrementToken() - Method in class org.apache.lucene.analysis.en.KStemFilter
-
Returns the next, stemmed, input Token.
- incrementToken() - Method in class org.apache.lucene.analysis.en.PorterStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.es.SpanishLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.es.SpanishMinimalStemFilter
-
Deprecated.
- incrementToken() - Method in class org.apache.lucene.analysis.es.SpanishPluralStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.fa.PersianNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.fa.PersianStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.fi.FinnishLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.fr.FrenchLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.fr.FrenchMinimalStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ga.IrishLowerCaseFilter
- incrementToken() - Method in class org.apache.lucene.analysis.gl.GalicianMinimalStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.gl.GalicianStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.hi.HindiNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.hi.HindiStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.hu.HungarianLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.hunspell.HunspellStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.id.IndonesianStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.in.IndicNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.it.ItalianLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.lv.LatvianStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.minhash.MinHashFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.ASCIIFoldingFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.CapitalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenatingTokenStream
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.ConditionalTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.DelimitedTermFrequencyTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.EmptyTokenStream
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.FingerprintFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.FixBrokenOffsetsFilter
-
Deprecated.
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.HyphenatedWordsFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.KeywordMarkerFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.KeywordRepeatFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.LimitTokenCountFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.LimitTokenOffsetFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.LimitTokenPositionFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.RemoveDuplicatesTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.ScandinavianFoldingFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.TrimFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.TruncateTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.TypeAsSynonymFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated.
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ngram.NGramTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ngram.NGramTokenizer
- incrementToken() - Method in class org.apache.lucene.analysis.no.NorwegianLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.no.NorwegianMinimalStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.no.NorwegianNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- incrementToken() - Method in class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- incrementToken() - Method in class org.apache.lucene.analysis.pattern.PatternCaptureGroupTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.pattern.PatternReplaceFilter
- incrementToken() - Method in class org.apache.lucene.analysis.pattern.PatternTokenizer
- incrementToken() - Method in class org.apache.lucene.analysis.pattern.PatternTypingFilter
- incrementToken() - Method in class org.apache.lucene.analysis.pattern.SimplePatternSplitTokenizer
- incrementToken() - Method in class org.apache.lucene.analysis.pattern.SimplePatternTokenizer
- incrementToken() - Method in class org.apache.lucene.analysis.payloads.DelimitedPayloadTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.payloads.NumericPayloadTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.payloads.TokenOffsetPayloadTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.payloads.TypeAsPayloadTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.pt.PortugueseLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.pt.PortugueseMinimalStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.pt.PortugueseStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ru.RussianLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.shingle.FixedShingleFilter
- incrementToken() - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- incrementToken() - Method in class org.apache.lucene.analysis.sinks.TeeSinkTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.sinks.TeeSinkTokenFilter.SinkTokenStream
- incrementToken() - Method in class org.apache.lucene.analysis.snowball.SnowballFilter
-
Returns the next input Token, after being stemmed
- incrementToken() - Method in class org.apache.lucene.analysis.sr.SerbianNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.sr.SerbianNormalizationRegularFilter
- incrementToken() - Method in class org.apache.lucene.analysis.sv.SwedishLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.sv.SwedishMinimalStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.synonym.SynonymFilter
-
Deprecated.
- incrementToken() - Method in class org.apache.lucene.analysis.synonym.SynonymGraphFilter
- incrementToken() - Method in class org.apache.lucene.analysis.synonym.word2vec.Word2VecSynonymFilter
- incrementToken() - Method in class org.apache.lucene.analysis.te.TeluguNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.te.TeluguStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.tr.ApostropheFilter
- incrementToken() - Method in class org.apache.lucene.analysis.tr.TurkishLowerCaseFilter
- incrementToken() - Method in class org.apache.lucene.analysis.util.CharTokenizer
- incrementToken() - Method in class org.apache.lucene.analysis.util.ElisionFilter
-
Increments the TokenStream with a CharTermAttribute stripped of its elided start.
- incrementToken() - Method in class org.apache.lucene.analysis.util.SegmentingTokenizerBase
- incrementToken() - Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- incrementWord() - Method in class org.apache.lucene.analysis.th.ThaiTokenizer
- incrementWord() - Method in class org.apache.lucene.analysis.util.SegmentingTokenizerBase
-
Returns true if another word is available
- IndicNormalizationFilter - Class in org.apache.lucene.analysis.in
-
A TokenFilter that applies IndicNormalizer to normalize text in Indian languages.
- IndicNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.in.IndicNormalizationFilter
- IndicNormalizationFilterFactory - Class in org.apache.lucene.analysis.in
-
Factory for IndicNormalizationFilter.
- IndicNormalizationFilterFactory() - Constructor for class org.apache.lucene.analysis.in.IndicNormalizationFilterFactory
-
Default ctor for compatibility with SPI
- IndicNormalizationFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.in.IndicNormalizationFilterFactory
-
Creates a new IndicNormalizationFilterFactory
- IndicNormalizer - Class in org.apache.lucene.analysis.in
-
Normalizes the Unicode representation of text in Indian languages.
- IndicNormalizer() - Constructor for class org.apache.lucene.analysis.in.IndicNormalizer
- IndonesianAnalyzer - Class in org.apache.lucene.analysis.id
-
Analyzer for Indonesian (Bahasa)
- IndonesianAnalyzer() - Constructor for class org.apache.lucene.analysis.id.IndonesianAnalyzer
-
Builds an analyzer with the default stop words: IndonesianAnalyzer.DEFAULT_STOPWORD_FILE.
- IndonesianAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.id.IndonesianAnalyzer
-
Builds an analyzer with the given stop words.
- IndonesianAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.id.IndonesianAnalyzer
-
Builds an analyzer with the given stop words.
- IndonesianStemFilter - Class in org.apache.lucene.analysis.id
-
A TokenFilter that applies IndonesianStemmer to stem Indonesian words.
- IndonesianStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.id.IndonesianStemFilter
- IndonesianStemFilter(TokenStream, boolean) - Constructor for class org.apache.lucene.analysis.id.IndonesianStemFilter
-
Create a new IndonesianStemFilter.
- IndonesianStemFilterFactory - Class in org.apache.lucene.analysis.id
-
Factory for IndonesianStemFilter.
- IndonesianStemFilterFactory() - Constructor for class org.apache.lucene.analysis.id.IndonesianStemFilterFactory
-
Default ctor for compatibility with SPI
- IndonesianStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.id.IndonesianStemFilterFactory
-
Creates a new IndonesianStemFilterFactory
- IndonesianStemmer - Class in org.apache.lucene.analysis.id
-
Stemmer for Indonesian.
- IndonesianStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- IndonesianStemmer() - Constructor for class org.apache.lucene.analysis.id.IndonesianStemmer
- IndonesianStemmer() - Constructor for class org.tartarus.snowball.ext.IndonesianStemmer
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.charfilter.MappingCharFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.core.TypeTokenFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.en.AbstractWordsFileFilterFactory
-
Initialize the set of stopwords provided via ResourceLoader, or using defaults.
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.hunspell.HunspellStemFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.miscellaneous.ConditionalTokenFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.miscellaneous.KeywordMarkerFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilterFactory
-
Deprecated.
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.pattern.PatternTypingFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.payloads.DelimitedPayloadTokenFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.snowball.SnowballPorterFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.synonym.SynonymFilterFactory
-
Deprecated.
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.synonym.SynonymGraphFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.synonym.word2vec.Word2VecSynonymFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.util.ElisionFilterFactory
- INFORMATION_SEPARATOR_MARKER - Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Example marker character: U+001F (INFORMATION SEPARATOR ONE)
- init() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- initReader(String, Reader) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer
- initReader(String, Reader) - Method in class org.apache.lucene.analysis.fa.PersianAnalyzer
-
Wraps the Reader with PersianCharFilter
- initReaderForNormalization(String, Reader) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer
- inMemory() - Static method in class org.apache.lucene.analysis.hunspell.SortingStrategy
-
The strategy that loads all entries as String objects and sorts them in memory.
- insert(char[], int, char) - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- insert(int, int, CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
- insert(String, char) - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Branches are initially compressed, needing one node per key plus the size of the string key.
- insertBalanced(String[], char[], int, int) - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Recursively insert the median first and then the median of the lower and upper halves, and so on in order to get a balanced tree.
- IntegerEncoder - Class in org.apache.lucene.analysis.payloads
-
Encode a character array Integer as a BytesRef.
- IntegerEncoder() - Constructor for class org.apache.lucene.analysis.payloads.IntegerEncoder
- INTERNAL_LINK - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- INTERNAL_LINK_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- IrishAnalyzer - Class in org.apache.lucene.analysis.ga
-
Analyzer for Irish.
- IrishAnalyzer() - Constructor for class org.apache.lucene.analysis.ga.IrishAnalyzer
-
Builds an analyzer with the default stop words: IrishAnalyzer.DEFAULT_STOPWORD_FILE.
- IrishAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.ga.IrishAnalyzer
-
Builds an analyzer with the given stop words.
- IrishAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.ga.IrishAnalyzer
-
Builds an analyzer with the given stop words.
- IrishLowerCaseFilter - Class in org.apache.lucene.analysis.ga
-
Normalises token text to lower case, handling t-prothesis and n-eclipsis (e.g., 'nAthair' should become 'n-athair').
- IrishLowerCaseFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ga.IrishLowerCaseFilter
-
Create an IrishLowerCaseFilter that normalises Irish token text.
- IrishLowerCaseFilterFactory - Class in org.apache.lucene.analysis.ga
-
Factory for IrishLowerCaseFilter.
- IrishLowerCaseFilterFactory() - Constructor for class org.apache.lucene.analysis.ga.IrishLowerCaseFilterFactory
-
Default ctor for compatibility with SPI
- IrishLowerCaseFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ga.IrishLowerCaseFilterFactory
-
Creates a new IrishLowerCaseFilterFactory
- IrishStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- IrishStemmer() - Constructor for class org.tartarus.snowball.ext.IrishStemmer
- isGramType() - Method in class org.apache.lucene.analysis.commongrams.CommonGramsQueryFilter
-
Convenience method to check if the current type is a gram type
- isIgnoreCase() - Method in class org.apache.lucene.analysis.en.AbstractWordsFileFilterFactory
- isIgnoreCase() - Method in class org.apache.lucene.analysis.miscellaneous.KeywordMarkerFilterFactory
- isIgnoreCase() - Method in class org.apache.lucene.analysis.miscellaneous.ProtectedTermFilterFactory
- isIgnoreCase() - Method in class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilterFactory
- isKeyword() - Method in class org.apache.lucene.analysis.miscellaneous.KeywordMarkerFilter
- isKeyword() - Method in class org.apache.lucene.analysis.miscellaneous.PatternKeywordMarkerFilter
- isKeyword() - Method in class org.apache.lucene.analysis.miscellaneous.SetKeywordMarkerFilter
- isOutputUnigrams() - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- isOutputUnigramsIfNoShingles() - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- isPreserveOriginal() - Method in class org.apache.lucene.analysis.miscellaneous.ASCIIFoldingFilter
-
Does the filter preserve the original tokens?
- isSafeEnd(char) - Method in class org.apache.lucene.analysis.util.SegmentingTokenizerBase
-
For sentence tokenization, these are the unambiguous break positions.
- isTokenChar(int) - Method in class org.apache.lucene.analysis.core.LetterTokenizer
-
Collects only characters which satisfy Character.isLetter(int).
- isTokenChar(int) - Method in class org.apache.lucene.analysis.core.UnicodeWhitespaceTokenizer
-
Collects only characters which do not satisfy Unicode's WHITESPACE property.
- isTokenChar(int) - Method in class org.apache.lucene.analysis.core.WhitespaceTokenizer
-
Collects only characters which do not satisfy Character.isWhitespace(int).
- isTokenChar(int) - Method in class org.apache.lucene.analysis.ngram.NGramTokenizer
-
Only collect characters which satisfy this condition.
- isTokenChar(int) - Method in class org.apache.lucene.analysis.util.CharTokenizer
-
Returns true iff a codepoint should be included in a token.
- ItalianAnalyzer - Class in org.apache.lucene.analysis.it
-
Analyzer for Italian.
- ItalianAnalyzer() - Constructor for class org.apache.lucene.analysis.it.ItalianAnalyzer
-
Builds an analyzer with the default stop words: ItalianAnalyzer.DEFAULT_STOPWORD_FILE.
- ItalianAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.it.ItalianAnalyzer
-
Builds an analyzer with the given stop words.
- ItalianAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.it.ItalianAnalyzer
-
Builds an analyzer with the given stop words.
- ItalianLightStemFilter - Class in org.apache.lucene.analysis.it
-
A TokenFilter that applies ItalianLightStemmer to stem Italian words.
- ItalianLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.it.ItalianLightStemFilter
- ItalianLightStemFilterFactory - Class in org.apache.lucene.analysis.it
-
Factory for ItalianLightStemFilter.
- ItalianLightStemFilterFactory() - Constructor for class org.apache.lucene.analysis.it.ItalianLightStemFilterFactory
-
Default ctor for compatibility with SPI
- ItalianLightStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.it.ItalianLightStemFilterFactory
-
Creates a new ItalianLightStemFilterFactory
- ItalianLightStemmer - Class in org.apache.lucene.analysis.it
-
Light Stemmer for Italian.
- ItalianLightStemmer() - Constructor for class org.apache.lucene.analysis.it.ItalianLightStemmer
- ItalianStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- ItalianStemmer() - Constructor for class org.tartarus.snowball.ext.ItalianStemmer
- ITALICS - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- ITALICS_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Iterator() - Constructor for class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
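The insertBalanced(String[], char[], int, int) entry in this section describes inserting the median first, then the medians of the lower and upper halves. The following is a minimal plain-Java sketch of that insertion order only. It does not use or reproduce TernaryTree: the class MedianFirstOrder and its methods are hypothetical names, and the input keys are assumed to be already sorted.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration class; not part of Lucene.
public class MedianFirstOrder {

    // Appends keys[offset .. offset+n) to out in median-first order:
    // the middle key, then recursively the lower half, then the upper half.
    static void collect(String[] keys, int offset, int n, List<String> out) {
        if (n < 1) {
            return; // empty range, nothing to insert
        }
        int m = n >> 1; // index of the median within the range
        out.add(keys[offset + m]);
        collect(keys, offset, m, out);                 // lower half
        collect(keys, offset + m + 1, n - m - 1, out); // upper half
    }

    // Returns the order in which sorted keys would be inserted.
    static List<String> order(String[] sortedKeys) {
        List<String> out = new ArrayList<>();
        collect(sortedKeys, 0, sortedKeys.length, out);
        return out;
    }

    public static void main(String[] args) {
        String[] keys = {"a", "b", "c", "d", "e", "f", "g"};
        System.out.println(order(keys)); // prints [d, b, a, c, f, e, g]
    }
}
```

Inserting sorted keys in this order keeps each subtree roughly half the size of its parent, which is what makes the resulting tree balanced; inserting them in plain sorted order would instead degenerate into a long chain.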
J
- join(String[], CharsRefBuilder) - Static method in class org.apache.lucene.analysis.synonym.SynonymMap.Builder
-
Sugar: just joins the provided terms with SynonymMap.WORD_SEPARATOR.
- jreBugWorkaround(char) - Method in class org.apache.lucene.analysis.util.CharArrayIterator
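The join(String[], CharsRefBuilder) entry in this section says the terms are joined with SynonymMap.WORD_SEPARATOR. A minimal plain-Java sketch of that idea follows. It returns a String rather than working with CharsRefBuilder, the class JoinSketch is a hypothetical name, and the separator value U+0000 is an assumption made for this sketch, not something stated in this index.

```java
// Hypothetical illustration class; not part of Lucene.
public class JoinSketch {

    // Assumed separator value for this sketch (U+0000).
    static final char WORD_SEPARATOR = (char) 0;

    // Joins the words of a multi-word term with the separator between them.
    static String join(String[] words) {
        StringBuilder sb = new StringBuilder();
        for (String word : words) {
            if (sb.length() > 0) {
                sb.append(WORD_SEPARATOR); // separator only between words
            }
            sb.append(word);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String joined = join(new String[] {"wi", "fi"});
        // Two 2-character words plus one separator character.
        System.out.println(joined.length());
    }
}
```

Using a character that cannot occur in ordinary token text as the separator lets a multi-word term be stored as one flat character sequence and split back into its words unambiguously.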
K
- KAF - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- KAF - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- KASRA - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- KASRATAN - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- KATAKANA - Static variable in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
bigram flag for Katakana
- KATAKANA - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
-
Katakana token type
- KATAKANA_TYPE - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Katakana token type
- KEEP - Static variable in class org.apache.lucene.analysis.miscellaneous.CapitalizationFilterFactory
- KEEP_IGNORE_CASE - Static variable in class org.apache.lucene.analysis.miscellaneous.CapitalizationFilterFactory
- KeepWordFilter - Class in org.apache.lucene.analysis.miscellaneous
-
A TokenFilter that only keeps tokens with text contained in the required words.
- KeepWordFilter(TokenStream, CharArraySet) - Constructor for class org.apache.lucene.analysis.miscellaneous.KeepWordFilter
-
Create a new KeepWordFilter.
- KeepWordFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for KeepWordFilter.
- KeepWordFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.KeepWordFilterFactory
-
Default ctor for compatibility with SPI
- KeepWordFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.KeepWordFilterFactory
-
Creates a new KeepWordFilterFactory
- KEHEH - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- ket - Variable in class org.tartarus.snowball.SnowballProgram
- keys() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- KeywordAnalyzer - Class in org.apache.lucene.analysis.core
-
"Tokenizes" the entire stream as a single token.
- KeywordAnalyzer() - Constructor for class org.apache.lucene.analysis.core.KeywordAnalyzer
- KeywordMarkerFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Marks terms as keywords via the KeywordAttribute.
- KeywordMarkerFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.KeywordMarkerFilter
-
Creates a new KeywordMarkerFilter
- KeywordMarkerFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for KeywordMarkerFilter.
- KeywordMarkerFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.KeywordMarkerFilterFactory
-
Default ctor for compatibility with SPI
- KeywordMarkerFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.KeywordMarkerFilterFactory
-
Creates a new KeywordMarkerFilterFactory
- KeywordRepeatFilter - Class in org.apache.lucene.analysis.miscellaneous
-
This TokenFilter emits each incoming token twice: once as a keyword and once as a non-keyword, in other words once with KeywordAttribute.setKeyword(boolean) set to true and once set to false.
- KeywordRepeatFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.KeywordRepeatFilter
-
Construct a token stream filtering the given input.
- KeywordRepeatFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for KeywordRepeatFilter.
- KeywordRepeatFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.KeywordRepeatFilterFactory
-
Default ctor for compatibility with SPI
- KeywordRepeatFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.KeywordRepeatFilterFactory
-
Creates a new KeywordRepeatFilterFactory
- KeywordTokenizer - Class in org.apache.lucene.analysis.core
-
Emits the entire input as a single token.
- KeywordTokenizer() - Constructor for class org.apache.lucene.analysis.core.KeywordTokenizer
- KeywordTokenizer(int) - Constructor for class org.apache.lucene.analysis.core.KeywordTokenizer
- KeywordTokenizer(AttributeFactory, int) - Constructor for class org.apache.lucene.analysis.core.KeywordTokenizer
- KeywordTokenizerFactory - Class in org.apache.lucene.analysis.core
-
Factory for KeywordTokenizer.
- KeywordTokenizerFactory() - Constructor for class org.apache.lucene.analysis.core.KeywordTokenizerFactory
-
Default ctor for compatibility with SPI
- KeywordTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.core.KeywordTokenizerFactory
-
Creates a new KeywordTokenizerFactory
- knows(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- KpStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- KpStemmer() - Constructor for class org.tartarus.snowball.ext.KpStemmer
- KStemFilter - Class in org.apache.lucene.analysis.en
-
A high-performance KStem filter for English.
- KStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.en.KStemFilter
- KStemFilterFactory - Class in org.apache.lucene.analysis.en
-
Factory for KStemFilter.
- KStemFilterFactory() - Constructor for class org.apache.lucene.analysis.en.KStemFilterFactory
-
Default ctor for compatibility with SPI
- KStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.en.KStemFilterFactory
-
Creates a new KStemFilterFactory
- KStemmer - Class in org.apache.lucene.analysis.en
-
This class implements the Kstem algorithm
- kv - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
This vector holds the trailing part of the keys when the branch is compressed.
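The KeywordRepeatFilter entry above describes emitting each incoming token twice, once flagged as a keyword and once not. The following is a minimal plain-JDK sketch of that idea only. It does not use Lucene, and the class KeywordRepeatSketch, its Token type, and the repeat method are hypothetical names introduced for illustration. Emitting the keyword copy first is an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.List;

// Plain-JDK illustration only; these are hypothetical types, not Lucene classes.
final class KeywordRepeatSketch {

    /** A term paired with a keyword flag (stands in for a token and its keyword attribute). */
    static final class Token {
        final String text;
        final boolean keyword;

        Token(String text, boolean keyword) {
            this.text = text;
            this.keyword = keyword;
        }

        @Override
        public String toString() {
            return text + (keyword ? "[keyword]" : "[plain]");
        }
    }

    /** Emits every input term twice: first with the keyword flag set, then with it cleared. */
    static List<Token> repeat(List<String> terms) {
        List<Token> out = new ArrayList<>();
        for (String term : terms) {
            out.add(new Token(term, true));   // keyword copy (order is an assumption)
            out.add(new Token(term, false));  // non-keyword copy
        }
        return out;
    }

    public static void main(String[] args) {
        // Prints the doubled token list for two sample terms.
        System.out.println(repeat(List.of("running", "dogs")));
    }
}
```

In this sketch the output list is always twice as long as the input, and the two copies of a term differ only in the flag.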
L
- LAM - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- last() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- LatvianAnalyzer - Class in org.apache.lucene.analysis.lv
-
Analyzer for Latvian.
- LatvianAnalyzer() - Constructor for class org.apache.lucene.analysis.lv.LatvianAnalyzer
-
Builds an analyzer with the default stop words: LatvianAnalyzer.DEFAULT_STOPWORD_FILE.
- LatvianAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.lv.LatvianAnalyzer
-
Builds an analyzer with the given stop words.
- LatvianAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.lv.LatvianAnalyzer
-
Builds an analyzer with the given stop words.
- LatvianStemFilter - Class in org.apache.lucene.analysis.lv
-
A TokenFilter that applies LatvianStemmer to stem Latvian words.
- LatvianStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.lv.LatvianStemFilter
- LatvianStemFilterFactory - Class in org.apache.lucene.analysis.lv
-
Factory for LatvianStemFilter.
- LatvianStemFilterFactory() - Constructor for class org.apache.lucene.analysis.lv.LatvianStemFilterFactory
-
Default ctor for compatibility with SPI
- LatvianStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.lv.LatvianStemFilterFactory
-
Creates a new LatvianStemFilterFactory
- LatvianStemmer - Class in org.apache.lucene.analysis.lv
-
Light stemmer for Latvian.
- LatvianStemmer() - Constructor for class org.apache.lucene.analysis.lv.LatvianStemmer
- len - Variable in class org.apache.lucene.analysis.util.OpenStringBuilder
- length - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- length() - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
-
Returns the number of items in the array.
- length() - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
-
Returns the number of items in the array.
- length() - Method in class org.apache.lucene.analysis.compound.hyphenation.Hyphenation
- length() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- LengthFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Removes words that are too long or too short from the stream.
- LengthFilter(TokenStream, int, int) - Constructor for class org.apache.lucene.analysis.miscellaneous.LengthFilter
-
Create a new LengthFilter.
- LengthFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for LengthFilter.
- LengthFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.LengthFilterFactory
-
Default ctor for compatibility with SPI
- LengthFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.LengthFilterFactory
-
Creates a new LengthFilterFactory
- LetterTokenizer - Class in org.apache.lucene.analysis.core
-
A LetterTokenizer is a tokenizer that divides text at non-letters.
- LetterTokenizer() - Constructor for class org.apache.lucene.analysis.core.LetterTokenizer
-
Construct a new LetterTokenizer.
- LetterTokenizer(AttributeFactory) - Constructor for class org.apache.lucene.analysis.core.LetterTokenizer
-
Construct a new LetterTokenizer using a given AttributeFactory.
- LetterTokenizer(AttributeFactory, int) - Constructor for class org.apache.lucene.analysis.core.LetterTokenizer
-
Construct a new LetterTokenizer using a given AttributeFactory.
- LetterTokenizerFactory - Class in org.apache.lucene.analysis.core
-
Factory for LetterTokenizer.
- LetterTokenizerFactory() - Constructor for class org.apache.lucene.analysis.core.LetterTokenizerFactory
-
Default ctor for compatibility with SPI
- LetterTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.core.LetterTokenizerFactory
-
Creates a new LetterTokenizerFactory
- limit - Variable in class org.tartarus.snowball.SnowballProgram
- limit_backward - Variable in class org.tartarus.snowball.SnowballProgram
- LimitTokenCountAnalyzer - Class in org.apache.lucene.analysis.miscellaneous
-
This Analyzer limits the number of tokens while indexing.
- LimitTokenCountAnalyzer(Analyzer, int) - Constructor for class org.apache.lucene.analysis.miscellaneous.LimitTokenCountAnalyzer
-
Build an analyzer that limits the maximum number of tokens per field.
- LimitTokenCountAnalyzer(Analyzer, int, boolean) - Constructor for class org.apache.lucene.analysis.miscellaneous.LimitTokenCountAnalyzer
-
Build an analyzer that limits the maximum number of tokens per field.
- LimitTokenCountFilter - Class in org.apache.lucene.analysis.miscellaneous
-
This TokenFilter limits the number of tokens while indexing.
- LimitTokenCountFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.miscellaneous.LimitTokenCountFilter
-
Build a filter that only accepts tokens up to a maximum number.
- LimitTokenCountFilter(TokenStream, int, boolean) - Constructor for class org.apache.lucene.analysis.miscellaneous.LimitTokenCountFilter
-
Build a filter that limits the maximum number of tokens per field.
- LimitTokenCountFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for LimitTokenCountFilter.
- LimitTokenCountFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.LimitTokenCountFilterFactory
-
Default ctor for compatibility with SPI
- LimitTokenCountFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.LimitTokenCountFilterFactory
-
Creates a new LimitTokenCountFilterFactory
- LimitTokenOffsetFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Lets tokens pass through while their start offset is <= a configured limit; the first token whose start offset exceeds the limit does not pass and ends the stream.
- LimitTokenOffsetFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.miscellaneous.LimitTokenOffsetFilter
-
Lets tokens pass through while their start offset is <= maxStartOffset; the first token whose start offset exceeds it does not pass and ends the stream.
- LimitTokenOffsetFilter(TokenStream, int, boolean) - Constructor for class org.apache.lucene.analysis.miscellaneous.LimitTokenOffsetFilter
- LimitTokenOffsetFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for LimitTokenOffsetFilter.
- LimitTokenOffsetFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.LimitTokenOffsetFilterFactory
-
Default ctor for compatibility with SPI
- LimitTokenOffsetFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.LimitTokenOffsetFilterFactory
- LimitTokenPositionFilter - Class in org.apache.lucene.analysis.miscellaneous
-
This TokenFilter limits its emitted tokens to those with positions that are not greater than the configured limit.
- LimitTokenPositionFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.miscellaneous.LimitTokenPositionFilter
-
Build a filter that only accepts tokens up to and including the given maximum position.
- LimitTokenPositionFilter(TokenStream, int, boolean) - Constructor for class org.apache.lucene.analysis.miscellaneous.LimitTokenPositionFilter
-
Build a filter that limits the maximum position of tokens to emit.
- LimitTokenPositionFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for LimitTokenPositionFilter.
- LimitTokenPositionFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.LimitTokenPositionFilterFactory
-
Default ctor for compatibility with SPI
- LimitTokenPositionFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.LimitTokenPositionFilterFactory
-
Creates a new LimitTokenPositionFilterFactory
- LithuanianAnalyzer - Class in org.apache.lucene.analysis.lt
-
Analyzer for Lithuanian.
- LithuanianAnalyzer() - Constructor for class org.apache.lucene.analysis.lt.LithuanianAnalyzer
-
Builds an analyzer with the default stop words: LithuanianAnalyzer.DEFAULT_STOPWORD_FILE.
- LithuanianAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.lt.LithuanianAnalyzer
-
Builds an analyzer with the given stop words.
- LithuanianAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.lt.LithuanianAnalyzer
-
Builds an analyzer with the given stop words.
- LithuanianStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- LithuanianStemmer() - Constructor for class org.tartarus.snowball.ext.LithuanianStemmer
- lo - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Pointer to the low branch, and to the rest of the key when it is stored directly in this node (Java has no unions).
- loadPatterns(InputSource) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Read hyphenation patterns from an XML file.
- loadSynonyms(ResourceLoader, String, boolean, Analyzer) - Method in class org.apache.lucene.analysis.synonym.SynonymFilterFactory
-
Deprecated. Load synonyms with the given SynonymMap.Parser class.
- loadSynonyms(ResourceLoader, String, boolean, Analyzer) - Method in class org.apache.lucene.analysis.synonym.SynonymGraphFilterFactory
-
Load synonyms with the given SynonymMap.Parser class.
- LOCALE - Static variable in class org.apache.lucene.analysis.miscellaneous.DateRecognizerFilterFactory
- log() - Method in class org.apache.lucene.analysis.br.BrazilianStemmer
-
For logging and debugging purposes.
- lookupEntries(String) - Method in class org.apache.lucene.analysis.hunspell.Dictionary
- LovinsStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- LovinsStemmer() - Constructor for class org.tartarus.snowball.ext.LovinsStemmer
- LOWER - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated.
- LowerCaseFilter - Class in org.apache.lucene.analysis.core
-
Normalizes token text to lower case.
- LowerCaseFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.core.LowerCaseFilter
-
Create a new LowerCaseFilter, that normalizes token text to lower case.
- LowerCaseFilterFactory - Class in org.apache.lucene.analysis.core
-
Factory for LowerCaseFilter.
- LowerCaseFilterFactory() - Constructor for class org.apache.lucene.analysis.core.LowerCaseFilterFactory
-
Default ctor for compatibility with SPI
- LowerCaseFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.core.LowerCaseFilterFactory
-
Creates a new LowerCaseFilterFactory
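The LengthFilter and LimitTokenCountFilter entries above describe two simple pruning rules: dropping tokens that are too short or too long, and accepting tokens only up to a maximum number. The sketch below illustrates both rules over a plain list of strings. It does not use Lucene, the names TokenLimitSketch, keepByLength, and limitCount are hypothetical, and treating the length bounds as inclusive is an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.List;

// Plain-JDK illustration only; hypothetical helper names, not Lucene classes.
final class TokenLimitSketch {

    /** Keeps tokens whose length lies in [min, max]; inclusive bounds are an assumption here. */
    static List<String> keepByLength(List<String> tokens, int min, int max) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            int len = token.length();
            if (len >= min && len <= max) {
                out.add(token);
            }
        }
        return out;
    }

    /** Passes through at most maxTokenCount tokens and drops everything after them. */
    static List<String> limitCount(List<String> tokens, int maxTokenCount) {
        int end = Math.max(0, Math.min(maxTokenCount, tokens.size()));
        return new ArrayList<>(tokens.subList(0, end));
    }

    public static void main(String[] args) {
        List<String> tokens = List.of("a", "to", "index", "tokenization");
        System.out.println(keepByLength(tokens, 2, 5));
        System.out.println(limitCount(tokens, 3));
    }
}
```

The two helpers are independent, so in this sketch they can be applied in either order, although the resulting lists may differ.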
M
- MappingCharFilter - Class in org.apache.lucene.analysis.charfilter
-
Simplistic CharFilter that applies the mappings contained in a NormalizeCharMap to the character stream and corrects the offsets for the resulting changes.
- MappingCharFilter(NormalizeCharMap, Reader) - Constructor for class org.apache.lucene.analysis.charfilter.MappingCharFilter
-
Default constructor that takes a Reader.
- MappingCharFilterFactory - Class in org.apache.lucene.analysis.charfilter
-
Factory for MappingCharFilter.
- MappingCharFilterFactory() - Constructor for class org.apache.lucene.analysis.charfilter.MappingCharFilterFactory
-
Default ctor for compatibility with SPI
- MappingCharFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.charfilter.MappingCharFilterFactory
-
Creates a new MappingCharFilterFactory
- matches(char[], int) - Method in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
- matches(char[], int) - Method in class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSetExceptions
- matches(char[], int) - Method in class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSuffixExceptions
- MAX_KEY - Static variable in class org.apache.lucene.analysis.miscellaneous.CodepointCountFilterFactory
- MAX_KEY - Static variable in class org.apache.lucene.analysis.miscellaneous.LengthFilterFactory
- MAX_OUTPUT_TOKEN_SIZE_KEY - Static variable in class org.apache.lucene.analysis.miscellaneous.FingerprintFilterFactory
- MAX_START_OFFSET - Static variable in class org.apache.lucene.analysis.miscellaneous.LimitTokenOffsetFilterFactory
- MAX_TOKEN_COUNT_KEY - Static variable in class org.apache.lucene.analysis.miscellaneous.LimitTokenCountFilterFactory
- MAX_TOKEN_LENGTH - Static variable in class org.apache.lucene.analysis.miscellaneous.CapitalizationFilterFactory
- MAX_TOKEN_LENGTH_LIMIT - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
-
Absolute maximum sized token
- MAX_TOKEN_POSITION_KEY - Static variable in class org.apache.lucene.analysis.miscellaneous.LimitTokenPositionFilterFactory
- MAX_WORD_COUNT - Static variable in class org.apache.lucene.analysis.miscellaneous.CapitalizationFilterFactory
- maxHorizontalContext - Variable in class org.apache.lucene.analysis.synonym.SynonymMap
-
maxHorizontalContext: maximum context we need on the tokenstream
- maxSubwordSize - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- min - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
- min - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
- MIN_KEY - Static variable in class org.apache.lucene.analysis.miscellaneous.CodepointCountFilterFactory
- MIN_KEY - Static variable in class org.apache.lucene.analysis.miscellaneous.LengthFilterFactory
- MIN_WORD_LENGTH - Static variable in class org.apache.lucene.analysis.miscellaneous.CapitalizationFilterFactory
- MinHashFilter - Class in org.apache.lucene.analysis.minhash
-
Generate min hash tokens from an incoming stream of tokens.
- MinHashFilter(TokenStream, int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.minhash.MinHashFilter
-
Creates a MinHash filter.
- MinHashFilterFactory - Class in org.apache.lucene.analysis.minhash
- MinHashFilterFactory() - Constructor for class org.apache.lucene.analysis.minhash.MinHashFilterFactory
-
Default ctor for compatibility with SPI
- MinHashFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.minhash.MinHashFilterFactory
-
Create a MinHashFilterFactory.
- minSubwordSize - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- minWordSize - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
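The MappingCharFilter entry above describes applying string mappings to a character stream while keeping track of how offsets change. The sketch below shows one way to think about that: replace the longest matching key at each position and record, for every output character, the input offset it came from. It does not use Lucene. The names CharMappingSketch, Result, and apply are hypothetical, and the greedy longest-match rule is an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Plain-JDK illustration only; hypothetical names, not Lucene classes.
final class CharMappingSketch {

    /** Mapped text plus, for each output character, the input offset it originated from. */
    static final class Result {
        final String text;
        final int[] inputOffsets;

        Result(String text, int[] inputOffsets) {
            this.text = text;
            this.inputOffsets = inputOffsets;
        }
    }

    /** Replaces the longest matching key at each position (greedy matching is assumed). */
    static Result apply(String input, Map<String, String> mappings) {
        StringBuilder out = new StringBuilder();
        List<Integer> offsets = new ArrayList<>();
        int i = 0;
        while (i < input.length()) {
            String bestKey = null;
            for (String key : mappings.keySet()) {
                boolean longer = bestKey == null || key.length() > bestKey.length();
                if (!key.isEmpty() && longer && input.startsWith(key, i)) {
                    bestKey = key;
                }
            }
            if (bestKey == null) {
                // No mapping starts here: copy the character unchanged.
                out.append(input.charAt(i));
                offsets.add(i);
                i++;
            } else {
                // Every replacement character points back to the start of the matched key.
                String replacement = mappings.get(bestKey);
                for (int k = 0; k < replacement.length(); k++) {
                    out.append(replacement.charAt(k));
                    offsets.add(i);
                }
                i += bestKey.length();
            }
        }
        int[] inputOffsets = new int[offsets.size()];
        for (int k = 0; k < inputOffsets.length; k++) {
            inputOffsets[k] = offsets.get(k);
        }
        return new Result(out.toString(), inputOffsets);
    }

    public static void main(String[] args) {
        Result r = apply("phone pool", Map.of("ph", "f", "p", "b", "oo", "u"));
        System.out.println(r.text);
    }
}
```

Keeping the offset array alongside the mapped text is what would let a later stage report positions in terms of the original input rather than the rewritten one.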
N
- name - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
- name() - Method in class org.apache.lucene.collation.CollationDocValuesField
- NAME - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizationFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ar.ArabicStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.bg.BulgarianStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.bn.BengaliNormalizationFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.bn.BengaliStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.boost.DelimitedBoostTokenFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.br.BrazilianStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.charfilter.HTMLStripCharFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.charfilter.MappingCharFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.cjk.CJKBigramFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.cjk.CJKWidthCharFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.cjk.CJKWidthFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ckb.SoraniNormalizationFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ckb.SoraniStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.classic.ClassicFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.classic.ClassicTokenizerFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.commongrams.CommonGramsFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.commongrams.CommonGramsQueryFilterFactory
- NAME - Static variable in class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.core.DecimalDigitFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.core.FlattenGraphFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.core.KeywordTokenizerFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.core.LetterTokenizerFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.core.LowerCaseFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.core.StopFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.core.TypeTokenFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.core.UpperCaseFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.core.WhitespaceTokenizerFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.cz.CzechStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.de.GermanLightStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.de.GermanMinimalStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.de.GermanNormalizationFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.de.GermanStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.el.GreekLowerCaseFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.el.GreekStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.en.EnglishMinimalStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.en.EnglishPossessiveFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.en.KStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.en.PorterStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.es.SpanishLightStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.es.SpanishMinimalStemFilterFactory
-
Deprecated. SPI name
- NAME - Static variable in class org.apache.lucene.analysis.es.SpanishPluralStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.fa.PersianCharFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizationFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.fa.PersianStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.fi.FinnishLightStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.fr.FrenchLightStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.fr.FrenchMinimalStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ga.IrishLowerCaseFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.gl.GalicianMinimalStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.gl.GalicianStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.hi.HindiNormalizationFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.hi.HindiStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.hu.HungarianLightStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.hunspell.HunspellStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.id.IndonesianStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.in.IndicNormalizationFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.it.ItalianLightStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.lv.LatvianStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.minhash.MinHashFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.ASCIIFoldingFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.CapitalizationFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.CodepointCountFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.ConditionalTokenFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.DateRecognizerFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.DelimitedTermFrequencyTokenFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.DropIfFlaggedFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.FingerprintFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.FixBrokenOffsetsFilterFactory
-
Deprecated. SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.HyphenatedWordsFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.KeepWordFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.KeywordMarkerFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.KeywordRepeatFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.LengthFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.LimitTokenCountFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.LimitTokenOffsetFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.LimitTokenPositionFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.ProtectedTermFilterFactory
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.RemoveDuplicatesTokenFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.ScandinavianFoldingFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizationFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.TrimFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.TruncateTokenFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.TypeAsSynonymFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilterFactory
-
Deprecated. SPI name
- NAME - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ngram.EdgeNGramFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizerFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ngram.NGramFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ngram.NGramTokenizerFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.no.NorwegianLightStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.no.NorwegianMinimalStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.no.NorwegianNormalizationFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.path.PathHierarchyTokenizerFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.pattern.PatternCaptureGroupFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.pattern.PatternReplaceCharFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.pattern.PatternReplaceFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.pattern.PatternTokenizerFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.pattern.PatternTypingFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.pattern.SimplePatternSplitTokenizerFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.pattern.SimplePatternTokenizerFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.payloads.DelimitedPayloadTokenFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.payloads.NumericPayloadTokenFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.payloads.TokenOffsetPayloadTokenFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.payloads.TypeAsPayloadTokenFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.pt.PortugueseLightStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.pt.PortugueseMinimalStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.pt.PortugueseStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ru.RussianLightStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.shingle.FixedShingleFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.shingle.ShingleFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.snowball.SnowballPorterFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.sr.SerbianNormalizationFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.sv.SwedishLightStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.sv.SwedishMinimalStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.synonym.SynonymFilterFactory
-
Deprecated. SPI name
- NAME - Static variable in class org.apache.lucene.analysis.synonym.SynonymGraphFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.synonym.word2vec.Word2VecSynonymFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.te.TeluguNormalizationFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.te.TeluguStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.th.ThaiTokenizerFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.tr.ApostropheFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.tr.TurkishLowerCaseFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.util.ElisionFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizerFactory
-
SPI name
- NepaliAnalyzer - Class in org.apache.lucene.analysis.ne
-
Analyzer for Nepali.
- NepaliAnalyzer() - Constructor for class org.apache.lucene.analysis.ne.NepaliAnalyzer
-
Builds an analyzer with the default stop words: NepaliAnalyzer.DEFAULT_STOPWORD_FILE.
- NepaliAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.ne.NepaliAnalyzer
-
Builds an analyzer with the given stop words
- NepaliAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.ne.NepaliAnalyzer
-
Builds an analyzer with the given stop words
- NepaliStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- NepaliStemmer() - Constructor for class org.tartarus.snowball.ext.NepaliStemmer
- newInstance(String, Class<T>) - Method in class org.apache.lucene.analysis.util.FilesystemResourceLoader
- newSentenceInstance() - Static method in class org.apache.lucene.analysis.util.CharArrayIterator
-
Create a new CharArrayIterator that works around JRE bugs in a manner suitable for BreakIterator.getSentenceInstance()
- newSinkTokenStream() - Method in class org.apache.lucene.analysis.sinks.TeeSinkTokenFilter
-
Returns a new TeeSinkTokenFilter.SinkTokenStream that receives all tokens consumed by this stream.
- newWordInstance() - Static method in class org.apache.lucene.analysis.util.CharArrayIterator
-
Create a new CharArrayIterator that works around JRE bugs in a manner suitable for BreakIterator.getWordInstance()
- next() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- nextElement() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
- NGramFilterFactory - Class in org.apache.lucene.analysis.ngram
-
Factory for NGramTokenFilter.
- NGramFilterFactory() - Constructor for class org.apache.lucene.analysis.ngram.NGramFilterFactory
-
Default ctor for compatibility with SPI
- NGramFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ngram.NGramFilterFactory
-
Creates a new NGramFilterFactory
- NGramFragmentChecker - Class in org.apache.lucene.analysis.hunspell
-
A FragmentChecker based on all character n-grams possible in a certain language, keeping them in a relatively memory-efficient, but probabilistic data structure.
- NGramFragmentChecker.NGramConsumer - Interface in org.apache.lucene.analysis.hunspell
-
A callback for n-gram ranges in words
- NGramTokenFilter - Class in org.apache.lucene.analysis.ngram
-
Tokenizes the input into n-grams of the given size(s).
- NGramTokenFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.ngram.NGramTokenFilter
-
Creates an NGramTokenFilter that produces n-grams of the indicated size.
- NGramTokenFilter(TokenStream, int, int, boolean) - Constructor for class org.apache.lucene.analysis.ngram.NGramTokenFilter
-
Creates an NGramTokenFilter that, for a given input term, produces all contained n-grams with lengths >= minGram and <= maxGram.
- NGramTokenizer - Class in org.apache.lucene.analysis.ngram
-
Tokenizes the input into n-grams of the given size(s).
- NGramTokenizer() - Constructor for class org.apache.lucene.analysis.ngram.NGramTokenizer
-
Creates NGramTokenizer with default min and max n-grams.
- NGramTokenizer(int, int) - Constructor for class org.apache.lucene.analysis.ngram.NGramTokenizer
-
Creates NGramTokenizer with given min and max n-grams.
- NGramTokenizer(AttributeFactory, int, int) - Constructor for class org.apache.lucene.analysis.ngram.NGramTokenizer
-
Creates NGramTokenizer with given min and max n-grams.
- NGramTokenizerFactory - Class in org.apache.lucene.analysis.ngram
-
Factory for NGramTokenizer.
- NGramTokenizerFactory() - Constructor for class org.apache.lucene.analysis.ngram.NGramTokenizerFactory
-
Default ctor for compatibility with SPI
- NGramTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ngram.NGramTokenizerFactory
-
Creates a new NGramTokenizerFactory
- NO_TIMEOUT - org.apache.lucene.analysis.hunspell.TimeoutPolicy
-
Let the computation complete even if it takes ages
- noBreak - Variable in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
- NOON - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- normalize(char[], int) - Method in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
Normalize an input buffer of Arabic text
- normalize(char[], int) - Method in class org.apache.lucene.analysis.bn.BengaliNormalizer
-
Normalize an input buffer of Bengali text
- normalize(char[], int) - Method in class org.apache.lucene.analysis.ckb.SoraniNormalizer
-
Normalize an input buffer of Sorani text
- normalize(char[], int) - Method in class org.apache.lucene.analysis.fa.PersianNormalizer
-
Normalize an input buffer of Persian text
- normalize(char[], int) - Method in class org.apache.lucene.analysis.hi.HindiNormalizer
-
Normalize an input buffer of Hindi text
- normalize(char[], int) - Method in class org.apache.lucene.analysis.in.IndicNormalizer
-
Normalizes input text, and returns the new length.
- normalize(char[], int) - Method in class org.apache.lucene.analysis.te.TeluguNormalizer
-
Normalize an input buffer of Telugu text
- normalize(Reader) - Method in class org.apache.lucene.analysis.charfilter.MappingCharFilterFactory
- normalize(Reader) - Method in class org.apache.lucene.analysis.cjk.CJKWidthCharFilterFactory
- normalize(Reader) - Method in class org.apache.lucene.analysis.fa.PersianCharFilterFactory
- normalize(Reader) - Method in class org.apache.lucene.analysis.pattern.PatternReplaceCharFilterFactory
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.ar.ArabicAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.bg.BulgarianAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.bn.BengaliAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.ca.CatalanAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.cjk.CJKAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.ckb.SoraniAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.classic.ClassicAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.core.SimpleAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.core.StopAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.cz.CzechAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.da.DanishAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.de.GermanAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.el.GreekAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.email.UAX29URLEmailAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.en.EnglishAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.es.SpanishAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.et.EstonianAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.eu.BasqueAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.fa.PersianAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.fi.FinnishAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.ga.IrishAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.gl.GalicianAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.hi.HindiAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.hu.HungarianAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.hy.ArmenianAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.id.IndonesianAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.it.ItalianAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.lt.LithuanianAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.lv.LatvianAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.ne.NepaliAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.no.NorwegianAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.pt.PortugueseAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.ro.RomanianAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.ru.RussianAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.sr.SerbianAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.sv.SwedishAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.ta.TamilAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.te.TeluguAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.th.ThaiAnalyzer
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.tr.TurkishAnalyzer
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.ar.ArabicNormalizationFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.bn.BengaliNormalizationFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.cjk.CJKWidthFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.ckb.SoraniNormalizationFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.core.DecimalDigitFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.core.LowerCaseFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.core.UpperCaseFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.de.GermanNormalizationFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.el.GreekLowerCaseFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.fa.PersianNormalizationFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.ga.IrishLowerCaseFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.hi.HindiNormalizationFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.in.IndicNormalizationFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.ASCIIFoldingFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.ScandinavianFoldingFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizationFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.TrimFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.no.NorwegianNormalizationFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.sr.SerbianNormalizationFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.te.TeluguNormalizationFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.tr.TurkishLowerCaseFilterFactory
- normalize(TokenStream) - Method in class org.apache.lucene.analysis.util.ElisionFilterFactory
- NormalizeCharMap - Class in org.apache.lucene.analysis.charfilter
-
Holds a map of String input to String output, to be used with MappingCharFilter.
- NormalizeCharMap.Builder - Class in org.apache.lucene.analysis.charfilter
-
Builds a NormalizeCharMap.
- normalizeException(ArrayList<?>) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- normMap - Variable in class org.apache.lucene.analysis.charfilter.MappingCharFilterFactory
- NorwegianAnalyzer - Class in org.apache.lucene.analysis.no
-
Analyzer for Norwegian.
- NorwegianAnalyzer() - Constructor for class org.apache.lucene.analysis.no.NorwegianAnalyzer
-
Builds an analyzer with the default stop words: NorwegianAnalyzer.DEFAULT_STOPWORD_FILE.
- NorwegianAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.no.NorwegianAnalyzer
-
Builds an analyzer with the given stop words.
- NorwegianAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.no.NorwegianAnalyzer
-
Builds an analyzer with the given stop words.
- NorwegianLightStemFilter - Class in org.apache.lucene.analysis.no
-
A TokenFilter that applies NorwegianLightStemmer to stem Norwegian words.
- NorwegianLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.no.NorwegianLightStemFilter
- NorwegianLightStemFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.no.NorwegianLightStemFilter
-
Creates a new NorwegianLightStemFilter
- NorwegianLightStemFilterFactory - Class in org.apache.lucene.analysis.no
-
Factory for NorwegianLightStemFilter.
- NorwegianLightStemFilterFactory() - Constructor for class org.apache.lucene.analysis.no.NorwegianLightStemFilterFactory
-
Default ctor for compatibility with SPI
- NorwegianLightStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.no.NorwegianLightStemFilterFactory
-
Creates a new NorwegianLightStemFilterFactory
- NorwegianLightStemmer - Class in org.apache.lucene.analysis.no
-
Light Stemmer for Norwegian.
- NorwegianLightStemmer(int) - Constructor for class org.apache.lucene.analysis.no.NorwegianLightStemmer
-
Creates a new NorwegianLightStemmer
- NorwegianMinimalStemFilter - Class in org.apache.lucene.analysis.no
-
A TokenFilter that applies NorwegianMinimalStemmer to stem Norwegian words.
- NorwegianMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.no.NorwegianMinimalStemFilter
- NorwegianMinimalStemFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.no.NorwegianMinimalStemFilter
-
Creates a new NorwegianMinimalStemFilter
- NorwegianMinimalStemFilterFactory - Class in org.apache.lucene.analysis.no
-
Factory for NorwegianMinimalStemFilter.
- NorwegianMinimalStemFilterFactory() - Constructor for class org.apache.lucene.analysis.no.NorwegianMinimalStemFilterFactory
-
Default ctor for compatibility with SPI
- NorwegianMinimalStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.no.NorwegianMinimalStemFilterFactory
-
Creates a new NorwegianMinimalStemFilterFactory
- NorwegianMinimalStemmer - Class in org.apache.lucene.analysis.no
-
Minimal Stemmer for Norwegian Bokmål (no-nb) and Nynorsk (no-nn)
- NorwegianMinimalStemmer(int) - Constructor for class org.apache.lucene.analysis.no.NorwegianMinimalStemmer
-
Creates a new NorwegianMinimalStemmer
- NorwegianNormalizationFilter - Class in org.apache.lucene.analysis.no
-
This filter normalizes use of the interchangeable Scandinavian characters æÆäÄöÖøØ and folded variants (ae, oe, aa) by transforming them to åÅæÆøØ.
- NorwegianNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.no.NorwegianNormalizationFilter
- NorwegianNormalizationFilterFactory - Class in org.apache.lucene.analysis.no
-
Factory for NorwegianNormalizationFilter.
- NorwegianNormalizationFilterFactory() - Constructor for class org.apache.lucene.analysis.no.NorwegianNormalizationFilterFactory
-
Default ctor for compatibility with SPI
- NorwegianNormalizationFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.no.NorwegianNormalizationFilterFactory
- NorwegianStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- NorwegianStemmer() - Constructor for class org.tartarus.snowball.ext.NorwegianStemmer
- NUM - Static variable in class org.apache.lucene.analysis.classic.ClassicTokenizer
- NUM - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
-
Numeric token type
- NUM_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- NUMERIC_TYPE - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Numbers
- NumericPayloadTokenFilter - Class in org.apache.lucene.analysis.payloads
-
Assigns a payload to a token based on the TypeAttribute
- NumericPayloadTokenFilter(TokenStream, float, String) - Constructor for class org.apache.lucene.analysis.payloads.NumericPayloadTokenFilter
- NumericPayloadTokenFilterFactory - Class in org.apache.lucene.analysis.payloads
-
Factory for NumericPayloadTokenFilter.
- NumericPayloadTokenFilterFactory() - Constructor for class org.apache.lucene.analysis.payloads.NumericPayloadTokenFilterFactory
-
Default ctor for compatibility with SPI
- NumericPayloadTokenFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.payloads.NumericPayloadTokenFilterFactory
-
Creates a new NumericPayloadTokenFilterFactory
- NYNORSK - Static variable in class org.apache.lucene.analysis.no.NorwegianLightStemmer
-
Constant to remove Nynorsk-specific endings
O
- OE - org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizer.Foldings
- offline(Directory, String) - Static method in class org.apache.lucene.analysis.hunspell.SortingStrategy
-
An "offline" strategy that creates temporary files in the given directory and uses them for sorting with OfflineSorter.
- offset - Variable in class org.apache.lucene.analysis.util.SegmentingTokenizerBase
-
accumulated offset of previous buffers for this reader, for offsetAtt
- offsetAtt - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- OFFSETS - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilterFactory
- OK_PREFIX - Static variable in class org.apache.lucene.analysis.miscellaneous.CapitalizationFilterFactory
- ONLY_FIRST_WORD - Static variable in class org.apache.lucene.analysis.miscellaneous.CapitalizationFilterFactory
- onlyLongestMatch - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- OO - org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizer.Foldings
- openResource(String) - Method in class org.apache.lucene.analysis.util.FilesystemResourceLoader
- OpenStringBuilder - Class in org.apache.lucene.analysis.util
-
A StringBuilder that allows one to access the array.
- OpenStringBuilder() - Constructor for class org.apache.lucene.analysis.util.OpenStringBuilder
- OpenStringBuilder(char[], int) - Constructor for class org.apache.lucene.analysis.util.OpenStringBuilder
- OpenStringBuilder(int) - Constructor for class org.apache.lucene.analysis.util.OpenStringBuilder
- org.apache.lucene.analysis.ar - package org.apache.lucene.analysis.ar
-
Analyzer for Arabic.
- org.apache.lucene.analysis.bg - package org.apache.lucene.analysis.bg
-
Analyzer for Bulgarian.
- org.apache.lucene.analysis.bn - package org.apache.lucene.analysis.bn
-
Analyzer for Bengali Language.
- org.apache.lucene.analysis.boost - package org.apache.lucene.analysis.boost
-
Provides various convenience classes for creating boosts on Tokens.
- org.apache.lucene.analysis.br - package org.apache.lucene.analysis.br
-
Analyzer for Brazilian Portuguese.
- org.apache.lucene.analysis.ca - package org.apache.lucene.analysis.ca
-
Analyzer for Catalan.
- org.apache.lucene.analysis.charfilter - package org.apache.lucene.analysis.charfilter
-
Normalization of text before the tokenizer.
- org.apache.lucene.analysis.cjk - package org.apache.lucene.analysis.cjk
-
Analyzer for Chinese, Japanese, and Korean, which indexes bigrams.
- org.apache.lucene.analysis.ckb - package org.apache.lucene.analysis.ckb
-
Analyzer for Sorani Kurdish.
- org.apache.lucene.analysis.classic - package org.apache.lucene.analysis.classic
-
Fast, general-purpose grammar-based tokenizers.
- org.apache.lucene.analysis.commongrams - package org.apache.lucene.analysis.commongrams
-
Construct n-grams for frequently occurring terms and phrases.
- org.apache.lucene.analysis.compound - package org.apache.lucene.analysis.compound
-
A filter that decomposes compound words you find in many Germanic languages into the word parts.
- org.apache.lucene.analysis.compound.hyphenation - package org.apache.lucene.analysis.compound.hyphenation
-
Hyphenation code for the CompoundWordTokenFilter.
- org.apache.lucene.analysis.core - package org.apache.lucene.analysis.core
-
Basic, general-purpose analysis components.
- org.apache.lucene.analysis.custom - package org.apache.lucene.analysis.custom
-
A general-purpose Analyzer that can be created with a builder-style API.
- org.apache.lucene.analysis.cz - package org.apache.lucene.analysis.cz
-
Analyzer for Czech.
- org.apache.lucene.analysis.da - package org.apache.lucene.analysis.da
-
Analyzer for Danish.
- org.apache.lucene.analysis.de - package org.apache.lucene.analysis.de
-
Analyzer for German.
- org.apache.lucene.analysis.el - package org.apache.lucene.analysis.el
-
Analyzer for Greek.
- org.apache.lucene.analysis.email - package org.apache.lucene.analysis.email
-
Fast, general-purpose tokenizers for URLs and email addresses.
- org.apache.lucene.analysis.en - package org.apache.lucene.analysis.en
-
Analyzer for English.
- org.apache.lucene.analysis.es - package org.apache.lucene.analysis.es
-
Analyzer for Spanish.
- org.apache.lucene.analysis.et - package org.apache.lucene.analysis.et
-
Analyzer for Estonian.
- org.apache.lucene.analysis.eu - package org.apache.lucene.analysis.eu
-
Analyzer for Basque.
- org.apache.lucene.analysis.fa - package org.apache.lucene.analysis.fa
-
Analyzer for Persian.
- org.apache.lucene.analysis.fi - package org.apache.lucene.analysis.fi
-
Analyzer for Finnish.
- org.apache.lucene.analysis.fr - package org.apache.lucene.analysis.fr
-
Analyzer for French.
- org.apache.lucene.analysis.ga - package org.apache.lucene.analysis.ga
-
Analyzer for Irish.
- org.apache.lucene.analysis.gl - package org.apache.lucene.analysis.gl
-
Analyzer for Galician.
- org.apache.lucene.analysis.hi - package org.apache.lucene.analysis.hi
-
Analyzer for Hindi.
- org.apache.lucene.analysis.hu - package org.apache.lucene.analysis.hu
-
Analyzer for Hungarian.
- org.apache.lucene.analysis.hunspell - package org.apache.lucene.analysis.hunspell
-
A Java implementation of Hunspell stemming and spell-checking algorithms (Hunspell), and a stemming TokenFilter (HunspellStemFilter) based on it.
- org.apache.lucene.analysis.hy - package org.apache.lucene.analysis.hy
-
Analyzer for Armenian.
- org.apache.lucene.analysis.id - package org.apache.lucene.analysis.id
-
Analyzer for Indonesian.
- org.apache.lucene.analysis.in - package org.apache.lucene.analysis.in
-
Analyzer for Indian languages.
- org.apache.lucene.analysis.it - package org.apache.lucene.analysis.it
-
Analyzer for Italian.
- org.apache.lucene.analysis.lt - package org.apache.lucene.analysis.lt
-
Analyzer for Lithuanian.
- org.apache.lucene.analysis.lv - package org.apache.lucene.analysis.lv
-
Analyzer for Latvian.
- org.apache.lucene.analysis.minhash - package org.apache.lucene.analysis.minhash
-
MinHash filtering (for LSH).
- org.apache.lucene.analysis.miscellaneous - package org.apache.lucene.analysis.miscellaneous
-
Miscellaneous Tokenstreams.
- org.apache.lucene.analysis.ne - package org.apache.lucene.analysis.ne
-
Analyzer for Nepali.
- org.apache.lucene.analysis.ngram - package org.apache.lucene.analysis.ngram
-
Character n-gram tokenizers and filters.
- org.apache.lucene.analysis.nl - package org.apache.lucene.analysis.nl
-
Analyzer for Dutch.
- org.apache.lucene.analysis.no - package org.apache.lucene.analysis.no
-
Analyzer for Norwegian.
- org.apache.lucene.analysis.path - package org.apache.lucene.analysis.path
-
Analysis components for path-like strings such as filenames.
- org.apache.lucene.analysis.pattern - package org.apache.lucene.analysis.pattern
-
Set of components for pattern-based (regex) analysis.
- org.apache.lucene.analysis.payloads - package org.apache.lucene.analysis.payloads
-
Provides various convenience classes for creating payloads on Tokens.
- org.apache.lucene.analysis.pt - package org.apache.lucene.analysis.pt
-
Analyzer for Portuguese.
- org.apache.lucene.analysis.query - package org.apache.lucene.analysis.query
-
Automatically filter high-frequency stopwords.
- org.apache.lucene.analysis.reverse - package org.apache.lucene.analysis.reverse
-
Filter to reverse token text.
- org.apache.lucene.analysis.ro - package org.apache.lucene.analysis.ro
-
Analyzer for Romanian.
- org.apache.lucene.analysis.ru - package org.apache.lucene.analysis.ru
-
Analyzer for Russian.
- org.apache.lucene.analysis.shingle - package org.apache.lucene.analysis.shingle
-
Word n-gram filters.
- org.apache.lucene.analysis.sinks - package org.apache.lucene.analysis.sinks
- org.apache.lucene.analysis.snowball - package org.apache.lucene.analysis.snowball
- org.apache.lucene.analysis.sr - package org.apache.lucene.analysis.sr
-
Analyzer for Serbian.
- org.apache.lucene.analysis.sv - package org.apache.lucene.analysis.sv
-
Analyzer for Swedish.
- org.apache.lucene.analysis.synonym - package org.apache.lucene.analysis.synonym
-
Analysis components for Synonyms.
- org.apache.lucene.analysis.synonym.word2vec - package org.apache.lucene.analysis.synonym.word2vec
-
Analysis components for Synonyms using Word2Vec model.
- org.apache.lucene.analysis.ta - package org.apache.lucene.analysis.ta
-
Analyzer for Tamil.
- org.apache.lucene.analysis.te - package org.apache.lucene.analysis.te
-
Analyzer for Telugu Language.
- org.apache.lucene.analysis.th - package org.apache.lucene.analysis.th
-
Analyzer for Thai.
- org.apache.lucene.analysis.tr - package org.apache.lucene.analysis.tr
-
Analyzer for Turkish.
- org.apache.lucene.analysis.util - package org.apache.lucene.analysis.util
-
Utility functions for text analysis.
- org.apache.lucene.analysis.wikipedia - package org.apache.lucene.analysis.wikipedia
-
Tokenizer that is aware of Wikipedia syntax.
- org.apache.lucene.collation - package org.apache.lucene.collation
-
Unicode collation support.
- org.apache.lucene.collation.tokenattributes - package org.apache.lucene.collation.tokenattributes
-
Custom AttributeImpl for indexing collation keys as index terms.
- org.tartarus.snowball - package org.tartarus.snowball
-
Snowball stemmer API
- org.tartarus.snowball.ext - package org.tartarus.snowball.ext
-
Autogenerated snowball stemmer implementations.
- out_grouping(char[], int, int) - Method in class org.tartarus.snowball.SnowballProgram
- out_grouping_b(char[], int, int) - Method in class org.tartarus.snowball.SnowballProgram
P
- packValues(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Packs the values by storing them in 4 bits, two values into a byte. Values range from 0 to 9.
- parse(Reader) - Method in class org.apache.lucene.analysis.synonym.SolrSynonymParser
- parse(Reader) - Method in class org.apache.lucene.analysis.synonym.SynonymMap.Parser
-
Parse the given input, adding synonyms to the inherited SynonymMap.Builder.
- parse(Reader) - Method in class org.apache.lucene.analysis.synonym.WordnetSynonymParser
- parse(Class<? extends RSLPStemmerBase>, String) - Static method in class org.apache.lucene.analysis.pt.RSLPStemmerBase
-
Parse a resource file into an RSLP stemmer description.
- parse(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
Parses a hyphenation pattern file.
- parse(String) - Static method in class org.apache.lucene.analysis.util.CSVUtil
-
Parse CSV line
- parse(InputSource) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
Parses a hyphenation pattern file.
- Parser(boolean, Analyzer) - Constructor for class org.apache.lucene.analysis.synonym.SynonymMap.Parser
- parseRules(List<String>, NormalizeCharMap.Builder) - Method in class org.apache.lucene.analysis.charfilter.MappingCharFilterFactory
- parseString(String) - Method in class org.apache.lucene.analysis.charfilter.MappingCharFilterFactory
- PathHierarchyTokenizer - Class in org.apache.lucene.analysis.path
-
Tokenizer for path-like hierarchies.
- PathHierarchyTokenizer() - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- PathHierarchyTokenizer(char, char) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- PathHierarchyTokenizer(char, char, int) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- PathHierarchyTokenizer(int) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- PathHierarchyTokenizer(int, char) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- PathHierarchyTokenizer(int, char, char, int) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- PathHierarchyTokenizer(AttributeFactory, char, char, int) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- PathHierarchyTokenizer(AttributeFactory, int, char, char, int) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- PathHierarchyTokenizerFactory - Class in org.apache.lucene.analysis.path
-
Factory for PathHierarchyTokenizer.
- PathHierarchyTokenizerFactory() - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizerFactory
-
Default ctor for compatibility with SPI
- PathHierarchyTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizerFactory
-
Creates a new PathHierarchyTokenizerFactory
- pattern - Variable in class org.apache.lucene.analysis.pattern.PatternTokenizerFactory
- PATTERN - Static variable in class org.apache.lucene.analysis.miscellaneous.KeywordMarkerFilterFactory
- PATTERN - Static variable in class org.apache.lucene.analysis.pattern.PatternTokenizerFactory
- PATTERN - Static variable in class org.apache.lucene.analysis.pattern.SimplePatternSplitTokenizerFactory
- PATTERN - Static variable in class org.apache.lucene.analysis.pattern.SimplePatternTokenizerFactory
- PatternCaptureGroupFilterFactory - Class in org.apache.lucene.analysis.pattern
-
Factory for PatternCaptureGroupTokenFilter.
- PatternCaptureGroupFilterFactory() - Constructor for class org.apache.lucene.analysis.pattern.PatternCaptureGroupFilterFactory
-
Default ctor for compatibility with SPI
- PatternCaptureGroupFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.pattern.PatternCaptureGroupFilterFactory
- PatternCaptureGroupTokenFilter - Class in org.apache.lucene.analysis.pattern
-
CaptureGroup uses Java regexes to emit multiple tokens - one for each capture group in one or more patterns.
- PatternCaptureGroupTokenFilter(TokenStream, boolean, Pattern...) - Constructor for class org.apache.lucene.analysis.pattern.PatternCaptureGroupTokenFilter
- PatternConsumer - Interface in org.apache.lucene.analysis.compound.hyphenation
-
This interface is used to connect the XML pattern file parser to the hyphenation tree.
- PatternKeywordMarkerFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Marks terms as keywords via the KeywordAttribute.
- PatternKeywordMarkerFilter(TokenStream, Pattern) - Constructor for class org.apache.lucene.analysis.miscellaneous.PatternKeywordMarkerFilter
-
Create a new PatternKeywordMarkerFilter that marks the current token as a keyword, via the KeywordAttribute, if the token's term buffer matches the provided Pattern.
- PatternParser - Class in org.apache.lucene.analysis.compound.hyphenation
-
A SAX document handler to read and parse hyphenation patterns from an XML file.
- PatternParser() - Constructor for class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- PatternParser(PatternConsumer) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- PatternReplaceCharFilter - Class in org.apache.lucene.analysis.pattern
-
CharFilter that uses a regular expression for the target of replace string.
- PatternReplaceCharFilter(Pattern, String, Reader) - Constructor for class org.apache.lucene.analysis.pattern.PatternReplaceCharFilter
- PatternReplaceCharFilterFactory - Class in org.apache.lucene.analysis.pattern
-
Factory for PatternReplaceCharFilter.
- PatternReplaceCharFilterFactory() - Constructor for class org.apache.lucene.analysis.pattern.PatternReplaceCharFilterFactory
-
Default ctor for compatibility with SPI
- PatternReplaceCharFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.pattern.PatternReplaceCharFilterFactory
-
Creates a new PatternReplaceCharFilterFactory
- PatternReplaceFilter - Class in org.apache.lucene.analysis.pattern
-
A TokenFilter which applies a Pattern to each token in the stream, replacing match occurrences with the specified replacement string.
- PatternReplaceFilter(TokenStream, Pattern, String, boolean) - Constructor for class org.apache.lucene.analysis.pattern.PatternReplaceFilter
-
Constructs an instance to replace either the first, or all occurrences
- PatternReplaceFilterFactory - Class in org.apache.lucene.analysis.pattern
-
Factory for PatternReplaceFilter.
- PatternReplaceFilterFactory() - Constructor for class org.apache.lucene.analysis.pattern.PatternReplaceFilterFactory
-
Default ctor for compatibility with SPI
- PatternReplaceFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.pattern.PatternReplaceFilterFactory
-
Creates a new PatternReplaceFilterFactory
- PatternTokenizer - Class in org.apache.lucene.analysis.pattern
-
This tokenizer uses regex pattern matching to construct distinct tokens for the input stream.
- PatternTokenizer(Pattern, int) - Constructor for class org.apache.lucene.analysis.pattern.PatternTokenizer
-
Creates a new PatternTokenizer returning tokens from the given group (-1 for split functionality)
- PatternTokenizer(AttributeFactory, Pattern, int) - Constructor for class org.apache.lucene.analysis.pattern.PatternTokenizer
-
Creates a new PatternTokenizer returning tokens from the given group (-1 for split functionality)
- PatternTokenizerFactory - Class in org.apache.lucene.analysis.pattern
-
Factory for PatternTokenizer.
- PatternTokenizerFactory() - Constructor for class org.apache.lucene.analysis.pattern.PatternTokenizerFactory
-
Default ctor for compatibility with SPI
- PatternTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.pattern.PatternTokenizerFactory
-
Creates a new PatternTokenizerFactory
- PatternTypingFilter - Class in org.apache.lucene.analysis.pattern
-
Set a type attribute to a parameterized value when tokens are matched by any of several regex patterns.
- PatternTypingFilter(TokenStream, PatternTypingFilter.PatternTypingRule...) - Constructor for class org.apache.lucene.analysis.pattern.PatternTypingFilter
- PatternTypingFilter.PatternTypingRule - Class in org.apache.lucene.analysis.pattern
-
Value holding class for pattern typing rules.
- PatternTypingFilterFactory - Class in org.apache.lucene.analysis.pattern
-
Factory for PatternTypingFilter.
- PatternTypingFilterFactory() - Constructor for class org.apache.lucene.analysis.pattern.PatternTypingFilterFactory
-
Default ctor for compatibility with SPI
- PatternTypingFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.pattern.PatternTypingFilterFactory
-
Creates a new PatternTypingFilterFactory
- PatternTypingRule(Pattern, int, String) - Constructor for class org.apache.lucene.analysis.pattern.PatternTypingFilter.PatternTypingRule
- PayloadEncoder - Interface in org.apache.lucene.analysis.payloads
-
Mainly for use with the DelimitedPayloadTokenFilter, converts char buffers to BytesRef.
- PayloadHelper - Class in org.apache.lucene.analysis.payloads
-
Utility methods for encoding payloads.
- PayloadHelper() - Constructor for class org.apache.lucene.analysis.payloads.PayloadHelper
- PerFieldAnalyzerWrapper - Class in org.apache.lucene.analysis.miscellaneous
-
This analyzer is used to facilitate scenarios where different fields require different analysis techniques.
- PerFieldAnalyzerWrapper(Analyzer) - Constructor for class org.apache.lucene.analysis.miscellaneous.PerFieldAnalyzerWrapper
-
Constructs with default analyzer.
- PerFieldAnalyzerWrapper(Analyzer, Map<String, Analyzer>) - Constructor for class org.apache.lucene.analysis.miscellaneous.PerFieldAnalyzerWrapper
-
Constructs with default analyzer and a map of analyzers to use for specific fields.
- PersianAnalyzer - Class in org.apache.lucene.analysis.fa
-
Analyzer for Persian.
- PersianAnalyzer() - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
Builds an analyzer with the default stop words: PersianAnalyzer.DEFAULT_STOPWORD_FILE.
- PersianAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
Builds an analyzer with the given stop words
- PersianCharFilter - Class in org.apache.lucene.analysis.fa
-
CharFilter that replaces instances of Zero-width non-joiner with an ordinary space.
- PersianCharFilter(Reader) - Constructor for class org.apache.lucene.analysis.fa.PersianCharFilter
- PersianCharFilterFactory - Class in org.apache.lucene.analysis.fa
-
Factory for PersianCharFilter.
- PersianCharFilterFactory() - Constructor for class org.apache.lucene.analysis.fa.PersianCharFilterFactory
-
Default ctor for compatibility with SPI
- PersianCharFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.fa.PersianCharFilterFactory
-
Creates a new PersianCharFilterFactory
- PersianNormalizationFilter - Class in org.apache.lucene.analysis.fa
-
A TokenFilter that applies PersianNormalizer to normalize the orthography.
- PersianNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.fa.PersianNormalizationFilter
- PersianNormalizationFilterFactory - Class in org.apache.lucene.analysis.fa
-
Factory for PersianNormalizationFilter.
- PersianNormalizationFilterFactory() - Constructor for class org.apache.lucene.analysis.fa.PersianNormalizationFilterFactory
-
Default ctor for compatibility with SPI
- PersianNormalizationFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.fa.PersianNormalizationFilterFactory
-
Creates a new PersianNormalizationFilterFactory
- PersianNormalizer - Class in org.apache.lucene.analysis.fa
-
Normalizer for Persian.
- PersianNormalizer() - Constructor for class org.apache.lucene.analysis.fa.PersianNormalizer
- PersianStemFilter - Class in org.apache.lucene.analysis.fa
-
A TokenFilter that applies PersianStemmer to stem Persian words.
- PersianStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.fa.PersianStemFilter
- PersianStemFilterFactory - Class in org.apache.lucene.analysis.fa
-
Factory for PersianStemFilter.
- PersianStemFilterFactory() - Constructor for class org.apache.lucene.analysis.fa.PersianStemFilterFactory
-
Default ctor for compatibility with SPI
- PersianStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.fa.PersianStemFilterFactory
-
Creates a new PersianStemFilterFactory
- PersianStemmer - Class in org.apache.lucene.analysis.fa
-
Stemmer for Persian.
- PersianStemmer() - Constructor for class org.apache.lucene.analysis.fa.PersianStemmer
- PorterStemFilter - Class in org.apache.lucene.analysis.en
-
Transforms the token stream as per the Porter stemming algorithm.
- PorterStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.en.PorterStemFilter
- PorterStemFilterFactory - Class in org.apache.lucene.analysis.en
-
Factory for PorterStemFilter.
- PorterStemFilterFactory() - Constructor for class org.apache.lucene.analysis.en.PorterStemFilterFactory
-
Default ctor for compatibility with SPI
- PorterStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.en.PorterStemFilterFactory
-
Creates a new PorterStemFilterFactory
- PorterStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- PorterStemmer() - Constructor for class org.tartarus.snowball.ext.PorterStemmer
- PortugueseAnalyzer - Class in org.apache.lucene.analysis.pt
-
Analyzer for Portuguese.
- PortugueseAnalyzer() - Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
Builds an analyzer with the default stop words: PortugueseAnalyzer.DEFAULT_STOPWORD_FILE.
- PortugueseAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
Builds an analyzer with the given stop words.
- PortugueseAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
Builds an analyzer with the given stop words.
- PortugueseLightStemFilter - Class in org.apache.lucene.analysis.pt
-
A TokenFilter that applies PortugueseLightStemmer to stem Portuguese words.
- PortugueseLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.pt.PortugueseLightStemFilter
- PortugueseLightStemFilterFactory - Class in org.apache.lucene.analysis.pt
-
Factory for PortugueseLightStemFilter.
- PortugueseLightStemFilterFactory() - Constructor for class org.apache.lucene.analysis.pt.PortugueseLightStemFilterFactory
-
Default ctor for compatibility with SPI
- PortugueseLightStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.pt.PortugueseLightStemFilterFactory
-
Creates a new PortugueseLightStemFilterFactory
- PortugueseLightStemmer - Class in org.apache.lucene.analysis.pt
-
Light Stemmer for Portuguese
- PortugueseLightStemmer() - Constructor for class org.apache.lucene.analysis.pt.PortugueseLightStemmer
- PortugueseMinimalStemFilter - Class in org.apache.lucene.analysis.pt
-
A TokenFilter that applies PortugueseMinimalStemmer to stem Portuguese words.
- PortugueseMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.pt.PortugueseMinimalStemFilter
- PortugueseMinimalStemFilterFactory - Class in org.apache.lucene.analysis.pt
-
Factory for PortugueseMinimalStemFilter.
- PortugueseMinimalStemFilterFactory() - Constructor for class org.apache.lucene.analysis.pt.PortugueseMinimalStemFilterFactory
-
Default ctor for compatibility with SPI
- PortugueseMinimalStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.pt.PortugueseMinimalStemFilterFactory
-
Creates a new PortugueseMinimalStemFilterFactory
- PortugueseMinimalStemmer - Class in org.apache.lucene.analysis.pt
-
Minimal Stemmer for Portuguese
- PortugueseMinimalStemmer() - Constructor for class org.apache.lucene.analysis.pt.PortugueseMinimalStemmer
- PortugueseStemFilter - Class in org.apache.lucene.analysis.pt
-
A TokenFilter that applies PortugueseStemmer to stem Portuguese words.
- PortugueseStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.pt.PortugueseStemFilter
- PortugueseStemFilterFactory - Class in org.apache.lucene.analysis.pt
-
Factory for PortugueseStemFilter.
- PortugueseStemFilterFactory() - Constructor for class org.apache.lucene.analysis.pt.PortugueseStemFilterFactory
-
Default ctor for compatibility with SPI
- PortugueseStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.pt.PortugueseStemFilterFactory
-
Creates a new PortugueseStemFilterFactory
- PortugueseStemmer - Class in org.apache.lucene.analysis.pt
-
Portuguese stemmer implementing the RSLP (Removedor de Sufixos da Lingua Portuguesa) algorithm.
- PortugueseStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- PortugueseStemmer() - Constructor for class org.apache.lucene.analysis.pt.PortugueseStemmer
- PortugueseStemmer() - Constructor for class org.tartarus.snowball.ext.PortugueseStemmer
- postBreak - Variable in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
- preBreak - Variable in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
- PREFIX_LENGTH_KEY - Static variable in class org.apache.lucene.analysis.miscellaneous.TruncateTokenFilterFactory
- prefixes - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- PRESERVE_ORIGINAL - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated. Causes original words to be preserved and added to the subword list (defaults to false)
- PRESERVE_ORIGINAL - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
-
Causes original words to be preserved and added to the subword list (defaults to false)
- previous() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- printStats(PrintStream) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
- printStats(PrintStream) - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- proceedPastRep() - Method in class org.apache.lucene.analysis.hunspell.Suggester
-
Returns a copy of this suggester instance that doesn't stop after encountering acceptable words after applying REP rules.
- processNGram(CharSequence, int, int) - Method in interface org.apache.lucene.analysis.hunspell.NGramFragmentChecker.NGramConsumer
- processNGrams(int, CharSequence) - Method in interface org.apache.lucene.analysis.hunspell.NGramFragmentChecker.NGramConsumer
-
Call NGramFragmentChecker.NGramConsumer.processNGram(java.lang.CharSequence, int, int) for each fragment of the length n in the given word.
- processNGrams(int, Dictionary, Runnable, NGramFragmentChecker.NGramConsumer) - Static method in class org.apache.lucene.analysis.hunspell.NGramFragmentChecker
-
Traverse the whole dictionary, generate all word forms of its entries, and process all n-grams in these word forms.
- processToken(char[], int) - Method in class org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizer
-
Takes the original buffer and length as input.
- PROTECTED_TERMS - Static variable in class org.apache.lucene.analysis.miscellaneous.ProtectedTermFilterFactory
- PROTECTED_TOKENS - Static variable in class org.apache.lucene.analysis.miscellaneous.KeywordMarkerFilterFactory
- PROTECTED_TOKENS - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilterFactory
-
Deprecated.
- PROTECTED_TOKENS - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilterFactory
- PROTECTED_TOKENS - Static variable in class org.apache.lucene.analysis.snowball.SnowballPorterFilterFactory
- ProtectedTermFilter - Class in org.apache.lucene.analysis.miscellaneous
-
A ConditionalTokenFilter that only applies its wrapped filters to tokens that are not contained in a protected set.
- ProtectedTermFilter(CharArraySet, TokenStream, Function<TokenStream, TokenStream>) - Constructor for class org.apache.lucene.analysis.miscellaneous.ProtectedTermFilter
-
Creates a new ProtectedTermFilter
- ProtectedTermFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for a ProtectedTermFilter
- ProtectedTermFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.ProtectedTermFilterFactory
-
Default ctor for compatibility with SPI
- ProtectedTermFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.ProtectedTermFilterFactory
- PUA_EC00_MARKER - Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Example marker character: U+EC00 (PRIVATE USE AREA: EC00)
- put(int, byte) - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
- put(int, char) - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
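The group parameter mentioned in the PatternTokenizer constructor entries above (-1 for split functionality, otherwise tokens taken from a capture group) can be illustrated with plain java.util.regex. This is a minimal sketch of the idea only, not Lucene's implementation: the class PatternGroupSketch and its tokens method are hypothetical names, and dropping empty tokens is an assumption made here for simplicity.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration class; it does not exist in Lucene.
public class PatternGroupSketch {

    // group < 0: the pattern acts as a delimiter and the text between matches is returned.
    // group >= 0: the text captured by that group is returned for every match.
    // Empty tokens are skipped in both modes (an assumption of this sketch).
    public static List<String> tokens(String input, Pattern pattern, int group) {
        List<String> out = new ArrayList<>();
        Matcher m = pattern.matcher(input);
        if (group < 0) {
            int last = 0; // end of the previous delimiter match
            while (m.find()) {
                if (m.start() > last) {
                    out.add(input.substring(last, m.start()));
                }
                last = m.end();
            }
            if (last < input.length()) {
                out.add(input.substring(last)); // trailing text after the last delimiter
            }
        } else {
            while (m.find()) {
                String g = m.group(group);
                if (g != null && !g.isEmpty()) {
                    out.add(g);
                }
            }
        }
        return out;
    }

    public static void main(String[] args) {
        String text = "aaa 'bbb' 'ccc'";
        // Split mode: whitespace is the delimiter.
        System.out.println(tokens(text, Pattern.compile("\\s+"), -1));
        // Group mode: keep only what is inside the single quotes.
        System.out.println(tokens(text, Pattern.compile("'([^']+)'"), 1));
    }
}
```

With a real PatternTokenizer the same choice is made through the int argument of the constructors listed above; the sketch only shows how the two modes differ in what they emit.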
Q
- QueryAutoStopWordAnalyzer - Class in org.apache.lucene.analysis.query
-
An Analyzer used primarily at query time to wrap another analyzer and provide a layer of protection which prevents very common words from being passed into queries.
- QueryAutoStopWordAnalyzer(Analyzer, IndexReader) - Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for all indexed fields from terms with a document frequency percentage greater than QueryAutoStopWordAnalyzer.defaultMaxDocFreqPercent
- QueryAutoStopWordAnalyzer(Analyzer, IndexReader, float) - Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for all indexed fields from terms with a document frequency percentage greater than the given maxPercentDocs
- QueryAutoStopWordAnalyzer(Analyzer, IndexReader, int) - Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for all indexed fields from terms with a document frequency greater than the given maxDocFreq
- QueryAutoStopWordAnalyzer(Analyzer, IndexReader, Collection<String>, float) - Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for the given selection of fields from terms with a document frequency percentage greater than the given maxPercentDocs
- QueryAutoStopWordAnalyzer(Analyzer, IndexReader, Collection<String>, int) - Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for the given selection of fields from terms with a document frequency greater than the given maxDocFreq
- quoteEscape(String) - Static method in class org.apache.lucene.analysis.util.CSVUtil
-
Quote and escape input value for CSV
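The QueryAutoStopWordAnalyzer constructor entries above describe two ways of choosing stopwords: by an absolute document frequency (maxDocFreq) or by a share of all documents (maxPercentDocs). The following plain-Java sketch shows that selection rule on an in-memory map. It does not use Lucene; the class and method names are hypothetical, and treating maxPercentDocs as a fraction between 0 and 1 that is truncated to an absolute count is an assumption of this sketch.

```java
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration class; it does not exist in Lucene.
public class DocFreqStopWordsSketch {

    // Terms whose document frequency is strictly greater than maxDocFreq become stopwords.
    public static Set<String> byMaxDocFreq(Map<String, Integer> docFreqs, int maxDocFreq) {
        Set<String> stopWords = new TreeSet<>();
        for (Map.Entry<String, Integer> e : docFreqs.entrySet()) {
            if (e.getValue() > maxDocFreq) {
                stopWords.add(e.getKey());
            }
        }
        return stopWords;
    }

    // Assumption: the percentage is a fraction of numDocs, truncated to an int threshold.
    public static Set<String> byMaxPercentDocs(
            Map<String, Integer> docFreqs, int numDocs, float maxPercentDocs) {
        return byMaxDocFreq(docFreqs, (int) (numDocs * maxPercentDocs));
    }

    public static void main(String[] args) {
        // Made-up document frequencies for a 100-document index.
        Map<String, Integer> docFreqs = Map.of("the", 90, "of", 50, "lucene", 10);
        System.out.println(byMaxPercentDocs(docFreqs, 100, 0.4f));
        System.out.println(byMaxDocFreq(docFreqs, 60));
    }
}
```

In the real analyzer the frequencies come from the IndexReader passed to the constructor, and the Collection<String> overloads restrict the calculation to the given fields.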
R
- r_A() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_AA() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_B() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_BB() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_C() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_CC() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_CONSONANT() - Method in class org.tartarus.snowball.ext.HindiStemmer
- r_D() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_E() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_F() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_G() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_H() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_I() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_J() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_K() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_KER() - Method in class org.tartarus.snowball.ext.IndonesianStemmer
- r_L() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_LONG() - Method in class org.tartarus.snowball.ext.FinnishStemmer
- r_M() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_N() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_O() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_P() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_Q() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_R() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_S() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_SUFFIX_AN_OK() - Method in class org.tartarus.snowball.ext.IndonesianStemmer
- r_SUFFIX_I_OK() - Method in class org.tartarus.snowball.ext.IndonesianStemmer
- r_SUFFIX_KAN_OK() - Method in class org.tartarus.snowball.ext.IndonesianStemmer
- r_T() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_U() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_V() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_VI() - Method in class org.tartarus.snowball.ext.FinnishStemmer
- r_VOWEL() - Method in class org.tartarus.snowball.ext.IndonesianStemmer
- r_W() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_X() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_Y() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- r_Z() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- read() - Method in class org.apache.lucene.analysis.charfilter.HTMLStripCharFilter
- read() - Method in class org.apache.lucene.analysis.charfilter.MappingCharFilter
- read() - Method in class org.apache.lucene.analysis.cjk.CJKWidthCharFilter
- read() - Method in class org.apache.lucene.analysis.fa.PersianCharFilter
- read() - Method in class org.apache.lucene.analysis.pattern.PatternReplaceCharFilter
- read() - Method in class org.apache.lucene.analysis.synonym.word2vec.Dl4jModelReader
- read(char[], int, int) - Method in class org.apache.lucene.analysis.charfilter.HTMLStripCharFilter
- read(char[], int, int) - Method in class org.apache.lucene.analysis.charfilter.MappingCharFilter
- read(char[], int, int) - Method in class org.apache.lucene.analysis.cjk.CJKWidthCharFilter
- read(char[], int, int) - Method in class org.apache.lucene.analysis.fa.PersianCharFilter
- read(char[], int, int) - Method in class org.apache.lucene.analysis.pattern.PatternReplaceCharFilter
- readToken(StringBuilder) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter.BytesRefBuilderTermAttributeImpl
- RemoveDuplicatesTokenFilter - Class in org.apache.lucene.analysis.miscellaneous
-
A TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream.
- RemoveDuplicatesTokenFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.RemoveDuplicatesTokenFilter
-
Creates a new RemoveDuplicatesTokenFilter
- RemoveDuplicatesTokenFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for RemoveDuplicatesTokenFilter.
- RemoveDuplicatesTokenFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.RemoveDuplicatesTokenFilterFactory
-
Default ctor for compatibility with SPI
- RemoveDuplicatesTokenFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.RemoveDuplicatesTokenFilterFactory
-
Creates a new RemoveDuplicatesTokenFilterFactory
- replace(char[], int) - Method in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
- replace_s(int, int, CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
- replacement - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
- reserve(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- reset() - Method in class org.apache.lucene.analysis.charfilter.MappingCharFilter
- reset() - Method in class org.apache.lucene.analysis.cjk.CJKBigramFilter
- reset() - Method in class org.apache.lucene.analysis.classic.ClassicTokenizer
- reset() - Method in class org.apache.lucene.analysis.commongrams.CommonGramsFilter
- reset() - Method in class org.apache.lucene.analysis.commongrams.CommonGramsQueryFilter
- reset() - Method in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- reset() - Method in class org.apache.lucene.analysis.core.FlattenGraphFilter
- reset() - Method in class org.apache.lucene.analysis.core.KeywordTokenizer
- reset() - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
- reset() - Method in class org.apache.lucene.analysis.hunspell.HunspellStemFilter
- reset() - Method in class org.apache.lucene.analysis.minhash.MinHashFilter
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.ASCIIFoldingFilter
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenatingTokenStream
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.ConditionalTokenFilter
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.FingerprintFilter
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.FixBrokenOffsetsFilter
-
Deprecated.
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.HyphenatedWordsFilter
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.KeywordRepeatFilter
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.LimitTokenCountFilter
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.LimitTokenPositionFilter
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.RemoveDuplicatesTokenFilter
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.TypeAsSynonymFilter
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated.
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
- reset() - Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
- reset() - Method in class org.apache.lucene.analysis.ngram.NGramTokenFilter
- reset() - Method in class org.apache.lucene.analysis.ngram.NGramTokenizer
- reset() - Method in class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- reset() - Method in class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- reset() - Method in class org.apache.lucene.analysis.pattern.PatternCaptureGroupTokenFilter
- reset() - Method in class org.apache.lucene.analysis.pattern.PatternTokenizer
- reset() - Method in class org.apache.lucene.analysis.pattern.SimplePatternSplitTokenizer
- reset() - Method in class org.apache.lucene.analysis.pattern.SimplePatternTokenizer
- reset() - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- reset() - Method in class org.apache.lucene.analysis.sinks.TeeSinkTokenFilter
- reset() - Method in class org.apache.lucene.analysis.sinks.TeeSinkTokenFilter.SinkTokenStream
- reset() - Method in class org.apache.lucene.analysis.synonym.SynonymFilter
-
Deprecated.
- reset() - Method in class org.apache.lucene.analysis.synonym.SynonymGraphFilter
- reset() - Method in class org.apache.lucene.analysis.synonym.word2vec.Word2VecSynonymFilter
- reset() - Method in class org.apache.lucene.analysis.util.CharTokenizer
- reset() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- reset() - Method in class org.apache.lucene.analysis.util.SegmentingTokenizerBase
- reset() - Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- reset(Reader) - Method in class org.apache.lucene.analysis.util.RollingCharBuffer
-
Clear array and switch to new reader.
- resize(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- resolveEntity(String, String) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- RETURN_PARTIAL_RESULT - org.apache.lucene.analysis.hunspell.TimeoutPolicy
-
Just stop the calculation and return whatever has been computed so far
- reverse(char[]) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Reverses the given input buffer in-place
- reverse(char[], int) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Partially reverses the given input buffer in-place from offset 0 up to the given length.
- reverse(char[], int, int) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Partially reverses the given input buffer in-place from the given offset up to the given length.
- reverse(String) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Reverses the given input string
- ReversePathHierarchyTokenizer - Class in org.apache.lucene.analysis.path
-
Tokenizer for domain-like hierarchies.
- ReversePathHierarchyTokenizer() - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(char, char) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(char, char, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(char, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(int, char) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(int, char, char) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(int, char, char, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(AttributeFactory, char, char, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(AttributeFactory, int, char, char, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReverseStringFilter - Class in org.apache.lucene.analysis.reverse
-
Reverse token string, for example "country" => "yrtnuoc".
- ReverseStringFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Create a new ReverseStringFilter that reverses all tokens in the supplied TokenStream.
- ReverseStringFilter(TokenStream, char) - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Create a new ReverseStringFilter that reverses and marks all tokens in the supplied TokenStream.
- ReverseStringFilterFactory - Class in org.apache.lucene.analysis.reverse
-
Factory for ReverseStringFilter.
- ReverseStringFilterFactory() - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilterFactory
-
Default ctor for compatibility with SPI
- ReverseStringFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilterFactory
-
Creates a new ReverseStringFilterFactory
- rewind() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
- RollingCharBuffer - Class in org.apache.lucene.analysis.util
-
Acts like a forever growing char[] as you read characters into it from the provided reader, but internally it uses a circular buffer to only hold the characters that haven't been freed yet.
- RollingCharBuffer() - Constructor for class org.apache.lucene.analysis.util.RollingCharBuffer
- RomanianAnalyzer - Class in org.apache.lucene.analysis.ro
-
Analyzer for Romanian.
- RomanianAnalyzer() - Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
Builds an analyzer with the default stop words: RomanianAnalyzer.DEFAULT_STOPWORD_FILE.
- RomanianAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
Builds an analyzer with the given stop words.
- RomanianAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
Builds an analyzer with the given stop words.
- RomanianStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- RomanianStemmer() - Constructor for class org.tartarus.snowball.ext.RomanianStemmer
- root - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- RSLPStemmerBase - Class in org.apache.lucene.analysis.pt
-
Base class for stemmers that use a set of RSLP-like stemming steps.
- RSLPStemmerBase() - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase
- RSLPStemmerBase.Rule - Class in org.apache.lucene.analysis.pt
-
A basic rule, with no exceptions.
- RSLPStemmerBase.RuleWithSetExceptions - Class in org.apache.lucene.analysis.pt
-
A rule with a set of whole-word exceptions.
- RSLPStemmerBase.RuleWithSuffixExceptions - Class in org.apache.lucene.analysis.pt
-
A rule with a set of exceptional suffixes.
- RSLPStemmerBase.Step - Class in org.apache.lucene.analysis.pt
-
A step containing a list of rules.
- RTL_DIRECTION_MARKER - Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Example marker character: U+200F (RIGHT-TO-LEFT MARK)
- Rule(String, int, String) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
Create a rule.
- RULE_JAVA - Static variable in class org.apache.lucene.analysis.core.WhitespaceTokenizerFactory
- RULE_UNICODE - Static variable in class org.apache.lucene.analysis.core.WhitespaceTokenizerFactory
- rules - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
- RuleWithSetExceptions(String, int, String, String[]) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSetExceptions
- RuleWithSuffixExceptions(String, int, String, String[]) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSuffixExceptions
- RussianAnalyzer - Class in org.apache.lucene.analysis.ru
-
Analyzer for Russian language.
- RussianAnalyzer() - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
- RussianAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
Builds an analyzer with the given stop words
- RussianAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
Builds an analyzer with the given stop words
- RussianLightStemFilter - Class in org.apache.lucene.analysis.ru
-
A TokenFilter that applies RussianLightStemmer to stem Russian words.
- RussianLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ru.RussianLightStemFilter
- RussianLightStemFilterFactory - Class in org.apache.lucene.analysis.ru
-
Factory for RussianLightStemFilter.
- RussianLightStemFilterFactory() - Constructor for class org.apache.lucene.analysis.ru.RussianLightStemFilterFactory
-
Default ctor for compatibility with SPI
- RussianLightStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ru.RussianLightStemFilterFactory
-
Creates a new RussianLightStemFilterFactory
- RussianLightStemmer - Class in org.apache.lucene.analysis.ru
-
Light Stemmer for Russian.
- RussianLightStemmer() - Constructor for class org.apache.lucene.analysis.ru.RussianLightStemmer
- RussianStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- RussianStemmer() - Constructor for class org.tartarus.snowball.ext.RussianStemmer
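The RollingCharBuffer entry above describes a buffer that behaves like an ever-growing char[] while only retaining characters that have not been freed. The following is a minimal, self-contained sketch of that idea and is not Lucene's implementation: the class name MiniRollingBuffer, its initial capacity of 4, and its capacity() helper are hypothetical and chosen only for illustration.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;

/** Hypothetical illustration of a rolling (circular) char buffer; not Lucene code. */
final class MiniRollingBuffer {
    private final Reader reader;
    private char[] buf = new char[4]; // small on purpose, so growth is easy to observe
    private int start = 0;            // absolute position of the oldest retained char
    private int end = 0;              // absolute position one past the last char read
    private boolean eof = false;

    MiniRollingBuffer(Reader reader) {
        this.reader = reader;
    }

    /** Returns the char at absolute position pos, or -1 if pos is past the end of input. */
    int get(int pos) {
        if (pos < start) {
            throw new IllegalArgumentException("position " + pos + " was already freed");
        }
        try {
            // Pull characters from the reader until pos is buffered or input ends.
            while (pos >= end && !eof) {
                int c = reader.read();
                if (c == -1) {
                    eof = true;
                    break;
                }
                if (end - start == buf.length) {
                    grow(); // every slot holds a live char, so more room is needed
                }
                buf[end % buf.length] = (char) c;
                end++;
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return pos < end ? buf[pos % buf.length] : -1;
    }

    /** Declares that positions before pos are no longer needed; their slots get reused. */
    void freeBefore(int pos) {
        if (pos < start || pos > end) {
            throw new IllegalArgumentException("cannot free before " + pos);
        }
        start = pos;
    }

    /** Current size of the backing array (exposed only for the demonstration). */
    int capacity() {
        return buf.length;
    }

    /** Doubles the backing array, re-placing live chars by their absolute position. */
    private void grow() {
        char[] bigger = new char[buf.length * 2];
        for (int p = start; p < end; p++) {
            bigger[p % bigger.length] = buf[p % buf.length];
        }
        buf = bigger;
    }

    public static void main(String[] args) {
        MiniRollingBuffer b = new MiniRollingBuffer(new StringReader("abcdefghij"));
        System.out.println((char) b.get(3)); // reads positions 0..3
        b.freeBefore(3);                     // positions 0..2 may now be overwritten
        System.out.println((char) b.get(6)); // reuses the freed slots
        System.out.println(b.capacity());    // backing array has not grown
    }
}
```

In this sketch, positions stay absolute from the caller's point of view, while the backing array grows only when every slot holds a character that has not been freed.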
S
- sc - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
The character stored in this node: splitchar.
- ScandinavianFoldingFilter - Class in org.apache.lucene.analysis.miscellaneous
-
This filter folds Scandinavian characters åÅäæÄÆ->a and öÖøØ->o.
- ScandinavianFoldingFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.ScandinavianFoldingFilter
- ScandinavianFoldingFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for ScandinavianFoldingFilter.
- ScandinavianFoldingFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.ScandinavianFoldingFilterFactory
-
Default ctor for compatibility with SPI
- ScandinavianFoldingFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.ScandinavianFoldingFilterFactory
- ScandinavianNormalizationFilter - Class in org.apache.lucene.analysis.miscellaneous
-
This filter normalizes the use of the interchangeable Scandinavian characters æÆäÄöÖøØ and folded variants (aa, ao, ae, oe and oo) by transforming them to åÅæÆøØ.
- ScandinavianNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizationFilter
- ScandinavianNormalizationFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for ScandinavianNormalizationFilter.
- ScandinavianNormalizationFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizationFilterFactory
-
Default ctor for compatibility with SPI
- ScandinavianNormalizationFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizationFilterFactory
- ScandinavianNormalizer - Class in org.apache.lucene.analysis.miscellaneous
-
This Normalizer does the heavy lifting for a set of Scandinavian normalization filters, normalizing use of the interchangeable Scandinavian characters æÆäÄöÖøØ and folded variants (aa, ao, ae, oe and oo) by transforming them to åÅæÆøØ.
- ScandinavianNormalizer(Set<ScandinavianNormalizer.Foldings>) - Constructor for class org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizer
-
Create the instance, while choosing which foldings to apply.
- ScandinavianNormalizer.Foldings - Enum in org.apache.lucene.analysis.miscellaneous
-
List of possible foldings that can be used when configuring the filter
- searchPatterns(char[], int, byte[]) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Search for all possible partial matches of the word starting at index and update interletter values.
- SegmentingTokenizerBase - Class in org.apache.lucene.analysis.util
-
Breaks text into sentences with a BreakIterator and allows subclasses to decompose these sentences into words.
- SegmentingTokenizerBase(BreakIterator) - Constructor for class org.apache.lucene.analysis.util.SegmentingTokenizerBase
-
Construct a new SegmentingTokenizerBase, using the provided BreakIterator for sentence segmentation.
- SegmentingTokenizerBase(AttributeFactory, BreakIterator) - Constructor for class org.apache.lucene.analysis.util.SegmentingTokenizerBase
-
Construct a new SegmentingTokenizerBase, also supplying the AttributeFactory.
- SEP_LABEL - Static variable in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter
-
Represents the default separator between tokens.
- SEPARATOR_KEY - Static variable in class org.apache.lucene.analysis.miscellaneous.FingerprintFilterFactory
- SerbianAnalyzer - Class in org.apache.lucene.analysis.sr
-
Analyzer for Serbian.
- SerbianAnalyzer() - Constructor for class org.apache.lucene.analysis.sr.SerbianAnalyzer
-
Builds an analyzer with the default stop words: SerbianAnalyzer.DEFAULT_STOPWORD_FILE.
- SerbianAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.sr.SerbianAnalyzer
-
Builds an analyzer with the given stop words.
- SerbianAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.sr.SerbianAnalyzer
-
Builds an analyzer with the given stop words.
- SerbianNormalizationFilter - Class in org.apache.lucene.analysis.sr
-
Normalizes Serbian Cyrillic and Latin characters to "bald" Latin.
- SerbianNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.sr.SerbianNormalizationFilter
- SerbianNormalizationFilterFactory - Class in org.apache.lucene.analysis.sr
-
Factory for SerbianNormalizationFilter.
- SerbianNormalizationFilterFactory() - Constructor for class org.apache.lucene.analysis.sr.SerbianNormalizationFilterFactory
-
Default ctor for compatibility with SPI
- SerbianNormalizationFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.sr.SerbianNormalizationFilterFactory
-
Creates a new SerbianNormalizationFilterFactory
- SerbianNormalizationRegularFilter - Class in org.apache.lucene.analysis.sr
-
Normalizes Serbian Cyrillic to Latin.
- SerbianNormalizationRegularFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.sr.SerbianNormalizationRegularFilter
- SerbianStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- SerbianStemmer() - Constructor for class org.tartarus.snowball.ext.SerbianStemmer
- set(char[], int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- setBufferSize(int) - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Sets the scanner buffer size in chars
- setCharAt(int, char) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- setConsumer(PatternConsumer) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- setCurrent(char[], int) - Method in class org.tartarus.snowball.SnowballProgram
-
Set the current string.
- setCurrent(String) - Method in class org.tartarus.snowball.SnowballProgram
-
Set the current string.
- setFillerToken(String) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Sets the string to insert for each position at which there is no token (i.e., when position increment is greater than one).
- setIndex(int) - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- setInnerFilters(List<TokenFilterFactory>) - Method in class org.apache.lucene.analysis.miscellaneous.ConditionalTokenFilterFactory
-
Set the inner filter factories to produce the TokenFilters that will be wrapped by the ConditionalTokenFilter.
- SetKeywordMarkerFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Marks terms as keywords via the KeywordAttribute.
- SetKeywordMarkerFilter(TokenStream, CharArraySet) - Constructor for class org.apache.lucene.analysis.miscellaneous.SetKeywordMarkerFilter
-
Create a new SetKeywordMarkerFilter that marks the current token as a keyword, via the KeywordAttribute, if the token's term buffer is contained in the given set.
- setLength(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- setMaxShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Set the max shingle size (default: 2)
- setMaxTokenLength(int) - Method in class org.apache.lucene.analysis.classic.ClassicAnalyzer
-
Set maximum allowed token length.
- setMaxTokenLength(int) - Method in class org.apache.lucene.analysis.classic.ClassicTokenizer
-
Set the max allowed token length.
- setMaxTokenLength(int) - Method in class org.apache.lucene.analysis.email.UAX29URLEmailAnalyzer
-
Set the max allowed token length.
- setMaxTokenLength(int) - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
-
Set the max allowed token length.
- setMinShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Set the min shingle size (default: 2).
- setNextSentence(int, int) - Method in class org.apache.lucene.analysis.th.ThaiTokenizer
- setNextSentence(int, int) - Method in class org.apache.lucene.analysis.util.SegmentingTokenizerBase
-
Provides the next input sentence for analysis
- setOutputUnigrams(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Shall the output stream contain the input tokens (unigrams) as well as shingles? (default: true.)
- setOutputUnigramsIfNoShingles(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Shall we override the behavior of outputUnigrams==false for those times when no shingles are available (because there are fewer than minShingleSize tokens in the input stream)? (default: false.)
- setStemmer(GermanStemmer) - Method in class org.apache.lucene.analysis.de.GermanStemFilter
-
Set an alternative/custom GermanStemmer for this filter.
- setStringValue(String) - Method in class org.apache.lucene.collation.CollationDocValuesField
- setText(char[], int, int) - Method in class org.apache.lucene.analysis.util.CharArrayIterator
-
Set a new region of text to be examined by this iterator
- setTokenSeparator(String) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Sets the string to use when joining adjacent tokens to form a shingle
- setTokenType(String) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Set the type of the shingle tokens produced by this filter.
- SHADDA - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- ShingleAnalyzerWrapper - Class in org.apache.lucene.analysis.shingle
-
A ShingleAnalyzerWrapper wraps a ShingleFilter around another Analyzer.
- ShingleAnalyzerWrapper() - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Wraps StandardAnalyzer.
- ShingleAnalyzerWrapper(int, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Wraps StandardAnalyzer.
- ShingleAnalyzerWrapper(Analyzer) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- ShingleAnalyzerWrapper(Analyzer, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- ShingleAnalyzerWrapper(Analyzer, int, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- ShingleAnalyzerWrapper(Analyzer, int, int, String, boolean, boolean, String) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Creates a new ShingleAnalyzerWrapper
- ShingleFilter - Class in org.apache.lucene.analysis.shingle
-
A ShingleFilter constructs shingles (token n-grams) from a token stream.
- ShingleFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Construct a ShingleFilter with default shingle size: 2.
- ShingleFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Constructs a ShingleFilter with the specified shingle size from the TokenStream input
- ShingleFilter(TokenStream, int, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Constructs a ShingleFilter with the specified shingle size from the TokenStream input
- ShingleFilter(TokenStream, String) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Construct a ShingleFilter with the specified token type for shingle tokens and the default shingle size: 2
- ShingleFilterFactory - Class in org.apache.lucene.analysis.shingle
-
Factory for ShingleFilter.
- ShingleFilterFactory() - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilterFactory
-
Default ctor for compatibility with SPI
- ShingleFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilterFactory
-
Creates a new ShingleFilterFactory
- shouldFilter() - Method in class org.apache.lucene.analysis.miscellaneous.ConditionalTokenFilter
-
Whether or not to execute the wrapped TokenFilter(s) for the current token
- shouldFilter() - Method in class org.apache.lucene.analysis.miscellaneous.ProtectedTermFilter
- SimpleAnalyzer - Class in org.apache.lucene.analysis.core
- SimpleAnalyzer() - Constructor for class org.apache.lucene.analysis.core.SimpleAnalyzer
-
Creates a new SimpleAnalyzer
- SimplePatternSplitTokenizer - Class in org.apache.lucene.analysis.pattern
- SimplePatternSplitTokenizer(String) - Constructor for class org.apache.lucene.analysis.pattern.SimplePatternSplitTokenizer
-
See RegExp for the accepted syntax.
- SimplePatternSplitTokenizer(AttributeFactory, String, int) - Constructor for class org.apache.lucene.analysis.pattern.SimplePatternSplitTokenizer
-
See RegExp for the accepted syntax.
- SimplePatternSplitTokenizer(AttributeFactory, Automaton) - Constructor for class org.apache.lucene.analysis.pattern.SimplePatternSplitTokenizer
-
Runs a pre-built automaton.
- SimplePatternSplitTokenizer(Automaton) - Constructor for class org.apache.lucene.analysis.pattern.SimplePatternSplitTokenizer
-
Runs a pre-built automaton.
- SimplePatternSplitTokenizerFactory - Class in org.apache.lucene.analysis.pattern
-
Factory for SimplePatternSplitTokenizer, for producing tokens by splitting according to the provided regexp.
- SimplePatternSplitTokenizerFactory() - Constructor for class org.apache.lucene.analysis.pattern.SimplePatternSplitTokenizerFactory
-
Default ctor for compatibility with SPI
- SimplePatternSplitTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.pattern.SimplePatternSplitTokenizerFactory
-
Creates a new SimplePatternSplitTokenizerFactory
- SimplePatternTokenizer - Class in org.apache.lucene.analysis.pattern
- SimplePatternTokenizer(String) - Constructor for class org.apache.lucene.analysis.pattern.SimplePatternTokenizer
-
See RegExp for the accepted syntax.
- SimplePatternTokenizer(AttributeFactory, String, int) - Constructor for class org.apache.lucene.analysis.pattern.SimplePatternTokenizer
-
See RegExp for the accepted syntax.
- SimplePatternTokenizer(AttributeFactory, Automaton) - Constructor for class org.apache.lucene.analysis.pattern.SimplePatternTokenizer
-
Runs a pre-built automaton.
- SimplePatternTokenizer(Automaton) - Constructor for class org.apache.lucene.analysis.pattern.SimplePatternTokenizer
-
Runs a pre-built automaton.
- SimplePatternTokenizerFactory - Class in org.apache.lucene.analysis.pattern
-
Factory for SimplePatternTokenizer, for matching tokens based on the provided regexp.
- SimplePatternTokenizerFactory() - Constructor for class org.apache.lucene.analysis.pattern.SimplePatternTokenizerFactory
-
Default ctor for compatibility with SPI
- SimplePatternTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.pattern.SimplePatternTokenizerFactory
-
Creates a new SimplePatternTokenizerFactory
- SINGLE_TYPE - Static variable in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
when we emit a unigram, it's then marked as this type
- size() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- size() - Method in interface org.apache.lucene.analysis.hunspell.DictEntries
- size() - Method in class org.apache.lucene.analysis.synonym.word2vec.Word2VecModel
- size() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- slice_check() - Method in class org.tartarus.snowball.SnowballProgram
- slice_del() - Method in class org.tartarus.snowball.SnowballProgram
- slice_from(CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
- slice_to(StringBuilder) - Method in class org.tartarus.snowball.SnowballProgram
- SnowballFilter - Class in org.apache.lucene.analysis.snowball
-
A filter that stems words using a Snowball-generated stemmer.
- SnowballFilter(TokenStream, String) - Constructor for class org.apache.lucene.analysis.snowball.SnowballFilter
-
Construct the named stemming filter.
- SnowballFilter(TokenStream, SnowballStemmer) - Constructor for class org.apache.lucene.analysis.snowball.SnowballFilter
- SnowballPorterFilterFactory - Class in org.apache.lucene.analysis.snowball
-
Factory for SnowballFilter, with configurable language
- SnowballPorterFilterFactory() - Constructor for class org.apache.lucene.analysis.snowball.SnowballPorterFilterFactory
-
Default ctor for compatibility with SPI
- SnowballPorterFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.snowball.SnowballPorterFilterFactory
-
Creates a new SnowballPorterFilterFactory
- SnowballProgram - Class in org.tartarus.snowball
-
Base class for a snowball stemmer
- SnowballProgram() - Constructor for class org.tartarus.snowball.SnowballProgram
- SnowballProgram(SnowballProgram) - Constructor for class org.tartarus.snowball.SnowballProgram
- SnowballStemmer - Class in org.tartarus.snowball
-
Parent class of all snowball stemmers, which must implement stem
- SnowballStemmer() - Constructor for class org.tartarus.snowball.SnowballStemmer
- SolrSynonymParser - Class in org.apache.lucene.analysis.synonym
-
Parser for the Solr synonyms format.
- SolrSynonymParser(boolean, boolean, Analyzer) - Constructor for class org.apache.lucene.analysis.synonym.SolrSynonymParser
- SoraniAnalyzer - Class in org.apache.lucene.analysis.ckb
-
Analyzer for Sorani Kurdish.
- SoraniAnalyzer() - Constructor for class org.apache.lucene.analysis.ckb.SoraniAnalyzer
-
Builds an analyzer with the default stop words: SoraniAnalyzer.DEFAULT_STOPWORD_FILE.
- SoraniAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.ckb.SoraniAnalyzer
-
Builds an analyzer with the given stop words.
- SoraniAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.ckb.SoraniAnalyzer
-
Builds an analyzer with the given stop words.
- SoraniNormalizationFilter - Class in org.apache.lucene.analysis.ckb
-
A TokenFilter that applies SoraniNormalizer to normalize the orthography.
- SoraniNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ckb.SoraniNormalizationFilter
- SoraniNormalizationFilterFactory - Class in org.apache.lucene.analysis.ckb
-
Factory for SoraniNormalizationFilter.
- SoraniNormalizationFilterFactory() - Constructor for class org.apache.lucene.analysis.ckb.SoraniNormalizationFilterFactory
-
Default ctor for compatibility with SPI
- SoraniNormalizationFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ckb.SoraniNormalizationFilterFactory
-
Creates a new SoraniNormalizationFilterFactory
- SoraniNormalizer - Class in org.apache.lucene.analysis.ckb
-
Normalizes the Unicode representation of Sorani text.
- SoraniNormalizer() - Constructor for class org.apache.lucene.analysis.ckb.SoraniNormalizer
- SoraniStemFilter - Class in org.apache.lucene.analysis.ckb
-
A TokenFilter that applies SoraniStemmer to stem Sorani words.
- SoraniStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ckb.SoraniStemFilter
- SoraniStemFilterFactory - Class in org.apache.lucene.analysis.ckb
-
Factory for SoraniStemFilter.
- SoraniStemFilterFactory() - Constructor for class org.apache.lucene.analysis.ckb.SoraniStemFilterFactory
-
Default ctor for compatibility with SPI
- SoraniStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ckb.SoraniStemFilterFactory
-
Creates a new SoraniStemFilterFactory
- SoraniStemmer - Class in org.apache.lucene.analysis.ckb
-
Light stemmer for Sorani
- SoraniStemmer() - Constructor for class org.apache.lucene.analysis.ckb.SoraniStemmer
- SortingStrategy - Class in org.apache.lucene.analysis.hunspell
-
The strategy defining how a Hunspell dictionary should be loaded, with different tradeoffs.
- SortingStrategy() - Constructor for class org.apache.lucene.analysis.hunspell.SortingStrategy
- SOUTH_EAST_ASIAN_TYPE - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Chars in class \p{Line_Break = Complex_Context} are from South East Asian scripts (Thai, Lao, Myanmar, Khmer, etc.).
- SOUTHEAST_ASIAN - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
-
Southeast Asian token type
- SpanishAnalyzer - Class in org.apache.lucene.analysis.es
-
Analyzer for Spanish.
- SpanishAnalyzer() - Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
-
Builds an analyzer with the default stop words: SpanishAnalyzer.DEFAULT_STOPWORD_FILE.
- SpanishAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
-
Builds an analyzer with the given stop words.
- SpanishAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
-
Builds an analyzer with the given stop words.
- SpanishLightStemFilter - Class in org.apache.lucene.analysis.es
-
A TokenFilter that applies SpanishLightStemmer to stem Spanish words.
- SpanishLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.es.SpanishLightStemFilter
- SpanishLightStemFilterFactory - Class in org.apache.lucene.analysis.es
-
Factory for SpanishLightStemFilter.
- SpanishLightStemFilterFactory() - Constructor for class org.apache.lucene.analysis.es.SpanishLightStemFilterFactory
-
Default ctor for compatibility with SPI
- SpanishLightStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.es.SpanishLightStemFilterFactory
-
Creates a new SpanishLightStemFilterFactory
- SpanishLightStemmer - Class in org.apache.lucene.analysis.es
-
Light Stemmer for Spanish
- SpanishLightStemmer() - Constructor for class org.apache.lucene.analysis.es.SpanishLightStemmer
- SpanishMinimalStemFilter - Class in org.apache.lucene.analysis.es
-
Deprecated. Use SpanishPluralStemFilter instead.
- SpanishMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.es.SpanishMinimalStemFilter
-
Deprecated.
- SpanishMinimalStemFilterFactory - Class in org.apache.lucene.analysis.es
-
Deprecated. Use SpanishPluralStemFilterFactory instead.
- SpanishMinimalStemFilterFactory() - Constructor for class org.apache.lucene.analysis.es.SpanishMinimalStemFilterFactory
-
Deprecated. Default ctor for compatibility with SPI
- SpanishMinimalStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.es.SpanishMinimalStemFilterFactory
-
Deprecated. Creates a new SpanishMinimalStemFilterFactory
- SpanishMinimalStemmer - Class in org.apache.lucene.analysis.es
-
Deprecated. Use SpanishPluralStemmer instead.
- SpanishMinimalStemmer() - Constructor for class org.apache.lucene.analysis.es.SpanishMinimalStemmer
-
Deprecated.
- SpanishPluralStemFilter - Class in org.apache.lucene.analysis.es
-
A TokenFilter that applies SpanishPluralStemmer to stem Spanish words.
- SpanishPluralStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.es.SpanishPluralStemFilter
- SpanishPluralStemFilterFactory - Class in org.apache.lucene.analysis.es
-
Factory for SpanishPluralStemFilter.
- SpanishPluralStemFilterFactory() - Constructor for class org.apache.lucene.analysis.es.SpanishPluralStemFilterFactory
-
Default ctor for compatibility with SPI
- SpanishPluralStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.es.SpanishPluralStemFilterFactory
- SpanishPluralStemmer - Class in org.apache.lucene.analysis.es
-
Plural Stemmer for Spanish
- SpanishPluralStemmer() - Constructor for class org.apache.lucene.analysis.es.SpanishPluralStemmer
- SpanishStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- SpanishStemmer() - Constructor for class org.tartarus.snowball.ext.SpanishStemmer
- spell(String) - Method in class org.apache.lucene.analysis.hunspell.Hunspell
- SPLIT_ON_CASE_CHANGE - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated. If not set, causes case changes to be ignored (subwords will only be generated given SUBWORD_DELIM tokens).
- SPLIT_ON_CASE_CHANGE - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
-
Causes lowercase -> uppercase transition to start a new subword.
- SPLIT_ON_NUMERICS - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated. If not set, causes numeric changes to be ignored (subwords will only be generated given SUBWORD_DELIM tokens).
- SPLIT_ON_NUMERICS - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
-
If not set, causes numeric changes to be ignored (subwords will only be generated given SUBWORD_DELIM tokens).
- START_OF_HEADING_MARKER - Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Example marker character: U+0001 (START OF HEADING)
- startElement(String, String, String, Attributes) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- startOffset - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase.CompoundToken
- startsWith(char[], int, String) - Static method in class org.apache.lucene.analysis.util.StemmerUtil
-
Returns true if the character array starts with the prefix.
- stem() - Method in class org.tartarus.snowball.ext.ArabicStemmer
- stem() - Method in class org.tartarus.snowball.ext.ArmenianStemmer
- stem() - Method in class org.tartarus.snowball.ext.BasqueStemmer
- stem() - Method in class org.tartarus.snowball.ext.CatalanStemmer
- stem() - Method in class org.tartarus.snowball.ext.DanishStemmer
- stem() - Method in class org.tartarus.snowball.ext.DutchStemmer
- stem() - Method in class org.tartarus.snowball.ext.EnglishStemmer
- stem() - Method in class org.tartarus.snowball.ext.EstonianStemmer
- stem() - Method in class org.tartarus.snowball.ext.FinnishStemmer
- stem() - Method in class org.tartarus.snowball.ext.FrenchStemmer
- stem() - Method in class org.tartarus.snowball.ext.German2Stemmer
- stem() - Method in class org.tartarus.snowball.ext.GermanStemmer
- stem() - Method in class org.tartarus.snowball.ext.GreekStemmer
- stem() - Method in class org.tartarus.snowball.ext.HindiStemmer
- stem() - Method in class org.tartarus.snowball.ext.HungarianStemmer
- stem() - Method in class org.tartarus.snowball.ext.IndonesianStemmer
- stem() - Method in class org.tartarus.snowball.ext.IrishStemmer
- stem() - Method in class org.tartarus.snowball.ext.ItalianStemmer
- stem() - Method in class org.tartarus.snowball.ext.KpStemmer
- stem() - Method in class org.tartarus.snowball.ext.LithuanianStemmer
- stem() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- stem() - Method in class org.tartarus.snowball.ext.NepaliStemmer
- stem() - Method in class org.tartarus.snowball.ext.NorwegianStemmer
- stem() - Method in class org.tartarus.snowball.ext.PorterStemmer
- stem() - Method in class org.tartarus.snowball.ext.PortugueseStemmer
- stem() - Method in class org.tartarus.snowball.ext.RomanianStemmer
- stem() - Method in class org.tartarus.snowball.ext.RussianStemmer
- stem() - Method in class org.tartarus.snowball.ext.SerbianStemmer
- stem() - Method in class org.tartarus.snowball.ext.SpanishStemmer
- stem() - Method in class org.tartarus.snowball.ext.SwedishStemmer
- stem() - Method in class org.tartarus.snowball.ext.TamilStemmer
- stem() - Method in class org.tartarus.snowball.ext.TurkishStemmer
- stem() - Method in class org.tartarus.snowball.ext.YiddishStemmer
- stem() - Method in class org.tartarus.snowball.SnowballStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.ar.ArabicStemmer
-
Stem an input buffer of Arabic text.
- stem(char[], int) - Method in class org.apache.lucene.analysis.bg.BulgarianStemmer
-
Stem an input buffer of Bulgarian text.
- stem(char[], int) - Method in class org.apache.lucene.analysis.bn.BengaliStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.ckb.SoraniStemmer
-
Stem an input buffer of Sorani text.
- stem(char[], int) - Method in class org.apache.lucene.analysis.cz.CzechStemmer
-
Stem an input buffer of Czech text.
- stem(char[], int) - Method in class org.apache.lucene.analysis.de.GermanLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.de.GermanMinimalStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.el.GreekStemmer
-
Stems a word contained in a leading portion of a char[] array.
- stem(char[], int) - Method in class org.apache.lucene.analysis.en.EnglishMinimalStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.es.SpanishLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.es.SpanishMinimalStemmer
-
Deprecated.
- stem(char[], int) - Method in class org.apache.lucene.analysis.es.SpanishPluralStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.fa.PersianStemmer
-
Stem an input buffer of Persian text.
- stem(char[], int) - Method in class org.apache.lucene.analysis.fi.FinnishLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.fr.FrenchLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.fr.FrenchMinimalStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.gl.GalicianMinimalStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.gl.GalicianStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.hi.HindiStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.hu.HungarianLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.it.ItalianLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.lv.LatvianStemmer
-
Stem a Latvian word.
- stem(char[], int) - Method in class org.apache.lucene.analysis.no.NorwegianLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.no.NorwegianMinimalStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.pt.PortugueseLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.pt.PortugueseMinimalStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.pt.PortugueseStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.ru.RussianLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.sv.SwedishLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.sv.SwedishMinimalStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.te.TeluguStemmer
- stem(char[], int, boolean) - Method in class org.apache.lucene.analysis.id.IndonesianStemmer
-
Stem a term (returning its new length).
- stem(String) - Method in class org.apache.lucene.analysis.br.BrazilianStemmer
-
Stems the given term to a unique discriminator.
- stem(String) - Method in class org.apache.lucene.analysis.de.GermanStemmer
-
Stems the given term to a unique discriminator.
- STEM_ENGLISH_POSSESSIVE - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated. Causes trailing "'s" to be removed for each subword
- STEM_ENGLISH_POSSESSIVE - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
-
Causes trailing "'s" to be removed for each subword
- StemmerOverrideFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Provides the ability to override any KeywordAttribute aware stemmer with custom dictionary-based stemming.
- StemmerOverrideFilter(TokenStream, StemmerOverrideFilter.StemmerOverrideMap) - Constructor for class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter
-
Create a new StemmerOverrideFilter, performing dictionary-based stemming with the provided dictionary.
- StemmerOverrideFilter.Builder - Class in org.apache.lucene.analysis.miscellaneous
-
This builder builds an FST for the StemmerOverrideFilter
- StemmerOverrideFilter.StemmerOverrideMap - Class in org.apache.lucene.analysis.miscellaneous
-
A read-only 4-byte FST backed map that allows fast case-insensitive key value lookups for StemmerOverrideFilter
- StemmerOverrideFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for StemmerOverrideFilter.
- StemmerOverrideFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilterFactory
-
Default ctor for compatibility with SPI
- StemmerOverrideFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilterFactory
-
Creates a new StemmerOverrideFilterFactory
- StemmerOverrideMap(FST<BytesRef>, boolean) - Constructor for class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter.StemmerOverrideMap
-
Creates a new StemmerOverrideFilter.StemmerOverrideMap
- StemmerUtil - Class in org.apache.lucene.analysis.util
-
Some commonly-used stemming functions
- stemPrefix(char[], int) - Method in class org.apache.lucene.analysis.ar.ArabicStemmer
-
Stem a prefix off an Arabic word.
- stemSuffix(char[], int) - Method in class org.apache.lucene.analysis.ar.ArabicStemmer
-
Stem suffix(es) off an Arabic word.
- Step(String, RSLPStemmerBase.Rule[], int, String[]) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
-
Create a new step
- STOP_WORDS_SET - Static variable in class org.apache.lucene.analysis.classic.ClassicAnalyzer
-
An unmodifiable set containing some common English words that are usually not useful for searching.
- STOP_WORDS_SET - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailAnalyzer
-
An unmodifiable set containing some common English words that are usually not useful for searching.
- StopAnalyzer - Class in org.apache.lucene.analysis.core
- StopAnalyzer(Reader) - Constructor for class org.apache.lucene.analysis.core.StopAnalyzer
-
Builds an analyzer with the stop words from the given reader.
- StopAnalyzer(Path) - Constructor for class org.apache.lucene.analysis.core.StopAnalyzer
-
Builds an analyzer with the stop words from the given path.
- StopAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.core.StopAnalyzer
-
Builds an analyzer with the stop words from the given set.
- StopFilter - Class in org.apache.lucene.analysis.core
-
Removes stop words from a token stream.
- StopFilter(TokenStream, CharArraySet) - Constructor for class org.apache.lucene.analysis.core.StopFilter
-
Constructs a filter which removes words from the input TokenStream that are named in the Set.
- StopFilterFactory - Class in org.apache.lucene.analysis.core
-
Factory for
StopFilter
. - StopFilterFactory() - Constructor for class org.apache.lucene.analysis.core.StopFilterFactory
-
Default ctor for compatibility with SPI
- StopFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.core.StopFilterFactory
-
Creates a new StopFilterFactory
- stoplist - Variable in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
This map stores hyphenation exceptions
- STOPWORDS_COMMENT - Static variable in class org.apache.lucene.analysis.fa.PersianAnalyzer
-
The comment character in the stopwords file.
- strcmp(char[], int, char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Compares 2 null terminated char arrays
- strcmp(String, char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Compares a string with null terminated char array
- strcpy(char[], int, char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- strlen(char[]) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- strlen(char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- SUB_HEADING - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- SUB_HEADING_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- subSequence(int, int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- SUBWORD_DELIM - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated.
- suffix - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
- suffixes - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- suffixes - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
- suggest(String) - Method in class org.apache.lucene.analysis.hunspell.Hunspell
- suggest(String, long) - Method in class org.apache.lucene.analysis.hunspell.Hunspell
- Suggester - Class in org.apache.lucene.analysis.hunspell
-
A generator for misspelled word corrections based on Hunspell flags.
- Suggester(Dictionary) - Constructor for class org.apache.lucene.analysis.hunspell.Suggester
- SuggestionTimeoutException - Exception in org.apache.lucene.analysis.hunspell
-
An exception thrown when Hunspell.suggest(java.lang.String) call takes too long, if TimeoutPolicy.THROW_EXCEPTION is used.
- SuggestionTimeoutException(String, List<String>) - Constructor for exception org.apache.lucene.analysis.hunspell.SuggestionTimeoutException
- suggestNoTimeout(String, Runnable) - Method in class org.apache.lucene.analysis.hunspell.Suggester
-
Compute suggestions for the given misspelled word
- suggestWithTimeout(String, long, Runnable) - Method in class org.apache.lucene.analysis.hunspell.Suggester
- SUKUN - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- SwedishAnalyzer - Class in org.apache.lucene.analysis.sv
-
Analyzer for Swedish.
- SwedishAnalyzer() - Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
Builds an analyzer with the default stop words: SwedishAnalyzer.DEFAULT_STOPWORD_FILE.
- SwedishAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
Builds an analyzer with the given stop words.
- SwedishAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
Builds an analyzer with the given stop words.
- SwedishLightStemFilter - Class in org.apache.lucene.analysis.sv
-
A TokenFilter that applies SwedishLightStemmer to stem Swedish words.
- SwedishLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.sv.SwedishLightStemFilter
- SwedishLightStemFilterFactory - Class in org.apache.lucene.analysis.sv
-
Factory for SwedishLightStemFilter.
- SwedishLightStemFilterFactory() - Constructor for class org.apache.lucene.analysis.sv.SwedishLightStemFilterFactory
-
Default ctor for compatibility with SPI
- SwedishLightStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.sv.SwedishLightStemFilterFactory
-
Creates a new SwedishLightStemFilterFactory
- SwedishLightStemmer - Class in org.apache.lucene.analysis.sv
-
Light Stemmer for Swedish.
- SwedishLightStemmer() - Constructor for class org.apache.lucene.analysis.sv.SwedishLightStemmer
- SwedishMinimalStemFilter - Class in org.apache.lucene.analysis.sv
-
A TokenFilter that applies SwedishMinimalStemmer to stem Swedish words.
- SwedishMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.sv.SwedishMinimalStemFilter
- SwedishMinimalStemFilterFactory - Class in org.apache.lucene.analysis.sv
-
Factory for SwedishMinimalStemFilter.
- SwedishMinimalStemFilterFactory() - Constructor for class org.apache.lucene.analysis.sv.SwedishMinimalStemFilterFactory
-
Default ctor for compatibility with SPI
- SwedishMinimalStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.sv.SwedishMinimalStemFilterFactory
-
Creates a new SwedishMinimalStemFilterFactory
- SwedishMinimalStemmer - Class in org.apache.lucene.analysis.sv
-
Minimal Stemmer for Swedish.
- SwedishMinimalStemmer() - Constructor for class org.apache.lucene.analysis.sv.SwedishMinimalStemmer
- SwedishStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- SwedishStemmer() - Constructor for class org.tartarus.snowball.ext.SwedishStemmer
- SynonymFilter - Class in org.apache.lucene.analysis.synonym
-
Deprecated. Use SynonymGraphFilter instead, but be sure to also use FlattenGraphFilter at index time (not at search time) as well.
- SynonymFilter(TokenStream, SynonymMap, boolean) - Constructor for class org.apache.lucene.analysis.synonym.SynonymFilter
-
Deprecated.
- SynonymFilterFactory - Class in org.apache.lucene.analysis.synonym
-
Deprecated. Use SynonymGraphFilterFactory instead, but be sure to also use FlattenGraphFilterFactory at index time (not at search time) as well.
- SynonymFilterFactory() - Constructor for class org.apache.lucene.analysis.synonym.SynonymFilterFactory
-
Deprecated. Default ctor for compatibility with SPI
- SynonymFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.synonym.SynonymFilterFactory
-
Deprecated.
- SynonymGraphFilter - Class in org.apache.lucene.analysis.synonym
-
Applies single- or multi-token synonyms from a SynonymMap to an incoming TokenStream, producing a fully correct graph output.
- SynonymGraphFilter(TokenStream, SynonymMap, boolean) - Constructor for class org.apache.lucene.analysis.synonym.SynonymGraphFilter
-
Apply previously built synonyms to incoming tokens.
- SynonymGraphFilterFactory - Class in org.apache.lucene.analysis.synonym
-
Factory for SynonymGraphFilter.
- SynonymGraphFilterFactory() - Constructor for class org.apache.lucene.analysis.synonym.SynonymGraphFilterFactory
-
Default ctor for compatibility with SPI
- SynonymGraphFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.synonym.SynonymGraphFilterFactory
- SynonymMap - Class in org.apache.lucene.analysis.synonym
-
A map of synonyms, keys and values are phrases.
- SynonymMap(FST<BytesRef>, BytesRefHash, int) - Constructor for class org.apache.lucene.analysis.synonym.SynonymMap
- SynonymMap.Builder - Class in org.apache.lucene.analysis.synonym
-
Builds an FSTSynonymMap.
- SynonymMap.Parser - Class in org.apache.lucene.analysis.synonym
-
Abstraction for parsing synonym files.
T
- TamilAnalyzer - Class in org.apache.lucene.analysis.ta
-
Analyzer for Tamil.
- TamilAnalyzer() - Constructor for class org.apache.lucene.analysis.ta.TamilAnalyzer
-
Builds an analyzer with the default stop words: TamilAnalyzer.DEFAULT_STOPWORD_FILE.
- TamilAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.ta.TamilAnalyzer
-
Builds an analyzer with the given stop words
- TamilAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.ta.TamilAnalyzer
-
Builds an analyzer with the given stop words
- TamilStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- TamilStemmer() - Constructor for class org.tartarus.snowball.ext.TamilStemmer
- TATWEEL - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- TeeSinkTokenFilter - Class in org.apache.lucene.analysis.sinks
-
This TokenFilter provides the ability to set aside attribute states that have already been analyzed.
- TeeSinkTokenFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.sinks.TeeSinkTokenFilter
- TeeSinkTokenFilter.SinkTokenStream - Class in org.apache.lucene.analysis.sinks
-
TokenStream output from a tee.
- TEH - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- TEH_MARBUTA - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- TEH_MARBUTA - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- TeluguAnalyzer - Class in org.apache.lucene.analysis.te
-
Analyzer for Telugu.
- TeluguAnalyzer() - Constructor for class org.apache.lucene.analysis.te.TeluguAnalyzer
-
Builds an analyzer with the default stop words: TeluguAnalyzer.DEFAULT_STOPWORD_FILE.
- TeluguAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.te.TeluguAnalyzer
-
Builds an analyzer with the given stop words
- TeluguAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.te.TeluguAnalyzer
-
Builds an analyzer with the given stop words
- TeluguNormalizationFilter - Class in org.apache.lucene.analysis.te
-
A TokenFilter that applies TeluguNormalizer to normalize the orthography.
- TeluguNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.te.TeluguNormalizationFilter
- TeluguNormalizationFilterFactory - Class in org.apache.lucene.analysis.te
-
Factory for TeluguNormalizationFilter.
- TeluguNormalizationFilterFactory() - Constructor for class org.apache.lucene.analysis.te.TeluguNormalizationFilterFactory
-
Default ctor for compatibility with SPI
- TeluguNormalizationFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.te.TeluguNormalizationFilterFactory
-
Creates a new TeluguNormalizationFilterFactory
- TeluguNormalizer - Class in org.apache.lucene.analysis.te
-
Normalizer for Telugu.
- TeluguNormalizer() - Constructor for class org.apache.lucene.analysis.te.TeluguNormalizer
- TeluguStemFilter - Class in org.apache.lucene.analysis.te
-
A TokenFilter that applies TeluguStemmer to stem Telugu words.
- TeluguStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.te.TeluguStemFilter
- TeluguStemFilterFactory - Class in org.apache.lucene.analysis.te
-
Factory for TeluguStemFilter.
- TeluguStemFilterFactory() - Constructor for class org.apache.lucene.analysis.te.TeluguStemFilterFactory
-
Default ctor for compatibility with SPI
- TeluguStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.te.TeluguStemFilterFactory
- TeluguStemmer - Class in org.apache.lucene.analysis.te
-
Stemmer for Telugu.
- TeluguStemmer() - Constructor for class org.apache.lucene.analysis.te.TeluguStemmer
- term - Variable in class org.apache.lucene.analysis.synonym.word2vec.TermAndBoost
-
the term
- TermAndBoost - Class in org.apache.lucene.analysis.synonym.word2vec
-
Wraps a term and boost
- TermAndBoost(BytesRef, float) - Constructor for class org.apache.lucene.analysis.synonym.word2vec.TermAndBoost
-
Creates a new TermAndBoost
- termAtt - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- termValue(int) - Method in class org.apache.lucene.analysis.synonym.word2vec.Word2VecModel
- TernaryTree - Class in org.apache.lucene.analysis.compound.hyphenation
-
Ternary Search Tree.
- TernaryTree.Iterator - Class in org.apache.lucene.analysis.compound.hyphenation
-
Enumeration over TST keys
- ThaiAnalyzer - Class in org.apache.lucene.analysis.th
-
Analyzer for the Thai language.
- ThaiAnalyzer() - Constructor for class org.apache.lucene.analysis.th.ThaiAnalyzer
-
Builds an analyzer with the default stop words.
- ThaiAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.th.ThaiAnalyzer
-
Builds an analyzer with the given stop words.
- ThaiTokenizer - Class in org.apache.lucene.analysis.th
-
Tokenizer that uses BreakIterator to tokenize Thai text.
- ThaiTokenizer() - Constructor for class org.apache.lucene.analysis.th.ThaiTokenizer
-
Creates a new ThaiTokenizer
- ThaiTokenizer(AttributeFactory) - Constructor for class org.apache.lucene.analysis.th.ThaiTokenizer
-
Creates a new ThaiTokenizer, supplying the AttributeFactory
- ThaiTokenizerFactory - Class in org.apache.lucene.analysis.th
-
Factory for ThaiTokenizer.
- ThaiTokenizerFactory() - Constructor for class org.apache.lucene.analysis.th.ThaiTokenizerFactory
-
Default ctor for compatibility with SPI
- ThaiTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.th.ThaiTokenizerFactory
-
Creates a new ThaiTokenizerFactory
- THROW_EXCEPTION - org.apache.lucene.analysis.hunspell.TimeoutPolicy
-
Throw an exception (e.g.
- TimeoutPolicy - Enum in org.apache.lucene.analysis.hunspell
-
A strategy determining what to do when Hunspell API calls take too much time
- toAutomaton() - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter
-
Converts the tokenStream to an automaton, treating the transition labels as utf-8.
- toAutomaton(boolean) - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter
-
Converts the tokenStream to an automaton.
- toCharArray() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- TOKEN_OUTPUT - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizerFactory
- TOKEN_TYPES - Static variable in class org.apache.lucene.analysis.classic.ClassicTokenizer
-
String token types that correspond to token type int constants
- TOKEN_TYPES - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
-
String token types that correspond to token type int constants
- TOKEN_TYPES - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
String token types that correspond to token type int constants
- TokenOffsetPayloadTokenFilter - Class in org.apache.lucene.analysis.payloads
-
Adds the OffsetAttribute.startOffset() and OffsetAttribute.endOffset() First 4 bytes are the start
- TokenOffsetPayloadTokenFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.payloads.TokenOffsetPayloadTokenFilter
- TokenOffsetPayloadTokenFilterFactory - Class in org.apache.lucene.analysis.payloads
-
Factory for TokenOffsetPayloadTokenFilter.
- TokenOffsetPayloadTokenFilterFactory() - Constructor for class org.apache.lucene.analysis.payloads.TokenOffsetPayloadTokenFilterFactory
-
Default ctor for compatibility with SPI
- TokenOffsetPayloadTokenFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.payloads.TokenOffsetPayloadTokenFilterFactory
-
Creates a new TokenOffsetPayloadTokenFilterFactory
- tokenOutput - Variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizerFactory
- tokens - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- TOKENS_ONLY - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
Only output tokens
- tolerateAffixRuleCountMismatches() - Method in class org.apache.lucene.analysis.hunspell.Dictionary
-
Whether incorrect PFX/SFX rule counts should be silently ignored.
- tolerateDuplicateConversionMappings() - Method in class org.apache.lucene.analysis.hunspell.Dictionary
-
Whether duplicate ICONV/OCONV lines should be silently ignored.
- toString() - Method in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
- toString() - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer
- toString() - Method in class org.apache.lucene.analysis.hunspell.AffixedWord.Affix
- toString() - Method in class org.apache.lucene.analysis.hunspell.AffixedWord
- toString() - Method in class org.apache.lucene.analysis.hunspell.DictEntry
- toString() - Method in class org.apache.lucene.analysis.hunspell.EntrySuggestion
- toString() - Method in class org.apache.lucene.analysis.miscellaneous.LimitTokenCountAnalyzer
- toString() - Method in class org.apache.lucene.analysis.miscellaneous.PerFieldAnalyzerWrapper
- toString() - Method in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
- toString() - Method in class org.apache.lucene.analysis.miscellaneous.WordDelimiterIterator
- toString() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- toUTF16() - Method in interface org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter.BytesRefBuilderTermAttribute
-
Returns the term represented as UTF-16
- toUTF16() - Method in class org.apache.lucene.analysis.miscellaneous.ConcatenateGraphFilter.BytesRefBuilderTermAttributeImpl
- TrimFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Trims leading and trailing whitespace from Tokens in the stream.
- TrimFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.TrimFilter
-
Create a new TrimFilter.
- TrimFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for TrimFilter.
- TrimFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.TrimFilterFactory
-
Default ctor for compatibility with SPI
- TrimFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.TrimFilterFactory
-
Creates a new TrimFilterFactory
- trimToSize() - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
- trimToSize() - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
- trimToSize() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Each node stores a character (splitchar) which is part of some key(s).
- TruncateTokenFilter - Class in org.apache.lucene.analysis.miscellaneous
-
A token filter for truncating the terms into a specific length.
- TruncateTokenFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.miscellaneous.TruncateTokenFilter
- TruncateTokenFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for TruncateTokenFilter.
- TruncateTokenFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.TruncateTokenFilterFactory
-
Default ctor for compatibility with SPI
- TruncateTokenFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.TruncateTokenFilterFactory
- TurkishAnalyzer - Class in org.apache.lucene.analysis.tr
-
Analyzer for Turkish.
- TurkishAnalyzer() - Constructor for class org.apache.lucene.analysis.tr.TurkishAnalyzer
-
Builds an analyzer with the default stop words: TurkishAnalyzer.DEFAULT_STOPWORD_FILE.
- TurkishAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.tr.TurkishAnalyzer
-
Builds an analyzer with the given stop words.
- TurkishAnalyzer(CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.tr.TurkishAnalyzer
-
Builds an analyzer with the given stop words.
- TurkishLowerCaseFilter - Class in org.apache.lucene.analysis.tr
-
Normalizes Turkish token text to lower case.
- TurkishLowerCaseFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.tr.TurkishLowerCaseFilter
-
Create a new TurkishLowerCaseFilter, that normalizes Turkish token text to lower case.
- TurkishLowerCaseFilterFactory - Class in org.apache.lucene.analysis.tr
-
Factory for TurkishLowerCaseFilter.
- TurkishLowerCaseFilterFactory() - Constructor for class org.apache.lucene.analysis.tr.TurkishLowerCaseFilterFactory
-
Default ctor for compatibility with SPI
- TurkishLowerCaseFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.tr.TurkishLowerCaseFilterFactory
-
Creates a new TurkishLowerCaseFilterFactory
- TurkishStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- TurkishStemmer() - Constructor for class org.tartarus.snowball.ext.TurkishStemmer
- txt - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase.CompoundToken
- TYPE_SYNONYM - Static variable in class org.apache.lucene.analysis.synonym.SynonymFilter
-
Deprecated.
- TYPE_SYNONYM - Static variable in class org.apache.lucene.analysis.synonym.SynonymGraphFilter
- TypeAsPayloadTokenFilter - Class in org.apache.lucene.analysis.payloads
-
Makes the TypeAttribute a payload.
- TypeAsPayloadTokenFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.payloads.TypeAsPayloadTokenFilter
- TypeAsPayloadTokenFilterFactory - Class in org.apache.lucene.analysis.payloads
-
Factory for TypeAsPayloadTokenFilter.
- TypeAsPayloadTokenFilterFactory() - Constructor for class org.apache.lucene.analysis.payloads.TypeAsPayloadTokenFilterFactory
-
Default ctor for compatibility with SPI
- TypeAsPayloadTokenFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.payloads.TypeAsPayloadTokenFilterFactory
-
Creates a new TypeAsPayloadTokenFilterFactory
- TypeAsSynonymFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Adds the TypeAttribute.type() as a synonym, i.e.
- TypeAsSynonymFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.TypeAsSynonymFilter
- TypeAsSynonymFilter(TokenStream, String) - Constructor for class org.apache.lucene.analysis.miscellaneous.TypeAsSynonymFilter
- TypeAsSynonymFilter(TokenStream, String, Set<String>, int) - Constructor for class org.apache.lucene.analysis.miscellaneous.TypeAsSynonymFilter
- TypeAsSynonymFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for TypeAsSynonymFilter.
- TypeAsSynonymFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.TypeAsSynonymFilterFactory
-
Default ctor for compatibility with SPI
- TypeAsSynonymFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.TypeAsSynonymFilterFactory
- TYPES - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilterFactory
-
Deprecated.
- TYPES - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilterFactory
- TypeTokenFilter - Class in org.apache.lucene.analysis.core
-
Removes tokens whose types appear in a set of blocked types from a token stream.
- TypeTokenFilter(TokenStream, Set<String>) - Constructor for class org.apache.lucene.analysis.core.TypeTokenFilter
-
Create a new TypeTokenFilter that filters tokens out (useWhiteList=false).
- TypeTokenFilter(TokenStream, Set<String>, boolean) - Constructor for class org.apache.lucene.analysis.core.TypeTokenFilter
-
Create a new TypeTokenFilter.
- TypeTokenFilterFactory - Class in org.apache.lucene.analysis.core
-
Factory class for TypeTokenFilter.
- TypeTokenFilterFactory() - Constructor for class org.apache.lucene.analysis.core.TypeTokenFilterFactory
-
Default ctor for compatibility with SPI
- TypeTokenFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.core.TypeTokenFilterFactory
-
Creates a new TypeTokenFilterFactory
U
- UAX29URLEmailAnalyzer - Class in org.apache.lucene.analysis.email
-
Filters UAX29URLEmailTokenizer with LowerCaseFilter and StopFilter, using a list of English stop words.
- UAX29URLEmailAnalyzer() - Constructor for class org.apache.lucene.analysis.email.UAX29URLEmailAnalyzer
-
Builds an analyzer with the default stop words (UAX29URLEmailAnalyzer.STOP_WORDS_SET).
- UAX29URLEmailAnalyzer(Reader) - Constructor for class org.apache.lucene.analysis.email.UAX29URLEmailAnalyzer
-
Builds an analyzer with the stop words from the given reader.
- UAX29URLEmailAnalyzer(CharArraySet) - Constructor for class org.apache.lucene.analysis.email.UAX29URLEmailAnalyzer
-
Builds an analyzer with the given stop words.
- UAX29URLEmailTokenizer - Class in org.apache.lucene.analysis.email
-
This class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29. URLs and email addresses are also tokenized according to the relevant RFCs.
- UAX29URLEmailTokenizer() - Constructor for class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
-
Creates a new instance of the UAX29URLEmailTokenizer.
- UAX29URLEmailTokenizer(AttributeFactory) - Constructor for class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
-
Creates a new UAX29URLEmailTokenizer with a given AttributeFactory
- UAX29URLEmailTokenizerFactory - Class in org.apache.lucene.analysis.email
-
Factory for UAX29URLEmailTokenizer.
- UAX29URLEmailTokenizerFactory() - Constructor for class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerFactory
-
Default ctor for compatibility with SPI
- UAX29URLEmailTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerFactory
-
Creates a new UAX29URLEmailTokenizerFactory
- UAX29URLEmailTokenizerImpl - Class in org.apache.lucene.analysis.email
-
This class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29. URLs and email addresses are also tokenized according to the relevant RFCs.
- UAX29URLEmailTokenizerImpl(Reader) - Constructor for class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Creates a new scanner
- UNICODE_VERSION - Static variable in class org.apache.lucene.analysis.util.UnicodeProps
-
Unicode version that was used to generate this file: "15.1.0.0"
- UnicodeProps - Class in org.apache.lucene.analysis.util
-
This file contains unicode properties used by various CharTokenizers.
- UnicodeWhitespaceAnalyzer - Class in org.apache.lucene.analysis.core
-
An Analyzer that uses UnicodeWhitespaceTokenizer.
- UnicodeWhitespaceAnalyzer() - Constructor for class org.apache.lucene.analysis.core.UnicodeWhitespaceAnalyzer
-
Creates a new UnicodeWhitespaceAnalyzer
- UnicodeWhitespaceTokenizer - Class in org.apache.lucene.analysis.core
-
A UnicodeWhitespaceTokenizer is a tokenizer that divides text at whitespace.
- UnicodeWhitespaceTokenizer() - Constructor for class org.apache.lucene.analysis.core.UnicodeWhitespaceTokenizer
-
Construct a new UnicodeWhitespaceTokenizer.
- UnicodeWhitespaceTokenizer(AttributeFactory) - Constructor for class org.apache.lucene.analysis.core.UnicodeWhitespaceTokenizer
-
Construct a new UnicodeWhitespaceTokenizer using a given AttributeFactory.
- UnicodeWhitespaceTokenizer(AttributeFactory, int) - Constructor for class org.apache.lucene.analysis.core.UnicodeWhitespaceTokenizer
-
Construct a new UnicodeWhitespaceTokenizer using a given AttributeFactory.
- unpackValues(int) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
- unsafeWrite(char) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- unsafeWrite(char[], int, int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- unsafeWrite(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- UNTOKENIZED_ONLY - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
Only output untokenized tokens, which are tokens that would normally be split into several tokens
- UNTOKENIZED_TOKEN_FLAG - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
This flag is used to indicate that the produced "Token" would, if WikipediaTokenizer.TOKENS_ONLY was used, produce multiple tokens.
- UNTOKENIZED_TYPES - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizerFactory
- untokenizedTypes - Variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizerFactory
- UPPER - Static variable in class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated.
- UpperCaseFilter - Class in org.apache.lucene.analysis.core
-
Normalizes token text to UPPER CASE.
- UpperCaseFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.core.UpperCaseFilter
-
Create a new UpperCaseFilter, that normalizes token text to upper case.
- UpperCaseFilterFactory - Class in org.apache.lucene.analysis.core
-
Factory for UpperCaseFilter.
- UpperCaseFilterFactory() - Constructor for class org.apache.lucene.analysis.core.UpperCaseFilterFactory
-
Default ctor for compatibility with SPI
- UpperCaseFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.core.UpperCaseFilterFactory
-
Creates a new UpperCaseFilterFactory
- URL - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizer
-
URL token type
- URL_TYPE - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
URL token type
V
- valueOf(String) - Static method in enum org.apache.lucene.analysis.hunspell.TimeoutPolicy
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizer.Foldings
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum org.apache.lucene.analysis.hunspell.TimeoutPolicy
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizer.Foldings
-
Returns an array containing the constants of this enum type, in the order they are declared.
- vectorValue(int) - Method in class org.apache.lucene.analysis.synonym.word2vec.Word2VecModel
- vectorValue(BytesRef) - Method in class org.apache.lucene.analysis.synonym.word2vec.Word2VecModel
- vspace - Variable in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
value space: stores the interletter values
W
- WAW - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- when(Class<? extends ConditionalTokenFilterFactory>, String...) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Add a ConditionalTokenFilterFactory to the analysis chain
- when(Class<? extends ConditionalTokenFilterFactory>, Map<String, String>) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Add a ConditionalTokenFilterFactory to the analysis chain
- when(String, String...) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Add a ConditionalTokenFilterFactory to the analysis chain
- when(String, Map<String, String>) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Add a ConditionalTokenFilterFactory to the analysis chain
- when(ConditionalTokenFilterFactory) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Add a ConditionalTokenFilterFactory to the analysis chain
- whenTerm(Predicate<CharSequence>) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Apply subsequent token filters if the current token's term matches a predicate
- WHITESPACE - Static variable in class org.apache.lucene.analysis.util.UnicodeProps
-
Bitset with Unicode WHITESPACE code points.
- WhitespaceAnalyzer - Class in org.apache.lucene.analysis.core
-
An Analyzer that uses WhitespaceTokenizer.
- WhitespaceAnalyzer() - Constructor for class org.apache.lucene.analysis.core.WhitespaceAnalyzer
-
Creates a new WhitespaceAnalyzer with a maximum token length of 255 chars
- WhitespaceAnalyzer(int) - Constructor for class org.apache.lucene.analysis.core.WhitespaceAnalyzer
-
Creates a new WhitespaceAnalyzer with a custom maximum token length
- WhitespaceTokenizer - Class in org.apache.lucene.analysis.core
-
A tokenizer that divides text at whitespace characters as defined by Character.isWhitespace(int).
- WhitespaceTokenizer() - Constructor for class org.apache.lucene.analysis.core.WhitespaceTokenizer
-
Construct a new WhitespaceTokenizer.
- WhitespaceTokenizer(int) - Constructor for class org.apache.lucene.analysis.core.WhitespaceTokenizer
-
Construct a new WhitespaceTokenizer using a given max token length
- WhitespaceTokenizer(AttributeFactory) - Constructor for class org.apache.lucene.analysis.core.WhitespaceTokenizer
-
Construct a new WhitespaceTokenizer using a given AttributeFactory.
- WhitespaceTokenizer(AttributeFactory, int) - Constructor for class org.apache.lucene.analysis.core.WhitespaceTokenizer
-
Construct a new WhitespaceTokenizer using a given AttributeFactory.
- WhitespaceTokenizerFactory - Class in org.apache.lucene.analysis.core
-
Factory for WhitespaceTokenizer.
- WhitespaceTokenizerFactory() - Constructor for class org.apache.lucene.analysis.core.WhitespaceTokenizerFactory
-
Default ctor for compatibility with SPI
- WhitespaceTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.core.WhitespaceTokenizerFactory
-
Creates a new WhitespaceTokenizerFactory
- WikipediaTokenizer - Class in org.apache.lucene.analysis.wikipedia
-
Extension of StandardTokenizer that is aware of Wikipedia syntax.
- WikipediaTokenizer() - Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
Creates a new instance of the WikipediaTokenizer.
- WikipediaTokenizer(int, Set<String>) - Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
Creates a new instance of the WikipediaTokenizer.
- WikipediaTokenizer(AttributeFactory, int, Set<String>) - Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
Creates a new instance of the WikipediaTokenizer.
- WikipediaTokenizerFactory - Class in org.apache.lucene.analysis.wikipedia
-
Factory for WikipediaTokenizer.
- WikipediaTokenizerFactory() - Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizerFactory
-
Default ctor for compatibility with SPI
- WikipediaTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizerFactory
-
Creates a new WikipediaTokenizerFactory
- withDefaultMatchVersion(Version) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
This match version is passed as default to all tokenizers or filters.
- withFragmentChecker(FragmentChecker) - Method in class org.apache.lucene.analysis.hunspell.Suggester
-
Returns a copy of this suggester instance with a FragmentChecker hint that can improve the performance of the "Modification" phase.
- withOffsetGap(int) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Sets the offset gap of the analyzer.
- withPositionIncrementGap(int) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Sets the position increment gap of the analyzer.
- withSuggestibleEntryCache() - Method in class org.apache.lucene.analysis.hunspell.Suggester
-
Returns a copy of this suggester instance with better "Enumeration" phase performance (see Suggester documentation), but using more memory.
- withTokenizer(Class<? extends TokenizerFactory>, String...) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Uses the given tokenizer.
- withTokenizer(Class<? extends TokenizerFactory>, Map<String, String>) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Uses the given tokenizer.
- withTokenizer(String, String...) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Uses the given tokenizer.
- withTokenizer(String, Map<String, String>) - Method in class org.apache.lucene.analysis.custom.CustomAnalyzer.Builder
-
Uses the given tokenizer.
- WORD_SEPARATOR - Static variable in class org.apache.lucene.analysis.synonym.SynonymMap
-
For multiword support, you must separate words with this separator.
- WORD_TYPE - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Alphanumeric sequences
- Word2VecModel - Class in org.apache.lucene.analysis.synonym.word2vec
-
Word2VecModel is a class representing the parsed Word2Vec model containing the vectors for each word in the dictionary.
- Word2VecModel(int, int) - Constructor for class org.apache.lucene.analysis.synonym.word2vec.Word2VecModel
- Word2VecSynonymFilter - Class in org.apache.lucene.analysis.synonym.word2vec
-
Applies single-token synonyms from a Word2Vec trained network to an incoming TokenStream.
- Word2VecSynonymFilter(TokenStream, Word2VecSynonymProvider, int, float) - Constructor for class org.apache.lucene.analysis.synonym.word2vec.Word2VecSynonymFilter
-
Apply previously built synonymProvider to incoming tokens.
- Word2VecSynonymFilterFactory - Class in org.apache.lucene.analysis.synonym.word2vec
-
Factory for Word2VecSynonymFilter.
- Word2VecSynonymFilterFactory() - Constructor for class org.apache.lucene.analysis.synonym.word2vec.Word2VecSynonymFilterFactory
-
Default ctor for compatibility with SPI
- Word2VecSynonymFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.synonym.word2vec.Word2VecSynonymFilterFactory
- Word2VecSynonymProvider - Class in org.apache.lucene.analysis.synonym.word2vec
-
The Word2VecSynonymProvider generates the list of synonyms of a term.
- Word2VecSynonymProvider(Word2VecModel) - Constructor for class org.apache.lucene.analysis.synonym.word2vec.Word2VecSynonymProvider
-
Word2VecSynonymProvider constructor
- Word2VecSynonymProviderFactory - Class in org.apache.lucene.analysis.synonym.word2vec
-
Supplies a cache of Word2VecSynonymProvider instances, so that multiple instances of Word2VecSynonymFilterFactory do not instantiate multiple instances of the same SynonymProvider.
- Word2VecSynonymProviderFactory() - Constructor for class org.apache.lucene.analysis.synonym.word2vec.Word2VecSynonymProviderFactory
- WordDelimiterFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Deprecated. Use WordDelimiterGraphFilter instead: it produces a correct token graph so that e.g. PhraseQuery works correctly when it's used in the search time analyzer.
- WordDelimiterFilter(TokenStream, byte[], int, CharArraySet) - Constructor for class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated. Creates a new WordDelimiterFilter
- WordDelimiterFilter(TokenStream, int, CharArraySet) - Constructor for class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilter
-
Deprecated. Creates a new WordDelimiterFilter using WordDelimiterIterator.DEFAULT_WORD_DELIM_TABLE as its charTypeTable
- WordDelimiterFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Deprecated. Use WordDelimiterGraphFilterFactory instead: it produces a correct token graph so that e.g. PhraseQuery works correctly when it's used in the search time analyzer.
- WordDelimiterFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilterFactory
-
Deprecated. Default ctor for compatibility with SPI
- WordDelimiterFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.WordDelimiterFilterFactory
-
Deprecated. Creates a new WordDelimiterFilterFactory
- WordDelimiterGraphFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Splits words into subwords and performs optional transformations on subword groups, producing a correct token graph so that e.g. PhraseQuery works correctly when it's used in the search time analyzer.
- WordDelimiterGraphFilter(TokenStream, boolean, byte[], int, CharArraySet) - Constructor for class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
-
Creates a new WordDelimiterGraphFilter
- WordDelimiterGraphFilter(TokenStream, int, CharArraySet) - Constructor for class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilter
-
Creates a new WordDelimiterGraphFilter using WordDelimiterIterator.DEFAULT_WORD_DELIM_TABLE as its charTypeTable
- WordDelimiterGraphFilterFactory - Class in org.apache.lucene.analysis.miscellaneous
-
Factory for WordDelimiterGraphFilter.
- WordDelimiterGraphFilterFactory() - Constructor for class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilterFactory
-
Default ctor for compatibility with SPI
- WordDelimiterGraphFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.WordDelimiterGraphFilterFactory
-
Creates a new WordDelimiterGraphFilterFactory
- WordDelimiterIterator - Class in org.apache.lucene.analysis.miscellaneous
-
A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterGraphFilter rules.
- WordFormGenerator - Class in org.apache.lucene.analysis.hunspell
-
A utility class used for generating possible word forms by adding affixes to stems (WordFormGenerator.getAllWordForms(String, String, Runnable)), and suggesting stems and flags to generate the given set of words (WordFormGenerator.compress(List, Set, Runnable)).
- WordFormGenerator(Dictionary) - Constructor for class org.apache.lucene.analysis.hunspell.WordFormGenerator
- WordnetSynonymParser - Class in org.apache.lucene.analysis.synonym
-
Parser for wordnet prolog format
- WordnetSynonymParser(boolean, boolean, Analyzer) - Constructor for class org.apache.lucene.analysis.synonym.WordnetSynonymParser
- words - Variable in class org.apache.lucene.analysis.synonym.SynonymMap
-
map<ord, outputword>
- wrapComponents(String, Analyzer.TokenStreamComponents) - Method in class org.apache.lucene.analysis.miscellaneous.LimitTokenCountAnalyzer
- wrapComponents(String, Analyzer.TokenStreamComponents) - Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
- wrapComponents(String, Analyzer.TokenStreamComponents) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- write(char) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- write(char[]) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- write(char[], int, int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- write(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- write(String) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- write(OpenStringBuilder) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
Y
- YEH - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- YEH - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- YEH - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- YEH_BARREE - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- YiddishStemmer - Class in org.tartarus.snowball.ext
-
This class implements the stemming algorithm defined by a snowball script.
- YiddishStemmer() - Constructor for class org.tartarus.snowball.ext.YiddishStemmer
- yyatEOF() - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Returns whether the scanner has reached the end of the reader it reads from.
- yybegin(int) - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Enters a new lexical state.
- yychar() - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Character count processed so far
- yycharat(int) - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Returns the character at the given position from the matched text.
- yyclose() - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Closes the input reader.
- YYEOF - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
This character denotes the end of file.
- YYINITIAL - Static variable in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Lexical States.
- yylength() - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
How many characters were matched.
- yypushback(int) - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Pushes the specified amount of characters back into the input stream.
- yyreset(Reader) - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Resets the scanner to read from a new input stream.
- yystate() - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Returns the current lexical state.
- yytext() - Method in class org.apache.lucene.analysis.email.UAX29URLEmailTokenizerImpl
-
Returns the text matched by the current regular expression.