Interface | Description |
---|---|
CharFilterFactory | |
MultiTermAwareComponent |
Implement on any analysis factory component so that it can return an
analysis component factory suitable for partial terms, as found in prefix queries,
wildcard queries, range query endpoints, regex queries, etc.
|
TokenFilterFactory |
A
TokenFilterFactory creates a
TokenFilter to transform one TokenStream
into another. |
TokenizerFactory |
A
TokenizerFactory breaks up a stream of characters
into tokens. |
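
The division of labour between the tokenizer and token filter factories described above can be pictured with a small, self-contained sketch. It does not use the Solr or Lucene classes: `SketchTokenizerFactory`, `SketchTokenFilterFactory`, and `analyze` are hypothetical stand-ins invented for illustration, and a plain `List<String>` takes the place of a real TokenStream.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.function.Function;

public class AnalysisChainSketch {

    /** Hypothetical stand-in: creates a component that breaks characters into tokens. */
    interface SketchTokenizerFactory {
        Function<String, List<String>> create();
    }

    /** Hypothetical stand-in: creates a component that maps one token list to another. */
    interface SketchTokenFilterFactory {
        Function<List<String>, List<String>> create();
    }

    /** Factory whose tokenizer splits on runs of whitespace. */
    static SketchTokenizerFactory whitespaceTokenizerFactory() {
        return () -> text -> {
            List<String> tokens = new ArrayList<>();
            for (String t : text.trim().split("\\s+")) {
                if (!t.isEmpty()) {
                    tokens.add(t);
                }
            }
            return tokens;
        };
    }

    /** Factory whose filter lower-cases every token. */
    static SketchTokenFilterFactory lowerCaseFilterFactory() {
        return () -> tokens -> {
            List<String> out = new ArrayList<>();
            for (String t : tokens) {
                out.add(t.toLowerCase(Locale.ROOT));
            }
            return out;
        };
    }

    /** Tokenizes the text once, then applies each filter in order. */
    static List<String> analyze(String text,
                                SketchTokenizerFactory tokenizerFactory,
                                List<SketchTokenFilterFactory> filterFactories) {
        List<String> tokens = tokenizerFactory.create().apply(text);
        for (SketchTokenFilterFactory factory : filterFactories) {
            tokens = factory.create().apply(tokens);
        }
        return tokens;
    }

    public static void main(String[] args) {
        List<String> result = analyze("Hello Solr World",
                whitespaceTokenizerFactory(),
                Arrays.asList(lowerCaseFilterFactory()));
        System.out.println(result); // prints [hello, solr, world]
    }
}
```

The real factories additionally handle init arguments and produce streaming components; the sketch only shows the ordering of one tokenizer followed by zero or more filters.
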
Class | Description |
---|---|
ArabicLetterTokenizerFactory | Deprecated
(3.1). Use StandardTokenizerFactory instead.
|
ArabicNormalizationFilterFactory |
Factory for
ArabicNormalizationFilter . |
ArabicStemFilterFactory |
Factory for
ArabicStemFilter . |
ASCIIFoldingFilterFactory |
Factory for
ASCIIFoldingFilter . |
BaseCharFilterFactory | |
BaseTokenFilterFactory |
Simple abstract implementation that handles init arg processing.
|
BaseTokenizerFactory |
Simple abstract implementation that handles init arg processing.
|
BeiderMorseFilterFactory |
Factory for
BeiderMorseFilter . |
BrazilianStemFilterFactory |
Factory for
BrazilianStemFilter . |
BufferedTokenStream | Deprecated
This class does not support custom attributes.
|
BulgarianStemFilterFactory |
Factory for
BulgarianStemFilter . |
CapitalizationFilterFactory |
A filter to apply normal capitalization rules to Tokens.
|
ChineseFilterFactory | Deprecated
Use
StopFilterFactory instead. |
ChineseTokenizerFactory | Deprecated
Use
StandardTokenizerFactory instead. |
CJKBigramFilterFactory |
Factory for
CJKBigramFilter . |
CJKTokenizerFactory | Deprecated |
CJKWidthFilterFactory |
Factory for
CJKWidthFilter . |
ClassicFilterFactory |
Factory for
ClassicFilter . |
ClassicTokenizerFactory |
Factory for
ClassicTokenizer . |
CollationKeyFilterFactory |
Factory for
CollationKeyFilter . |
CommonGramsFilter |
Construct bigrams for frequently occurring terms while indexing.
|
CommonGramsFilterFactory |
Constructs a
CommonGramsFilter . |
CommonGramsQueryFilter |
Wrap a CommonGramsFilter optimizing phrase queries by only returning single
words when they are not a member of a bigram.
|
CommonGramsQueryFilterFactory |
Construct
CommonGramsQueryFilter . |
CzechStemFilterFactory |
Factory for
CzechStemFilter . |
DelimitedPayloadTokenFilterFactory |
Factory for
DelimitedPayloadTokenFilter . |
DictionaryCompoundWordTokenFilterFactory |
Factory for
DictionaryCompoundWordTokenFilter . |
DoubleMetaphoneFilter | Deprecated
Use
org.apache.lucene.analysis.phonetic.DoubleMetaphoneFilter instead. |
DoubleMetaphoneFilterFactory |
Factory for
DoubleMetaphoneFilter . |
DutchStemFilterFactory | Deprecated
Use
SnowballPorterFilterFactory with "Dutch" instead,
which has the same functionality. |
EdgeNGramFilterFactory |
Creates new instances of
EdgeNGramTokenFilter . |
EdgeNGramTokenizerFactory |
Creates new instances of
EdgeNGramTokenizer . |
ElisionFilterFactory |
Factory for
ElisionFilter . |
EnglishMinimalStemFilterFactory |
Factory for
EnglishMinimalStemFilter . |
EnglishPorterFilterFactory | Deprecated
Use SnowballPorterFilterFactory with language="English" instead.
|
EnglishPossessiveFilterFactory |
Factory for
EnglishPossessiveFilter . |
FinnishLightStemFilterFactory |
Factory for
FinnishLightStemFilter . |
FrenchLightStemFilterFactory |
Factory for
FrenchLightStemFilter . |
FrenchMinimalStemFilterFactory |
Factory for
FrenchMinimalStemFilter . |
FrenchStemFilterFactory | Deprecated
Use
SnowballPorterFilterFactory with "French" instead,
which has the same functionality. |
GalicianMinimalStemFilterFactory |
Factory for
GalicianMinimalStemFilter . |
GalicianStemFilterFactory |
Factory for
GalicianStemFilter . |
GermanLightStemFilterFactory |
Factory for
GermanLightStemFilter . |
GermanMinimalStemFilterFactory |
Factory for
GermanMinimalStemFilter . |
GermanNormalizationFilterFactory |
Factory for
GermanNormalizationFilter . |
GermanStemFilterFactory |
Factory for
GermanStemFilter . |
GreekLowerCaseFilterFactory |
Factory for
GreekLowerCaseFilter . |
GreekStemFilterFactory |
Factory for
GreekStemFilter . |
HindiNormalizationFilterFactory |
Factory for
HindiNormalizationFilter . |
HindiStemFilterFactory |
Factory for
HindiStemFilter . |
HTMLStripCharFilterFactory |
Factory for
HTMLStripCharFilter . |
HungarianLightStemFilterFactory |
Factory for
HungarianLightStemFilter . |
HunspellStemFilterFactory |
TokenFilterFactory that creates instances of
HunspellStemFilter . |
HyphenatedWordsFilter |
Rejoins words that were hyphenated and broken across two lines, which often
happens when plain text is extracted from documents.
|
HyphenatedWordsFilterFactory |
Factory for
HyphenatedWordsFilter . |
HyphenationCompoundWordTokenFilterFactory |
Factory for
HyphenationCompoundWordTokenFilter . |
ICUCollationKeyFilterFactory |
Factory for
ICUCollationKeyFilter . |
ICUFoldingFilterFactory |
Factory for
ICUFoldingFilter |
ICUNormalizer2FilterFactory |
Factory for
ICUNormalizer2Filter .
Supports the following attributes:
name: a Unicode Normalization Form,
one of 'nfc', 'nfkc', 'nfkc_cf'. |
ICUTokenizerFactory |
Factory for
ICUTokenizer |
ICUTransformFilterFactory |
Factory for
ICUTransformFilter . |
IndicNormalizationFilterFactory |
Factory for
IndicNormalizationFilter . |
IndonesianStemFilterFactory |
Factory for
IndonesianStemFilter . |
IrishLowerCaseFilterFactory |
Factory for
IrishLowerCaseFilter . |
ISOLatin1AccentFilterFactory | Deprecated
Use
ASCIIFoldingFilterFactory instead. |
ItalianLightStemFilterFactory |
Factory for
ItalianLightStemFilter . |
JapaneseBaseFormFilterFactory |
Factory for
JapaneseBaseFormFilter . |
JapaneseKatakanaStemFilterFactory |
Factory for
JapaneseKatakanaStemFilter . |
JapanesePartOfSpeechStopFilterFactory |
Factory for
JapanesePartOfSpeechStopFilter . |
JapaneseReadingFormFilterFactory |
Factory for
JapaneseReadingFormFilter . |
JapaneseTokenizerFactory |
Factory for
JapaneseTokenizer . |
KeepWordFilter |
A TokenFilter that only keeps tokens with text contained in the
required words.
|
KeepWordFilterFactory |
Factory for
KeepWordFilter . |
KeywordMarkerFilterFactory |
Factory for
KeywordMarkerFilter . |
KeywordTokenizerFactory |
Factory for
KeywordTokenizer . |
KStemFilterFactory |
Factory for
KStemFilter |
LatvianStemFilterFactory |
Factory for
LatvianStemFilter . |
LegacyHTMLStripCharFilter | Deprecated
Use
HTMLStripCharFilter instead. |
LegacyHTMLStripCharFilterFactory | Deprecated |
LengthFilterFactory |
Factory for
LengthFilter . |
LetterTokenizerFactory |
Factory for
LetterTokenizer . |
LimitTokenCountFilterFactory |
Factory for
LimitTokenCountFilter . |
LowerCaseFilterFactory |
Factory for
LowerCaseFilter . |
LowerCaseTokenizerFactory |
Factory for
LowerCaseTokenizer . |
MappingCharFilterFactory |
Factory for
MappingCharFilter . |
NGramFilterFactory |
Factory for
NGramTokenFilter . |
NGramTokenizerFactory |
Factory for
NGramTokenizer . |
NorwegianLightStemFilterFactory |
Factory for
NorwegianLightStemFilter . |
NorwegianMinimalStemFilterFactory |
Factory for
NorwegianMinimalStemFilter . |
NumericPayloadTokenFilterFactory |
Factory for
NumericPayloadTokenFilter . |
PathHierarchyTokenizerFactory |
Factory for
PathHierarchyTokenizer . |
PatternReplaceCharFilter |
CharFilter that uses a regular expression for the target of replace string.
|
PatternReplaceCharFilterFactory |
Factory for
PatternReplaceCharFilter . |
PatternReplaceFilter |
A TokenFilter which applies a Pattern to each token in the stream,
replacing match occurrences with the specified replacement string.
|
PatternReplaceFilterFactory |
Factory for
PatternReplaceFilter . |
PatternTokenizer |
This tokenizer uses regex pattern matching to construct distinct tokens
for the input stream.
|
PatternTokenizerFactory |
Factory for
PatternTokenizer . |
PersianCharFilterFactory |
Factory for
PersianCharFilter . |
PersianNormalizationFilterFactory |
Factory for
PersianNormalizationFilter . |
PhoneticFilter | Deprecated
Use
org.apache.lucene.analysis.phonetic.PhoneticFilter instead. |
PhoneticFilterFactory |
Factory for
PhoneticFilter . |
PorterStemFilterFactory |
Factory for
PorterStemFilter . |
PortugueseLightStemFilterFactory |
Factory for
PortugueseLightStemFilter . |
PortugueseMinimalStemFilterFactory |
Factory for
PortugueseMinimalStemFilter . |
PortugueseStemFilterFactory |
Factory for
PortugueseStemFilter . |
PositionFilterFactory |
Factory for
PositionFilter . |
RemoveDuplicatesTokenFilter |
A TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream.
|
RemoveDuplicatesTokenFilterFactory |
Factory for
RemoveDuplicatesTokenFilter . |
ReversedWildcardFilter |
This class produces a special form of reversed tokens, suitable for
better handling of leading wildcards.
|
ReversedWildcardFilterFactory |
Factory for
ReversedWildcardFilter instances. |
ReverseStringFilterFactory |
Factory for
ReverseStringFilter . |
RussianLetterTokenizerFactory | Deprecated
Use
StandardTokenizerFactory instead. |
RussianLightStemFilterFactory |
Factory for
RussianLightStemFilter . |
RussianLowerCaseFilterFactory | Deprecated
Use
LowerCaseFilterFactory instead which has the
same functionality. |
RussianStemFilterFactory | Deprecated
Use
SnowballPorterFilterFactory with "Russian" instead,
which has the same functionality. |
ShingleFilterFactory |
Factory for
ShingleFilter . |
SmartChineseSentenceTokenizerFactory |
Factory for the SmartChineseAnalyzer
SentenceTokenizer . |
SmartChineseWordTokenFilterFactory |
Factory for the SmartChineseAnalyzer
WordTokenFilter .
Note: this class currently emits tokens for punctuation. |
SnowballPorterFilterFactory |
Factory for
SnowballFilter , with configurable language.
Note: use of the "Lovins" stemmer is not recommended, as it is implemented with reflection. |
SolrAnalyzer | |
SolrAnalyzer.TokenStreamInfo | |
SpanishLightStemFilterFactory |
Factory for
SpanishLightStemFilter . |
StandardFilterFactory |
Factory for
StandardFilter . |
StandardTokenizerFactory |
Factory for
StandardTokenizer . |
StemmerOverrideFilterFactory |
Factory for
StemmerOverrideFilter . |
StempelPolishStemFilterFactory |
Factory for
StempelFilter using a Polish stemming table. |
StopFilterFactory |
Factory for
StopFilter . |
SwedishLightStemFilterFactory |
Factory for
SwedishLightStemFilter . |
SynonymFilterFactory |
Factory for
SynonymFilter . |
ThaiWordFilterFactory |
Factory for
ThaiWordFilter . |
TokenizerChain | |
TokenOffsetPayloadTokenFilterFactory |
Factory for
TokenOffsetPayloadTokenFilter . |
TrieTokenizerFactory |
Tokenizer for trie fields.
|
TrimFilter |
Trims leading and trailing whitespace from Tokens in the stream.
|
TrimFilterFactory |
Factory for
TrimFilter . |
TurkishLowerCaseFilterFactory |
Factory for
TurkishLowerCaseFilter . |
TypeAsPayloadTokenFilterFactory |
Factory for
TypeAsPayloadTokenFilter . |
TypeTokenFilterFactory |
Factory class for
TypeTokenFilter . |
UAX29URLEmailTokenizerFactory |
Factory for
UAX29URLEmailTokenizer . |
WhitespaceTokenizerFactory |
Factory for
WhitespaceTokenizer . |
WikipediaTokenizerFactory |
Factory for
WikipediaTokenizer . |
WordDelimiterFilterFactory |
Factory for WordDelimiterFilter.
|
WordDelimiterIterator |
A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterFilter rules.
|