ArabicLetterTokenizerFactory |
Deprecated
(3.1) Use StandardTokenizerFactory instead.
|
ArabicNormalizationFilterFactory |
|
ArabicStemFilterFactory |
|
ASCIIFoldingFilterFactory |
|
BaseCharFilterFactory |
|
BaseTokenFilterFactory |
Simple abstract implementation that handles init arg processing.
|
BaseTokenizerFactory |
Simple abstract implementation that handles init arg processing.
|
BeiderMorseFilterFactory |
|
BrazilianStemFilterFactory |
|
BufferedTokenStream |
Deprecated
This class does not support custom attributes.
|
BulgarianStemFilterFactory |
|
CapitalizationFilterFactory |
A filter to apply normal capitalization rules to Tokens.
|
ChineseFilterFactory |
Deprecated
|
ChineseTokenizerFactory |
Deprecated
|
CJKBigramFilterFactory |
|
CJKTokenizerFactory |
Deprecated
|
CJKWidthFilterFactory |
|
ClassicFilterFactory |
|
ClassicTokenizerFactory |
|
CollationKeyFilterFactory |
|
CommonGramsFilter |
Construct bigrams for frequently occurring terms while indexing.
|
CommonGramsFilterFactory |
|
CommonGramsQueryFilter |
Wrap a CommonGramsFilter optimizing phrase queries by only returning single
words when they are not a member of a bigram.
|
CommonGramsQueryFilterFactory |
|
CzechStemFilterFactory |
|
DelimitedPayloadTokenFilterFactory |
|
DictionaryCompoundWordTokenFilterFactory |
|
DoubleMetaphoneFilter |
Deprecated
|
DoubleMetaphoneFilterFactory |
|
DutchStemFilterFactory |
Deprecated
|
EdgeNGramFilterFactory |
|
EdgeNGramTokenizerFactory |
|
ElisionFilterFactory |
|
EnglishMinimalStemFilterFactory |
|
EnglishPorterFilterFactory |
Deprecated
Use SnowballPorterFilterFactory with language="English" instead.
|
EnglishPossessiveFilterFactory |
|
FinnishLightStemFilterFactory |
|
FrenchLightStemFilterFactory |
|
FrenchMinimalStemFilterFactory |
|
FrenchStemFilterFactory |
Deprecated
|
GalicianMinimalStemFilterFactory |
|
GalicianStemFilterFactory |
|
GermanLightStemFilterFactory |
|
GermanMinimalStemFilterFactory |
|
GermanNormalizationFilterFactory |
|
GermanStemFilterFactory |
|
GreekLowerCaseFilterFactory |
|
GreekStemFilterFactory |
|
HindiNormalizationFilterFactory |
|
HindiStemFilterFactory |
|
HTMLStripCharFilterFactory |
|
HungarianLightStemFilterFactory |
|
HunspellStemFilterFactory |
|
HyphenatedWordsFilter |
When plain text is extracted from documents, many words are often hyphenated and broken across
two lines; this filter joins such words back together.
|
HyphenatedWordsFilterFactory |
|
HyphenationCompoundWordTokenFilterFactory |
|
ICUCollationKeyFilterFactory |
|
ICUFoldingFilterFactory |
|
ICUNormalizer2FilterFactory |
|
ICUTokenizerFactory |
|
ICUTransformFilterFactory |
|
IndicNormalizationFilterFactory |
|
IndonesianStemFilterFactory |
|
IrishLowerCaseFilterFactory |
|
ISOLatin1AccentFilterFactory |
Deprecated
|
ItalianLightStemFilterFactory |
|
JapaneseBaseFormFilterFactory |
|
JapaneseKatakanaStemFilterFactory |
|
JapanesePartOfSpeechStopFilterFactory |
|
JapaneseReadingFormFilterFactory |
|
JapaneseTokenizerFactory |
|
KeepWordFilter |
A TokenFilter that only keeps tokens with text contained in the
required words.
|
KeepWordFilterFactory |
|
KeywordMarkerFilterFactory |
|
KeywordTokenizerFactory |
|
KStemFilterFactory |
|
LatvianStemFilterFactory |
|
LegacyHTMLStripCharFilter |
Deprecated
|
LegacyHTMLStripCharFilterFactory |
Deprecated
|
LengthFilterFactory |
|
LetterTokenizerFactory |
|
LimitTokenCountFilterFactory |
|
LowerCaseFilterFactory |
|
LowerCaseTokenizerFactory |
|
MappingCharFilterFactory |
|
NGramFilterFactory |
|
NGramTokenizerFactory |
|
NorwegianLightStemFilterFactory |
|
NorwegianMinimalStemFilterFactory |
|
NumericPayloadTokenFilterFactory |
|
PathHierarchyTokenizerFactory |
|
PatternReplaceCharFilter |
CharFilter that uses a regular expression to select the text to be replaced.
|
PatternReplaceCharFilterFactory |
|
PatternReplaceFilter |
A TokenFilter which applies a Pattern to each token in the stream,
replacing match occurrences with the specified replacement string.
|
PatternReplaceFilterFactory |
|
PatternTokenizer |
This tokenizer uses regex pattern matching to construct distinct tokens
for the input stream.
|
PatternTokenizerFactory |
|
PersianCharFilterFactory |
|
PersianNormalizationFilterFactory |
|
PhoneticFilter |
Deprecated
|
PhoneticFilterFactory |
|
PorterStemFilterFactory |
|
PortugueseLightStemFilterFactory |
|
PortugueseMinimalStemFilterFactory |
|
PortugueseStemFilterFactory |
|
PositionFilterFactory |
|
RemoveDuplicatesTokenFilter |
A TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream.
|
RemoveDuplicatesTokenFilterFactory |
|
ReversedWildcardFilter |
This class produces a special form of reversed tokens, suitable for
better handling of leading wildcards.
|
ReversedWildcardFilterFactory |
|
ReverseStringFilterFactory |
|
RussianLetterTokenizerFactory |
Deprecated
|
RussianLightStemFilterFactory |
|
RussianLowerCaseFilterFactory |
Deprecated
|
RussianStemFilterFactory |
Deprecated
|
ShingleFilterFactory |
|
SmartChineseSentenceTokenizerFactory |
|
SmartChineseWordTokenFilterFactory |
Factory for the SmartChineseAnalyzer WordTokenFilter
Note: this class will currently emit tokens for punctuation.
|
SnowballPorterFilterFactory |
Factory for SnowballFilter, with configurable language.
Note: Use of the "Lovins" stemmer is not recommended, as it is implemented with reflection.
|
SolrAnalyzer |
|
SolrAnalyzer.TokenStreamInfo |
|
SpanishLightStemFilterFactory |
|
StandardFilterFactory |
|
StandardTokenizerFactory |
|
StemmerOverrideFilterFactory |
|
StempelPolishStemFilterFactory |
|
StopFilterFactory |
|
SwedishLightStemFilterFactory |
|
SynonymFilterFactory |
|
ThaiWordFilterFactory |
|
TokenizerChain |
|
TokenOffsetPayloadTokenFilterFactory |
|
TrieTokenizerFactory |
Tokenizer for trie fields.
|
TrimFilter |
Trims leading and trailing whitespace from Tokens in the stream.
|
TrimFilterFactory |
|
TurkishLowerCaseFilterFactory |
|
TypeAsPayloadTokenFilterFactory |
|
TypeTokenFilterFactory |
|
UAX29URLEmailTokenizerFactory |
|
WhitespaceTokenizerFactory |
|
WikipediaTokenizerFactory |
|
WordDelimiterFilterFactory |
Factory for WordDelimiterFilter.
|
WordDelimiterIterator |
A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterFilter rules.
|
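
The TrimFilter, PatternReplaceFilter and KeepWordFilter entries above each describe a simple per-token transformation. The following is a minimal sketch of those three behaviors chained together, written against plain Java lists rather than the Lucene TokenStream API. The class name TokenFilterSketch and its methods are hypothetical and exist only for illustration; the real filters operate on token attributes and have more options (for example, PatternReplaceFilter can replace only the first match, whereas this sketch always replaces all matches).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Pattern;

// Hypothetical illustration only: none of these names come from Solr or Lucene.
final class TokenFilterSketch {

    // Like the TrimFilter description: strip leading and trailing whitespace.
    static List<String> trim(List<String> tokens) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            out.add(token.trim());
        }
        return out;
    }

    // Like the PatternReplaceFilter description: apply a pattern to each token
    // and replace every match with the given replacement string.
    static List<String> patternReplace(List<String> tokens, Pattern pattern, String replacement) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            out.add(pattern.matcher(token).replaceAll(replacement));
        }
        return out;
    }

    // Like the KeepWordFilter description: keep only tokens whose text is
    // contained in the set of required words.
    static List<String> keepWords(List<String> tokens, Set<String> required) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            if (required.contains(token)) {
                out.add(token);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList(" solr ", "lucene-3", "tika");
        Set<String> required = new HashSet<>(Arrays.asList("solr", "lucene"));

        // Chain: trim, then drop every character outside a-z, then keep required words.
        List<String> result = keepWords(
                patternReplace(trim(tokens), Pattern.compile("[^a-z]"), ""),
                required);

        System.out.println(result); // prints [solr, lucene]
    }
}
```

In a Solr schema the same ordering concern applies: filters run in the order they are listed, so a keep-words step placed before normalization would see the raw, unnormalized token text.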