|
|||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |
See:
Description
Class Summary | |
---|---|
ASCIIFoldingFilter | This class converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, if one exists. |
ASCIIFoldingFilterFactory | Factory for ASCIIFoldingFilter . |
CapitalizationFilter | A filter to apply normal capitalization rules to Tokens. |
CapitalizationFilterFactory | Factory for CapitalizationFilter . |
CodepointCountFilter | Removes words that are too long or too short from the stream. |
CodepointCountFilterFactory | Factory for CodepointCountFilter . |
EmptyTokenStream | An always exhausted token stream. |
HyphenatedWordsFilter | When the plain text is extracted from documents, we will often have many words hyphenated and broken into two lines. |
HyphenatedWordsFilterFactory | Factory for HyphenatedWordsFilter . |
KeepWordFilter | A TokenFilter that only keeps tokens with text contained in the required words. |
KeepWordFilterFactory | Factory for KeepWordFilter . |
KeywordMarkerFilter | Marks terms as keywords via the KeywordAttribute . |
KeywordMarkerFilterFactory | Factory for KeywordMarkerFilter . |
KeywordRepeatFilter | This TokenFilter emits each incoming token twice once as keyword and once non-keyword, in other words once with
KeywordAttribute.setKeyword(boolean) set to true and once set to false . |
KeywordRepeatFilterFactory | Factory for KeywordRepeatFilter . |
LengthFilter | Removes words that are too long or too short from the stream. |
LengthFilterFactory | Factory for LengthFilter . |
LimitTokenCountAnalyzer | This Analyzer limits the number of tokens while indexing. |
LimitTokenCountFilter | This TokenFilter limits the number of tokens while indexing. |
LimitTokenCountFilterFactory | Factory for LimitTokenCountFilter . |
LimitTokenPositionFilter | This TokenFilter limits its emitted tokens to those with positions that are not greater than the configured limit. |
LimitTokenPositionFilterFactory | Factory for LimitTokenPositionFilter . |
PatternAnalyzer | Deprecated. (4.0) use the pattern-based analysis in the analysis/pattern package instead. |
PatternKeywordMarkerFilter | Marks terms as keywords via the KeywordAttribute . |
PerFieldAnalyzerWrapper | This analyzer is used to facilitate scenarios where different fields require different analysis techniques. |
PrefixAndSuffixAwareTokenFilter | Links two PrefixAwareTokenFilter . |
PrefixAwareTokenFilter | Joins two token streams and leaves the last token of the first stream available to be used when updating the token values in the second stream based on that token. |
RemoveDuplicatesTokenFilter | A TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream. |
RemoveDuplicatesTokenFilterFactory | Factory for RemoveDuplicatesTokenFilter . |
ScandinavianFoldingFilter | This filter folds Scandinavian characters åÅäæÄÆ->a and öÖøØ->o. |
ScandinavianFoldingFilterFactory | Factory for ScandinavianFoldingFilter . |
ScandinavianNormalizationFilter | This filter normalize use of the interchangeable Scandinavian characters æÆäÄöÖøØ and folded variants (aa, ao, ae, oe and oo) by transforming them to åÅæÆøØ. |
ScandinavianNormalizationFilterFactory | Factory for ScandinavianNormalizationFilter . |
SetKeywordMarkerFilter | Marks terms as keywords via the KeywordAttribute . |
SingleTokenTokenStream | A TokenStream containing a single token. |
StemmerOverrideFilter | Provides the ability to override any KeywordAttribute aware stemmer
with custom dictionary-based stemming. |
StemmerOverrideFilter.Builder | This builder builds an FST for the StemmerOverrideFilter |
StemmerOverrideFilter.StemmerOverrideMap | A read-only 4-byte FST backed map that allows fast case-insensitive key
value lookups for StemmerOverrideFilter |
StemmerOverrideFilterFactory | Factory for StemmerOverrideFilter . |
TrimFilter | Trims leading and trailing whitespace from Tokens in the stream. |
TrimFilterFactory | Factory for TrimFilter . |
WordDelimiterFilter | Splits words into subwords and performs optional transformations on subword groups. |
WordDelimiterFilterFactory | Factory for WordDelimiterFilter . |
WordDelimiterIterator | A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterFilter rules. |
Miscellaneous TokenStreams
|
|||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |