|
|||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |
See:
Description
Class Summary | |
---|---|
ASCIIFoldingFilter | This class converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, if one exists. |
ASCIIFoldingFilterFactory | Factory for ASCIIFoldingFilter . |
CapitalizationFilter | A filter to apply normal capitalization rules to Tokens. |
CapitalizationFilterFactory | Factory for CapitalizationFilter . |
EmptyTokenStream | An always exhausted token stream. |
HyphenatedWordsFilter | When the plain text is extracted from documents, we will often have many words hyphenated and broken into two lines. |
HyphenatedWordsFilterFactory | Factory for HyphenatedWordsFilter . |
KeepWordFilter | A TokenFilter that only keeps tokens with text contained in the required words. |
KeepWordFilterFactory | Factory for KeepWordFilter . |
KeywordMarkerFilter | Marks terms as keywords via the KeywordAttribute . |
KeywordMarkerFilterFactory | Factory for KeywordMarkerFilter . |
LengthFilter | Removes words that are too long or too short from the stream. |
LengthFilterFactory | Factory for LengthFilter . |
LimitTokenCountAnalyzer | This Analyzer limits the number of tokens while indexing. |
LimitTokenCountFilter | This TokenFilter limits the number of tokens while indexing. |
LimitTokenCountFilterFactory | Factory for LimitTokenCountFilter . |
PatternAnalyzer | Deprecated. (4.0) use the pattern-based analysis in the analysis/pattern package instead. |
PerFieldAnalyzerWrapper | This analyzer is used to facilitate scenarios where different fields require different analysis techniques. |
PrefixAndSuffixAwareTokenFilter | Links two PrefixAwareTokenFilter . |
PrefixAwareTokenFilter | Joins two token streams and leaves the last token of the first stream available to be used when updating the token values in the second stream based on that token. |
RemoveDuplicatesTokenFilter | A TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream. |
RemoveDuplicatesTokenFilterFactory | Factory for RemoveDuplicatesTokenFilter . |
SingleTokenTokenStream | A TokenStream containing a single token. |
StemmerOverrideFilter | Provides the ability to override any KeywordAttribute aware stemmer
with custom dictionary-based stemming. |
StemmerOverrideFilterFactory | Factory for StemmerOverrideFilter . |
TrimFilter | Trims leading and trailing whitespace from Tokens in the stream. |
TrimFilterFactory | Factory for TrimFilter . |
WordDelimiterFilter | Splits words into subwords and performs optional transformations on subword groups. |
WordDelimiterFilterFactory | Factory for WordDelimiterFilter . |
WordDelimiterIterator | A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterFilter rules. |
Miscellaneous TokenStreams
|
|||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |