Packages that use org.apache.lucene.analysis.miscellaneous | |
---|---|
org.apache.lucene.analysis.miscellaneous | Miscellaneous TokenStreams |
Classes in org.apache.lucene.analysis.miscellaneous used by org.apache.lucene.analysis.miscellaneous | |
---|---|
ASCIIFoldingFilter
Converts alphabetic, numeric, and symbolic Unicode characters that are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, where such equivalents exist. |
|
CapitalizationFilter
A filter to apply normal capitalization rules to Tokens. |
|
HyphenatedWordsFilter
Plain text extracted from documents often contains words that were hyphenated and broken across two lines; this filter joins such words back together. |
|
LengthFilter
Removes words that are too long or too short from the stream. |
|
PatternAnalyzer
Deprecated. (4.0) Use the pattern-based analysis in the analysis/pattern package instead. |
|
RemoveDuplicatesTokenFilter
A TokenFilter that removes Tokens with the same position and term text as the previous token in the stream. |
|
TrimFilter
Trims leading and trailing whitespace from Tokens in the stream. |
|
WordDelimiterFilter
Splits words into subwords and performs optional transformations on subword groups. |
|
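To make the one-line summaries above more concrete, here is a minimal plain-Java sketch of the behavior described for TrimFilter, LengthFilter, HyphenatedWordsFilter, and ASCIIFoldingFilter. It is an illustration under stated assumptions, not Lucene code: tokens are modelled as plain strings in a `List`, the class name `TokenFilterSketch` and all method names are hypothetical, and the folding step only strips combining accents via `java.text.Normalizer`, which covers far fewer characters than the real filter.

```java
import java.text.Normalizer;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Plain-Java stand-ins for four of the filters listed above.
// Assumption: tokens are simple strings; this does not use Lucene's TokenStream API.
final class TokenFilterSketch {

    // Rough analogue of TrimFilter: strip leading and trailing whitespace from each token.
    static List<String> trimTokens(List<String> tokens) {
        List<String> out = new ArrayList<>();
        for (String t : tokens) {
            out.add(t.trim());
        }
        return out;
    }

    // Rough analogue of LengthFilter: keep only tokens whose length lies in [min, max].
    static List<String> filterByLength(List<String> tokens, int min, int max) {
        List<String> out = new ArrayList<>();
        for (String t : tokens) {
            if (t.length() >= min && t.length() <= max) {
                out.add(t);
            }
        }
        return out;
    }

    // Rough analogue of HyphenatedWordsFilter: a token ending in '-' is
    // glued to the token that follows it, and the hyphen is dropped.
    static List<String> joinHyphenated(List<String> tokens) {
        List<String> out = new ArrayList<>();
        StringBuilder pending = new StringBuilder();
        for (String t : tokens) {
            if (t.length() > 1 && t.endsWith("-")) {
                // Remember the fragment without its trailing hyphen.
                pending.append(t, 0, t.length() - 1);
            } else {
                pending.append(t);
                out.add(pending.toString());
                pending.setLength(0);
            }
        }
        if (pending.length() > 0) {
            // The input ended on a hyphenated fragment: emit what was collected.
            out.add(pending.toString());
        }
        return out;
    }

    // Very limited analogue of ASCIIFoldingFilter: decompose to NFD, then
    // remove combining marks. Characters without a decomposition stay unchanged.
    static String foldAccents(String token) {
        String decomposed = Normalizer.normalize(token, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        List<String> raw = Arrays.asList(" caf\u00e9 ", "a", "ecologi-", "cal");
        List<String> tokens = filterByLength(joinHyphenated(trimTokens(raw)), 2, 20);
        for (String t : tokens) {
            System.out.println(foldAccents(t));
        }
    }
}
```

Running `main` chains the four steps in the order trim, join, length filter, fold. The one-character token is dropped by the length bounds and the two hyphenated fragments come out as a single token. In Lucene itself these steps would be expressed by wrapping one TokenStream in another rather than by passing lists around.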