Uses of Package
Packages that use org.apache.lucene.analysis.miscellaneous

Package: Description
org.apache.lucene.analysis.custom: A general-purpose Analyzer that can be created with a builder-style API.
org.apache.lucene.analysis.miscellaneous: Miscellaneous Tokenstreams.

Classes in org.apache.lucene.analysis.miscellaneous used by org.apache.lucene.analysis.custom

Class: Description
ConditionalTokenFilterFactory: Abstract parent class for analysis factories that create

Classes in org.apache.lucene.analysis.miscellaneous used by org.apache.lucene.analysis.miscellaneous

Class: Description
CapitalizationFilter: A filter to apply normal capitalization rules to Tokens.
CodepointCountFilter: Removes words that are too long or too short from the stream.
ConcatenateGraphFilter.BytesRefBuilderTermAttribute: Attribute providing access to the term builder and UTF-16 conversion.
ConditionalTokenFilter: Allows skipping TokenFilters based on the current set of attributes.
ConditionalTokenFilterFactory: Abstract parent class for analysis factories that create
DelimitedTermFrequencyTokenFilter: Characters before the delimiter are the "token"; the textual integer after it is the term frequency.
HyphenatedWordsFilter: When the plain text is extracted from documents, we will often have many words hyphenated and broken into two lines.
KeywordMarkerFilter: Marks terms as keywords via the
LengthFilter: Removes words that are too long or too short from the stream.
RemoveDuplicatesTokenFilter: A TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream.
ScandinavianNormalizationFilter: This filter normalizes the use of the interchangeable Scandinavian characters æÆäÄöÖøØ and folded variants (aa, ao, ae, oe and oo) by transforming them to åÅæÆøØ.
ScandinavianNormalizer.Foldings: List of possible foldings that can be used when configuring the filter.
StemmerOverrideFilter.StemmerOverrideMap: A read-only 4-byte FST backed map that allows fast case-insensitive key value lookups for
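
CodepointCountFilter and LengthFilter carry the same one-line description above, so the difference between them is easy to miss. The sketch below assumes, as the class names suggest, that one measures a term in Unicode code points and the other in UTF-16 code units. It uses plain Java only and does not call Lucene; the class LengthFilterSketch and its two methods are hypothetical names chosen for illustration, not part of the library.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only: this is not Lucene code.
public class LengthFilterSketch {

    // Keeps tokens whose length in UTF-16 code units lies in [min, max].
    public static List<String> keepByUtf16Length(List<String> tokens, int min, int max) {
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            int length = token.length(); // counts UTF-16 code units
            if (length >= min && length <= max) {
                kept.add(token);
            }
        }
        return kept;
    }

    // Keeps tokens whose length in Unicode code points lies in [min, max].
    public static List<String> keepByCodePointCount(List<String> tokens, int min, int max) {
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            int count = token.codePointCount(0, token.length()); // counts code points
            if (count >= min && count <= max) {
                kept.add(token);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // U+1F600 is one code point but two UTF-16 code units (a surrogate pair).
        String supplementary = "\uD83D\uDE00";
        List<String> tokens = Arrays.asList("a", "ab", supplementary, "abcd");

        // With bounds [1, 1], the code-unit rule keeps only "a", while the
        // code-point rule also keeps the supplementary character.
        System.out.println(keepByUtf16Length(tokens, 1, 1).size());    // 1
        System.out.println(keepByCodePointCount(tokens, 1, 1).size()); // 2
    }
}
```

The two rules agree for text in the Basic Multilingual Plane and diverge only for supplementary characters such as emoji or rare CJK ideographs, which is when the choice between the two filters matters.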