Package org.apache.lucene.analysis.pattern
Set of components for pattern-based (regex) analysis.
Class Summary

Class                                  Description
PatternCaptureGroupFilterFactory       Factory for PatternCaptureGroupTokenFilter.
PatternCaptureGroupTokenFilter         CaptureGroup uses Java regexes to emit multiple tokens, one for each capture group in one or more patterns.
PatternReplaceCharFilter               CharFilter that uses a regular expression for the target of replace string.
PatternReplaceCharFilterFactory        Factory for PatternReplaceCharFilter.
PatternReplaceFilter                   A TokenFilter which applies a Pattern to each token in the stream, replacing match occurrences with the specified replacement string.
PatternReplaceFilterFactory            Factory for PatternReplaceFilter.
PatternTokenizer                       This tokenizer uses regex pattern matching to construct distinct tokens for the input stream.
PatternTokenizerFactory                Factory for PatternTokenizer.
PatternTypingFilter                    Sets a type attribute to a parameterized value when tokens are matched by any of several regex patterns.
PatternTypingFilter.PatternTypingRule  Value-holding class for pattern typing rules.
PatternTypingFilterFactory             Factory for PatternTypingFilter.
SimplePatternSplitTokenizer
SimplePatternSplitTokenizerFactory     Factory for SimplePatternSplitTokenizer, for producing tokens by splitting according to the provided regexp.
SimplePatternTokenizer
SimplePatternTokenizerFactory          Factory for SimplePatternTokenizer, for matching tokens based on the provided regexp.
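
The components in this package fall into a few behaviors: splitting input on a pattern, matching tokens with a pattern, replacing matches inside a token, and emitting one token per capture group. The following is a minimal sketch of those four ideas using only java.util.regex. It does not call any Lucene class. The class name PatternAnalysisSketch and all of its method names are hypothetical, chosen for illustration only, and the real components additionally track offsets, positions and token attributes, which this sketch ignores.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration class; not part of Lucene.
final class PatternAnalysisSketch {

    // Split-style tokenization: the regex describes the delimiters,
    // and the text between delimiter matches becomes the tokens.
    static List<String> splitTokens(String input, String delimiterRegex) {
        List<String> tokens = new ArrayList<>();
        for (String piece : Pattern.compile(delimiterRegex).split(input)) {
            if (!piece.isEmpty()) {
                tokens.add(piece); // skip empty pieces between adjacent delimiters
            }
        }
        return tokens;
    }

    // Match-style tokenization: the regex describes the tokens themselves,
    // and everything that does not match is discarded.
    static List<String> matchTokens(String input, String tokenRegex) {
        List<String> tokens = new ArrayList<>();
        Matcher m = Pattern.compile(tokenRegex).matcher(input);
        while (m.find()) {
            if (!m.group().isEmpty()) {
                tokens.add(m.group());
            }
        }
        return tokens;
    }

    // Replace-style filtering: every match inside a single token
    // is replaced with the given replacement string.
    static String replaceInToken(String token, String regex, String replacement) {
        return Pattern.compile(regex).matcher(token).replaceAll(replacement);
    }

    // Capture-group-style filtering: one output token per non-empty
    // capture group, for every match found in the input token.
    static List<String> captureGroupTokens(String token, String regex) {
        List<String> tokens = new ArrayList<>();
        Matcher m = Pattern.compile(regex).matcher(token);
        while (m.find()) {
            for (int g = 1; g <= m.groupCount(); g++) {
                String captured = m.group(g);
                if (captured != null && !captured.isEmpty()) {
                    tokens.add(captured);
                }
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(splitTokens("aaa, bbb ,ccc", "\\s*,\\s*"));
        System.out.println(matchTokens("abc123def456", "[0-9]+"));
        System.out.println(replaceInToken("foo-bar-baz", "-", "_"));
        System.out.println(captureGroupTokens("camelCaseWord", "([A-Z]?[a-z]+)"));
    }
}
```

Note that this sketch uses java.util.regex syntax throughout. The SimplePattern tokenizers are configured with Lucene's own regular expression syntax, so patterns written for the sketch are not guaranteed to carry over to them unchanged.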