Package org.apache.lucene.analysis.pattern
Set of components for pattern-based (regex) analysis.
-
Class Summary Class Description PatternCaptureGroupFilterFactory Factory forPatternCaptureGroupTokenFilter
.PatternCaptureGroupTokenFilter CaptureGroup uses Java regexes to emit multiple tokens - one for each capture group in one or more patterns.PatternReplaceCharFilter CharFilter that uses a regular expression for the target of replace string.PatternReplaceCharFilterFactory Factory forPatternReplaceCharFilter
.PatternReplaceFilter A TokenFilter which applies a Pattern to each token in the stream, replacing match occurrences with the specified replacement string.PatternReplaceFilterFactory Factory forPatternReplaceFilter
.PatternTokenizer This tokenizer uses regex pattern matching to construct distinct tokens for the input stream.PatternTokenizerFactory Factory forPatternTokenizer
.SimplePatternSplitTokenizer SimplePatternSplitTokenizerFactory Factory forSimplePatternSplitTokenizer
, for producing tokens by splitting according to the provided regexp.SimplePatternTokenizer SimplePatternTokenizerFactory Factory forSimplePatternTokenizer
, for matching tokens based on the provided regexp.