public final class SimpleAnalyzer extends ReusableAnalyzerBase
Analyzer that filters LetterTokenizer
with LowerCaseFilter
You must specify the required Version compatibility
when creating CharTokenizer:
LowerCaseTokenizer uses an int based API to normalize and
detect token codepoints. See CharTokenizer.isTokenChar(int) and
CharTokenizer.normalize(int) for details.ReusableAnalyzerBase.TokenStreamComponents| Constructor and Description |
|---|
SimpleAnalyzer()
Deprecated.
use
SimpleAnalyzer(Version) instead |
SimpleAnalyzer(Version matchVersion)
Creates a new
SimpleAnalyzer |
| Modifier and Type | Method and Description |
|---|---|
protected ReusableAnalyzerBase.TokenStreamComponents |
createComponents(String fieldName,
Reader reader)
Creates a new
ReusableAnalyzerBase.TokenStreamComponents instance for this analyzer. |
initReader, reusableTokenStream, tokenStreamclose, getOffsetGap, getPositionIncrementGap, getPreviousTokenStream, setPreviousTokenStreampublic SimpleAnalyzer(Version matchVersion)
SimpleAnalyzermatchVersion - Lucene version to match See above@Deprecated public SimpleAnalyzer()
SimpleAnalyzer(Version) insteadSimpleAnalyzerprotected ReusableAnalyzerBase.TokenStreamComponents createComponents(String fieldName, Reader reader)
ReusableAnalyzerBaseReusableAnalyzerBase.TokenStreamComponents instance for this analyzer.createComponents in class ReusableAnalyzerBasefieldName - the name of the fields content passed to the
ReusableAnalyzerBase.TokenStreamComponents sink as a readerreader - the reader passed to the Tokenizer constructorReusableAnalyzerBase.TokenStreamComponents for this analyzer.