public final class MockAnalyzer
extends org.apache.lucene.analysis.Analyzer
This analyzer is a replacement for Whitespace/Simple/KeywordAnalyzers for unit tests. If you are testing a custom component such as a queryparser or analyzer-wrapper that consumes analysis streams, its a great idea to test it with this analyzer instead. MockAnalyzer has the following behavior:
MockTokenizer
are turned on for extra
checks that the consumer is consuming properly. These checks can be disabled
with setEnableChecks(boolean)
.
MockTokenizer
Constructor and Description |
---|
MockAnalyzer(Random random)
Create a Whitespace-lowercasing analyzer with no stopwords removal.
|
MockAnalyzer(Random random,
int pattern,
boolean lowerCase)
|
MockAnalyzer(Random random,
int pattern,
boolean lowerCase,
org.apache.lucene.analysis.CharArraySet filter,
boolean enablePositionIncrements)
Creates a new MockAnalyzer.
|
Modifier and Type | Method and Description |
---|---|
int |
getPositionIncrementGap(String fieldName) |
org.apache.lucene.analysis.TokenStream |
reusableTokenStream(String fieldName,
Reader reader) |
void |
setEnableChecks(boolean enableChecks)
Toggle consumer workflow checking: if your test consumes tokenstreams normally you
should leave this enabled.
|
void |
setMaxTokenLength(int length)
Toggle maxTokenLength for MockTokenizer
|
void |
setPositionIncrementGap(int positionIncrementGap) |
org.apache.lucene.analysis.TokenStream |
tokenStream(String fieldName,
Reader reader) |
public MockAnalyzer(Random random, int pattern, boolean lowerCase, org.apache.lucene.analysis.CharArraySet filter, boolean enablePositionIncrements)
random
- Random for payloads behaviorpattern
- pattern constant describing how tokenization should happenlowerCase
- true if the tokenizer should lowercase termsfilter
- CharArraySet describing how terms should be filtered (set of stopwords, etc)enablePositionIncrements
- true if position increments should reflect filtered terms.public MockAnalyzer(Random random, int pattern, boolean lowerCase)
public MockAnalyzer(Random random)
public org.apache.lucene.analysis.TokenStream tokenStream(String fieldName, Reader reader)
tokenStream
in class org.apache.lucene.analysis.Analyzer
public org.apache.lucene.analysis.TokenStream reusableTokenStream(String fieldName, Reader reader) throws IOException
reusableTokenStream
in class org.apache.lucene.analysis.Analyzer
IOException
public void setPositionIncrementGap(int positionIncrementGap)
public int getPositionIncrementGap(String fieldName)
getPositionIncrementGap
in class org.apache.lucene.analysis.Analyzer
public void setEnableChecks(boolean enableChecks)
public void setMaxTokenLength(int length)