|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.apache.lucene.analysis.Analyzer org.apache.lucene.analysis.MockAnalyzer
public final class MockAnalyzer
Analyzer for testing
This analyzer is a replacement for Whitespace/Simple/KeywordAnalyzers for unit tests. If you are testing a custom component such as a queryparser or analyzer-wrapper that consumes analysis streams, its a great idea to test it with this analyzer instead. MockAnalyzer has the following behavior:
MockTokenizer
are turned on for extra
checks that the consumer is consuming properly. These checks can be disabled
with setEnableChecks(boolean)
.
MockTokenizer
Nested Class Summary |
---|
Nested classes/interfaces inherited from class org.apache.lucene.analysis.Analyzer |
---|
Analyzer.GlobalReuseStrategy, Analyzer.PerFieldReuseStrategy, Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents |
Field Summary |
---|
Fields inherited from class org.apache.lucene.analysis.Analyzer |
---|
GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY |
Constructor Summary | |
---|---|
MockAnalyzer(Random random)
Create a Whitespace-lowercasing analyzer with no stopwords removal. |
|
MockAnalyzer(Random random,
CharacterRunAutomaton runAutomaton,
boolean lowerCase)
Calls MockAnalyzer(random, runAutomaton, lowerCase, MockTokenFilter.EMPTY_STOPSET, false ). |
|
MockAnalyzer(Random random,
CharacterRunAutomaton runAutomaton,
boolean lowerCase,
CharacterRunAutomaton filter)
Creates a new MockAnalyzer. |
Method Summary | |
---|---|
Analyzer.TokenStreamComponents |
createComponents(String fieldName,
Reader reader)
|
int |
getOffsetGap(String fieldName)
Get the offset gap between tokens in fields if several fields with the same name were added. |
int |
getPositionIncrementGap(String fieldName)
|
void |
setEnableChecks(boolean enableChecks)
Toggle consumer workflow checking: if your test consumes tokenstreams normally you should leave this enabled. |
void |
setMaxTokenLength(int length)
Toggle maxTokenLength for MockTokenizer |
void |
setOffsetGap(int offsetGap)
Set a new offset gap which will then be added to the offset when several fields with the same name are indexed |
void |
setPositionIncrementGap(int positionIncrementGap)
|
Methods inherited from class org.apache.lucene.analysis.Analyzer |
---|
close, getReuseStrategy, initReader, tokenStream, tokenStream |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public MockAnalyzer(Random random, CharacterRunAutomaton runAutomaton, boolean lowerCase, CharacterRunAutomaton filter)
random
- Random for payloads behaviorrunAutomaton
- DFA describing how tokenization should happen (e.g. [a-zA-Z]+)lowerCase
- true if the tokenizer should lowercase termsfilter
- DFA describing how terms should be filtered (set of stopwords, etc)public MockAnalyzer(Random random, CharacterRunAutomaton runAutomaton, boolean lowerCase)
MockAnalyzer(random, runAutomaton, lowerCase, MockTokenFilter.EMPTY_STOPSET, false
).
public MockAnalyzer(Random random)
Calls MockAnalyzer(random, MockTokenizer.WHITESPACE, true, MockTokenFilter.EMPTY_STOPSET, false
).
Method Detail |
---|
public Analyzer.TokenStreamComponents createComponents(String fieldName, Reader reader)
createComponents
in class Analyzer
public void setPositionIncrementGap(int positionIncrementGap)
public int getPositionIncrementGap(String fieldName)
getPositionIncrementGap
in class Analyzer
public void setOffsetGap(int offsetGap)
offsetGap
- The offset gap that should be used.public int getOffsetGap(String fieldName)
getOffsetGap
in class Analyzer
fieldName
- Currently not used, the same offset gap is returned for each field.public void setEnableChecks(boolean enableChecks)
public void setMaxTokenLength(int length)
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |