java.lang.Object
  org.apache.lucene.analysis.Analyzer
    org.apache.lucene.analysis.AnalyzerWrapper

public abstract class AnalyzerWrapper extends Analyzer
Extension to Analyzer suitable for Analyzers which wrap other Analyzers.

getWrappedAnalyzer(String) allows the Analyzer to wrap multiple Analyzers which are selected on a per-field basis.

wrapComponents(String, Analyzer.TokenStreamComponents) allows the TokenStreamComponents of the wrapped Analyzer to then be wrapped (such as by adding a new TokenFilter) to form new TokenStreamComponents.
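The delegation pattern described above can be sketched without Lucene on the classpath. Everything in this block is a hypothetical stand-in: SketchAnalyzer, SketchAnalyzerWrapper and PerFieldSketch are not Lucene classes, tokens are modelled as plain strings instead of TokenStreamComponents, and the lowercasing step only plays the role that an added TokenFilter would play. It assumes the wrapper first asks getWrappedAnalyzer for the per-field delegate and then passes the delegate's output through wrapComponents.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical stand-in for an analyzer: turns field text into tokens.
// This is NOT Lucene's Analyzer; it only mirrors the delegation pattern.
interface SketchAnalyzer {
    List<String> analyze(String fieldName, String text);
}

// Stand-in for the wrapper: subclasses pick a delegate per field and may
// post-process whatever that delegate produced.
abstract class SketchAnalyzerWrapper implements SketchAnalyzer {
    // Counterpart of getWrappedAnalyzer(String).
    protected abstract SketchAnalyzer getWrappedAnalyzer(String fieldName);

    // Counterpart of wrapComponents(...): passes tokens through unchanged by default.
    protected List<String> wrapComponents(String fieldName, List<String> tokens) {
        return tokens;
    }

    // Counterpart of the final createComponents(...): delegate first, then wrap.
    @Override
    public final List<String> analyze(String fieldName, String text) {
        return wrapComponents(fieldName, getWrappedAnalyzer(fieldName).analyze(fieldName, text));
    }
}

class PerFieldSketch extends SketchAnalyzerWrapper {
    private final SketchAnalyzer defaultAnalyzer;
    private final Map<String, SketchAnalyzer> perField;

    PerFieldSketch(SketchAnalyzer defaultAnalyzer, Map<String, SketchAnalyzer> perField) {
        this.defaultAnalyzer = defaultAnalyzer;
        this.perField = perField;
    }

    @Override
    protected SketchAnalyzer getWrappedAnalyzer(String fieldName) {
        // Fall back to the default delegate for fields without a dedicated one.
        SketchAnalyzer analyzer = perField.get(fieldName);
        return analyzer != null ? analyzer : defaultAnalyzer;
    }

    @Override
    protected List<String> wrapComponents(String fieldName, List<String> tokens) {
        // Plays the role of appending a lowercasing filter to the delegate's output.
        List<String> lowered = new ArrayList<>();
        for (String token : tokens) {
            lowered.add(token.toLowerCase(Locale.ROOT));
        }
        return lowered;
    }

    // Builds a wrapper with a whitespace default and a keyword delegate for "id".
    static List<String> demo(String fieldName, String text) {
        SketchAnalyzer whitespace = (f, t) -> Arrays.asList(t.trim().split("\\s+"));
        SketchAnalyzer keyword = (f, t) -> Collections.singletonList(t);
        Map<String, SketchAnalyzer> perField = new HashMap<>();
        perField.put("id", keyword);
        return new PerFieldSketch(whitespace, perField).analyze(fieldName, text);
    }

    public static void main(String[] args) {
        System.out.println(demo("title", "Hello World")); // [hello, world]
        System.out.println(demo("id", "AB 12"));          // [ab 12]
    }
}
```

The "id" field keeps its text as a single token because its delegate is the keyword stand-in, while every other field is split on whitespace; both pass through the same wrapping step.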
Nested Class Summary |
---|
Nested classes/interfaces inherited from class org.apache.lucene.analysis.Analyzer |
---|
Analyzer.GlobalReuseStrategy, Analyzer.PerFieldReuseStrategy, Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents |
Field Summary |
---|
Fields inherited from class org.apache.lucene.analysis.Analyzer |
---|
GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY |
Constructor Summary | |
---|---|
protected | AnalyzerWrapper(): Deprecated. Use AnalyzerWrapper(Analyzer.ReuseStrategy) and specify a valid Analyzer.ReuseStrategy, probably retrieved from the wrapped analyzer using Analyzer.getReuseStrategy(). |
protected | AnalyzerWrapper(Analyzer.ReuseStrategy reuseStrategy): Creates a new AnalyzerWrapper with the given reuse strategy. |
Method Summary | |
---|---|
protected Analyzer.TokenStreamComponents | createComponents(String fieldName, Reader aReader): Creates a new Analyzer.TokenStreamComponents instance for this analyzer. |
int | getOffsetGap(String fieldName): Just like Analyzer.getPositionIncrementGap(java.lang.String), except for Token offsets instead. |
int | getPositionIncrementGap(String fieldName): Invoked before indexing an IndexableField instance if terms have already been added to that field. |
protected abstract Analyzer | getWrappedAnalyzer(String fieldName): Retrieves the wrapped Analyzer appropriate for analyzing the field with the given name. |
Reader | initReader(String fieldName, Reader reader): Override this if you want to add a CharFilter chain. |
protected Analyzer.TokenStreamComponents | wrapComponents(String fieldName, Analyzer.TokenStreamComponents components): Wraps / alters the given TokenStreamComponents, taken from the wrapped Analyzer, to form new components. |
protected Reader | wrapReader(String fieldName, Reader reader): Wraps / alters the given Reader. |
Methods inherited from class org.apache.lucene.analysis.Analyzer |
---|
close, getReuseStrategy, tokenStream, tokenStream |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
@Deprecated
protected AnalyzerWrapper()
Deprecated. Use AnalyzerWrapper(Analyzer.ReuseStrategy) and specify a valid Analyzer.ReuseStrategy, probably retrieved from the wrapped analyzer using Analyzer.getReuseStrategy().
Creates a new AnalyzerWrapper. Because the Analyzer.ReuseStrategy of the wrapped Analyzers is unknown, Analyzer.PER_FIELD_REUSE_STRATEGY is assumed.
protected AnalyzerWrapper(Analyzer.ReuseStrategy reuseStrategy)
Creates a new AnalyzerWrapper with the given reuse strategy.
If you want to wrap a single delegate Analyzer, you can probably reuse its strategy when instantiating this subclass:
    super(delegate.getReuseStrategy());
If you choose different analyzers per field, use Analyzer.PER_FIELD_REUSE_STRATEGY.
See Also:
    Analyzer.getReuseStrategy()
Method Detail |
---|
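protected abstract Analyzer getWrappedAnalyzer(String fieldName)
Retrieves the wrapped Analyzer appropriate for analyzing the field with the given name.
Parameters:
    fieldName - name of the field which is to be analyzed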
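protected Analyzer.TokenStreamComponents wrapComponents(String fieldName, Analyzer.TokenStreamComponents components)
Wraps / alters the given TokenStreamComponents, taken from the wrapped Analyzer, to form new components.
Parameters:
    fieldName - name of the field which is to be analyzed
    components - TokenStreamComponents taken from the wrapped Analyzer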
protected Reader wrapReader(String fieldName, Reader reader)
Wraps / alters the given Reader. Through this method, AnalyzerWrappers can implement initReader(String, Reader). By default, the given reader is returned.
Parameters:
    fieldName - name of the field which is to be analyzed
    reader - the reader to wrap
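To illustrate the kind of Reader decoration that wrapReader(String, Reader) allows, the following sketch uses only java.io. The class names UnderscoreToSpaceReader and WrapReaderSketch, the field name "title" and the underscore rule are all hypothetical, and the static helper merely stands in for an override of the protected method in an AnalyzerWrapper subclass. A one-for-one character replacement is used so that character offsets stay the same.

```java
import java.io.FilterReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;

// Hypothetical decorator: replaces every '_' with ' ' while reading.
class UnderscoreToSpaceReader extends FilterReader {
    UnderscoreToSpaceReader(Reader in) {
        super(in);
    }

    @Override
    public int read() throws IOException {
        int c = super.read();
        return c == '_' ? ' ' : c;
    }

    @Override
    public int read(char[] buf, int off, int len) throws IOException {
        int n = super.read(buf, off, len);
        // When n is -1 (end of stream) the loop body never runs.
        for (int i = off; i < off + n; i++) {
            if (buf[i] == '_') {
                buf[i] = ' ';
            }
        }
        return n;
    }
}

class WrapReaderSketch {
    // Stand-in for an overridden wrapReader: decorate one field, pass others through.
    static Reader wrapReader(String fieldName, Reader reader) {
        return "title".equals(fieldName) ? new UnderscoreToSpaceReader(reader) : reader;
    }

    // Reads a Reader to the end and returns its content as a String.
    static String drain(Reader reader) {
        StringBuilder sb = new StringBuilder();
        char[] buf = new char[64];
        try {
            int n;
            while ((n = reader.read(buf, 0, buf.length)) != -1) {
                sb.append(buf, 0, n);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(drain(wrapReader("title", new StringReader("open_source search"))));
        System.out.println(drain(wrapReader("body", new StringReader("snake_case"))));
    }
}
```

Returning the reader untouched for every other field mirrors the documented default, in which the given reader is returned as is.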
protected final Analyzer.TokenStreamComponents createComponents(String fieldName, Reader aReader)
Description copied from class: Analyzer
Creates a new Analyzer.TokenStreamComponents instance for this analyzer.
Specified by:
    createComponents in class Analyzer
Parameters:
    fieldName - the name of the field whose content is passed to the Analyzer.TokenStreamComponents sink as a reader
    aReader - the reader passed to the Tokenizer constructor
Returns:
    the Analyzer.TokenStreamComponents for this analyzer.

public int getPositionIncrementGap(String fieldName)
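Description copied from class: Analyzer
Invoked before indexing an IndexableField instance if terms have already been added to that field.
Overrides:
    getPositionIncrementGap in class Analyzer
Parameters:
    fieldName - IndexableField name being indexed.
Returns:
    position increment gap, added to the next token emitted from Analyzer.tokenStream(String,Reader). This value must be >= 0.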
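The effect of a position increment gap on a multi-valued field can be worked through with a small model. This is a simplified sketch, not Lucene's indexing code: the class PositionGapSketch and its method are hypothetical, values are split on whitespace, every token is assumed to have a position increment of 1, and the gap is assumed to be added on top of that increment once the field already holds at least one term.

```java
import java.util.ArrayList;
import java.util.List;

class PositionGapSketch {
    // Returns the position assigned to each token across all values of one field.
    static List<Integer> positions(String[] values, int positionIncrementGap) {
        if (positionIncrementGap < 0) {
            throw new IllegalArgumentException("gap must be >= 0");
        }
        List<Integer> result = new ArrayList<>();
        int next = 0; // position the next token would receive
        for (String value : values) {
            // The gap only applies if terms have already been added to the field.
            if (!result.isEmpty()) {
                next += positionIncrementGap;
            }
            for (String token : value.trim().split("\\s+")) {
                if (token.isEmpty()) {
                    continue; // an empty value contributes no tokens
                }
                result.add(next);
                next++;
            }
        }
        return result;
    }

    public static void main(String[] args) {
        String[] values = {"quick fox", "lazy dog"};
        System.out.println(positions(values, 0));   // [0, 1, 2, 3]
        System.out.println(positions(values, 100)); // [0, 1, 102, 103]
    }
}
```

Under these assumptions a gap of 0 leaves the last token of one value adjacent to the first token of the next, whereas a large gap separates the values so that position-based matching is unlikely to span them.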
public int getOffsetGap(String fieldName)
Description copied from class: Analyzer
Just like Analyzer.getPositionIncrementGap(java.lang.String), except for Token offsets instead. By default this returns 1. This method is only called if the field produced at least one token for indexing.
Overrides:
    getOffsetGap in class Analyzer
Parameters:
    fieldName - the field just indexed
Returns:
    offset gap, added to the next token emitted from Analyzer.tokenStream(String,Reader). This value must be >= 0.

public final Reader initReader(String fieldName, Reader reader)
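Description copied from class: Analyzer
Override this if you want to add a CharFilter chain.
The default implementation returns reader unchanged.
Overrides:
    initReader in class Analyzer
Parameters:
    fieldName - IndexableField name being indexed
    reader - original Reader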