public final class NumericTokenStream extends TokenStream
TokenStream
for indexing numeric values that can be used by NumericRangeQuery
or NumericRangeFilter
.
Note that for simple usage, NumericField
is
recommended. NumericField
disables norms and
term freqs, as they are not usually needed during
searching. If you need to change these settings, you
should use this class.
See NumericField
for capabilities of fields
indexed numerically.
Here's an example usage, for an int
field:
Field field = new Field(name, new NumericTokenStream(precisionStep).setIntValue(value)); field.setOmitNorms(true); field.setIndexOptions(IndexOptions.DOCS_ONLY); document.add(field);
For optimal performance, re-use the TokenStream and Field instance for more than one document:
NumericTokenStream stream = new NumericTokenStream(precisionStep); Field field = new Field(name, stream); field.setOmitNorms(true); field.setIndexOptions(IndexOptions.DOCS_ONLY); Document document = new Document(); document.add(field); for(all documents) { stream.setIntValue(value) writer.addDocument(document); }
This stream is not intended to be used in analyzers; it's more for iterating the different precisions during indexing a specific numeric value.
NOTE: as token streams are only consumed once
the document is added to the index, if you index more
than one numeric field, use a separate NumericTokenStream
instance for each.
See NumericRangeQuery
for more details on the
precisionStep
parameter as well as how numeric fields work under the hood.
AttributeSource.AttributeFactory, AttributeSource.State
Modifier and Type | Field and Description |
---|---|
static String |
TOKEN_TYPE_FULL_PREC
The full precision token gets this token type assigned.
|
static String |
TOKEN_TYPE_LOWER_PREC
The lower precision tokens gets this token type assigned.
|
Constructor and Description |
---|
NumericTokenStream()
Creates a token stream for numeric values using the default
precisionStep
NumericUtils.PRECISION_STEP_DEFAULT (4). |
NumericTokenStream(AttributeSource.AttributeFactory factory,
int precisionStep)
Expert: Creates a token stream for numeric values with the specified
precisionStep using the given
AttributeSource.AttributeFactory . |
NumericTokenStream(AttributeSource source,
int precisionStep)
Expert: Creates a token stream for numeric values with the specified
precisionStep using the given AttributeSource . |
NumericTokenStream(int precisionStep)
Creates a token stream for numeric values with the specified
precisionStep . |
Modifier and Type | Method and Description |
---|---|
int |
getPrecisionStep()
Returns the precision step.
|
boolean |
incrementToken()
Consumers (i.e.,
IndexWriter ) use this method to advance the stream to
the next token. |
void |
reset()
Resets this stream to the beginning.
|
NumericTokenStream |
setDoubleValue(double value)
Initializes the token stream with the supplied
double value. |
NumericTokenStream |
setFloatValue(float value)
Initializes the token stream with the supplied
float value. |
NumericTokenStream |
setIntValue(int value)
Initializes the token stream with the supplied
int value. |
NumericTokenStream |
setLongValue(long value)
Initializes the token stream with the supplied
long value. |
String |
toString()
Returns a string representation of the object.
|
close, end
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState
public static final String TOKEN_TYPE_FULL_PREC
public static final String TOKEN_TYPE_LOWER_PREC
public NumericTokenStream()
precisionStep
NumericUtils.PRECISION_STEP_DEFAULT
(4). The stream is not yet initialized,
before using set a value using the various set???Value() methods.public NumericTokenStream(int precisionStep)
precisionStep
. The stream is not yet initialized,
before using set a value using the various set???Value() methods.public NumericTokenStream(AttributeSource source, int precisionStep)
precisionStep
using the given AttributeSource
.
The stream is not yet initialized,
before using set a value using the various set???Value() methods.public NumericTokenStream(AttributeSource.AttributeFactory factory, int precisionStep)
precisionStep
using the given
AttributeSource.AttributeFactory
.
The stream is not yet initialized,
before using set a value using the various set???Value() methods.public NumericTokenStream setLongValue(long value)
long
value.value
- the value, for which this TokenStream should enumerate tokens.new Field(name, new NumericTokenStream(precisionStep).setLongValue(value))
public NumericTokenStream setIntValue(int value)
int
value.value
- the value, for which this TokenStream should enumerate tokens.new Field(name, new NumericTokenStream(precisionStep).setIntValue(value))
public NumericTokenStream setDoubleValue(double value)
double
value.value
- the value, for which this TokenStream should enumerate tokens.new Field(name, new NumericTokenStream(precisionStep).setDoubleValue(value))
public NumericTokenStream setFloatValue(float value)
float
value.value
- the value, for which this TokenStream should enumerate tokens.new Field(name, new NumericTokenStream(precisionStep).setFloatValue(value))
public void reset()
TokenStream
TokenStream.reset()
is not needed for
the standard indexing process. However, if the tokens of a
TokenStream
are intended to be consumed more than once, it is
necessary to implement TokenStream.reset()
. Note that if your TokenStream
caches tokens and feeds them back again after a reset, it is imperative
that you clone the tokens when you store them away (on the first pass) as
well as when you return them (on future passes after TokenStream.reset()
).reset
in class TokenStream
public boolean incrementToken()
TokenStream
IndexWriter
) use this method to advance the stream to
the next token. Implementing classes must implement this method and update
the appropriate AttributeImpl
s with the attributes of the next
token.
The producer must make no assumptions about the attributes after the method
has been returned: the caller may arbitrarily change it. If the producer
needs to preserve the state for subsequent calls, it can use
AttributeSource.captureState()
to create a copy of the current attribute state.
This method is called for every token of a document, so an efficient
implementation is crucial for good performance. To avoid calls to
AttributeSource.addAttribute(Class)
and AttributeSource.getAttribute(Class)
,
references to all AttributeImpl
s that this stream uses should be
retrieved during instantiation.
To ensure that filters and consumers know which attributes are available,
the attributes must be added during instantiation. Filters and consumers
are not required to check for availability of attributes in
TokenStream.incrementToken()
.
incrementToken
in class TokenStream
public String toString()
AttributeSource
toString
method
returns a string that "textually represents" this object.
WARNING: For backwards compatibility this method is implemented as in Lucene 2.9/3.0. In Lucene 4.0 this default implementation will be removed.
It is recommeneded to use AttributeSource.reflectAsString(boolean)
or AttributeSource.reflectWith(org.apache.lucene.util.AttributeReflector)
to get a well-defined output of AttributeSource's internals.
toString
in class AttributeSource
public int getPrecisionStep()