Package | Description |
---|---|
org.apache.lucene.analysis | API and code to convert text into indexable/searchable tokens. |
org.apache.lucene.analysis.compound | A filter that decomposes the compound words found in many Germanic languages into their word parts. |
org.apache.lucene.analysis.sinks | Implementations of the SinkTokenizer that might be useful. |
org.apache.lucene.analysis.standard | Standards-based analyzers implemented with JFlex. |
org.apache.lucene.analysis.standard.std31 | Backwards-compatible implementation to match Version.LUCENE_31 |
org.apache.lucene.analysis.standard.std34 | Backwards-compatible implementation to match Version.LUCENE_34 |
org.apache.lucene.analysis.tokenattributes | Useful Attributes for text analysis. |
org.apache.lucene.facet.index.streaming | Expert: attributes streaming definition for indexing facets. Streaming of facet attributes is a low-level interface to Lucene indexing. |

Modifier and Type | Class and Description |
---|---|
class | Token: A Token is an occurrence of a term from the text of a field. |

Modifier and Type | Field and Description |
---|---|
protected CharTermAttribute | CompoundWordTokenFilterBase.termAtt |

Modifier and Type | Field and Description |
---|---|
protected CharTermAttribute | DateRecognizerSinkFilter.termAtt |

Modifier and Type | Method and Description |
---|---|
void | UAX29URLEmailTokenizerImpl.getText(CharTermAttribute t): Fills CharTermAttribute with the current token text. |
void | StandardTokenizerInterface.getText(CharTermAttribute t): Copies the matched text into the CharTermAttribute. |
void | StandardTokenizerImpl.getText(CharTermAttribute t): Fills CharTermAttribute with the current token text. |

Modifier and Type | Method and Description |
---|---|
void | UAX29URLEmailTokenizerImpl31.getText(CharTermAttribute t): Deprecated. Fills CharTermAttribute with the current token text. |
void | StandardTokenizerImpl31.getText(CharTermAttribute t): Deprecated. Fills CharTermAttribute with the current token text. |

Modifier and Type | Method and Description |
---|---|
void | UAX29URLEmailTokenizerImpl34.getText(CharTermAttribute t): Deprecated. Fills CharTermAttribute with the current token text. |

Modifier and Type | Class and Description |
---|---|
class | CharTermAttributeImpl: The term text of a Token. |
class | TermAttributeImpl: Deprecated. This class is no longer used. The backwards layer in AttributeFactory uses the replacement implementation. |

Modifier and Type | Method and Description |
---|---|
CharTermAttribute | CharTermAttributeImpl.append(char c) |
CharTermAttribute | CharTermAttribute.append(char c) |
CharTermAttribute | CharTermAttributeImpl.append(CharSequence csq) |
CharTermAttribute | CharTermAttribute.append(CharSequence csq) |
CharTermAttribute | CharTermAttributeImpl.append(CharSequence csq, int start, int end) |
CharTermAttribute | CharTermAttribute.append(CharSequence csq, int start, int end) |
CharTermAttribute | CharTermAttributeImpl.append(CharTermAttribute ta) |
CharTermAttribute | CharTermAttribute.append(CharTermAttribute termAtt): Appends the contents of the other CharTermAttribute to this character sequence. |
CharTermAttribute | CharTermAttributeImpl.append(String s) |
CharTermAttribute | CharTermAttribute.append(String s): Appends the specified String to this character sequence. |
CharTermAttribute | CharTermAttributeImpl.append(StringBuilder s) |
CharTermAttribute | CharTermAttribute.append(StringBuilder sb): Appends the specified StringBuilder to this character sequence. |
CharTermAttribute | CharTermAttributeImpl.setEmpty() |
CharTermAttribute | CharTermAttribute.setEmpty(): Sets the length of the termBuffer to zero. |
CharTermAttribute | CharTermAttributeImpl.setLength(int length) |
CharTermAttribute | CharTermAttribute.setLength(int length): Sets the number of valid characters (length of the term) in the termBuffer array. |
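
Because every method listed above returns CharTermAttribute, calls can be chained in a fluent style. The following is a minimal sketch of that pattern only. `TermBufferSketch` and its `demo()` helper are hypothetical stand-ins written for this illustration so that the example runs without Lucene on the classpath; they are not Lucene classes, and details such as the initial buffer size and the bounds check in `setLength` are assumptions of the sketch.

```java
import java.util.Arrays;

// Hypothetical stand-in, NOT Lucene's CharTermAttributeImpl. It mimics only the
// "return this" chaining of setEmpty(), setLength(int) and the append overloads.
final class TermBufferSketch implements CharSequence {
    private char[] buf = new char[8]; // assumed initial size, chosen for the sketch
    private int len = 0;              // number of valid characters in buf

    // Resets the term to length zero; the buffer itself is kept for reuse.
    TermBufferSketch setEmpty() {
        len = 0;
        return this;
    }

    // Sets the number of valid characters; this sketch rejects out-of-range values.
    TermBufferSketch setLength(int length) {
        if (length < 0 || length > buf.length) {
            throw new IllegalArgumentException("length out of range: " + length);
        }
        len = length;
        return this;
    }

    TermBufferSketch append(char c) {
        ensureCapacity(len + 1);
        buf[len++] = c;
        return this;
    }

    // Accepts String, StringBuilder, or another TermBufferSketch.
    TermBufferSketch append(CharSequence csq) {
        CharSequence s = (csq == null) ? "null" : csq;
        return append(s, 0, s.length());
    }

    TermBufferSketch append(CharSequence csq, int start, int end) {
        CharSequence s = (csq == null) ? "null" : csq;
        if (start < 0 || start > end || end > s.length()) {
            throw new IndexOutOfBoundsException();
        }
        ensureCapacity(len + (end - start));
        for (int i = start; i < end; i++) {
            buf[len++] = s.charAt(i);
        }
        return this;
    }

    private void ensureCapacity(int min) {
        if (min > buf.length) {
            buf = Arrays.copyOf(buf, Math.max(min, buf.length * 2));
        }
    }

    @Override public int length() { return len; }

    @Override public char charAt(int index) {
        if (index < 0 || index >= len) throw new IndexOutOfBoundsException();
        return buf[index];
    }

    @Override public CharSequence subSequence(int start, int end) {
        if (start < 0 || start > end || end > len) throw new IndexOutOfBoundsException();
        return new String(buf, start, end - start);
    }

    @Override public String toString() { return new String(buf, 0, len); }

    // Builds a term with chained calls, then truncates it with setLength.
    static String[] demo() {
        TermBufferSketch term = new TermBufferSketch();
        term.setEmpty().append("foot").append('-').append(new StringBuilder("ball"));
        String full = term.toString();
        String truncated = term.setLength(4).toString();
        return new String[] { full, truncated };
    }

    public static void main(String[] args) {
        String[] result = demo();
        System.out.println(result[0]); // prints foot-ball
        System.out.println(result[1]); // prints foot
    }
}
```

With Lucene itself, the same chain would be written against the `termAtt` field of a filter or tokenizer, for example `termAtt.setEmpty().append(text)`, where `text` is whatever character data the stream has just produced.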

Modifier and Type | Method and Description |
---|---|
CharTermAttribute | CharTermAttributeImpl.append(CharTermAttribute ta) |
CharTermAttribute | CharTermAttribute.append(CharTermAttribute termAtt): Appends the contents of the other CharTermAttribute to this character sequence. |

Modifier and Type | Field and Description |
---|---|
protected CharTermAttribute | CategoryTokenizerBase.termAttribute: The stream's term attribute. |