Package | Description |
---|---|
org.apache.lucene.analysis | API and code to convert text into indexable/searchable tokens. |
org.apache.lucene.analysis.ar | Analyzer for Arabic. |
org.apache.lucene.analysis.in | Analysis components for Indian languages. |
org.apache.lucene.analysis.ru | Analyzer for Russian. |

Modifier and Type | Class and Description |
---|---|
class | LetterTokenizer: A LetterTokenizer is a tokenizer that divides text at non-letters. |
class | LowerCaseTokenizer: LowerCaseTokenizer performs the function of LetterTokenizer and LowerCaseFilter together. |
class | WhitespaceTokenizer: A WhitespaceTokenizer is a tokenizer that divides text at whitespace. |

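
The three tokenizers above differ only in which characters they keep inside a token and whether they lowercase them. The following is a minimal, plain-Java sketch of that idea, not Lucene's implementation: the class `CharTokenizerSketch` and its methods are hypothetical names chosen for illustration, and the sketch ignores Lucene-specific details such as offsets, attributes, and buffering.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntPredicate;
import java.util.function.IntUnaryOperator;

// Hypothetical illustration only; this class is not part of Lucene.
final class CharTokenizerSketch {

    // Collects maximal runs of code points accepted by isTokenChar,
    // passing each kept code point through normalize.
    static List<String> tokenize(String text, IntPredicate isTokenChar, IntUnaryOperator normalize) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            if (isTokenChar.test(cp)) {
                current.appendCodePoint(normalize.applyAsInt(cp));
            } else if (current.length() > 0) {
                // A rejected character ends the current token.
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    // Divides text at non-letters (the behavior described for LetterTokenizer).
    static List<String> letterTokens(String text) {
        return tokenize(text, Character::isLetter, cp -> cp);
    }

    // Divides at non-letters and lowercases (the behavior described for LowerCaseTokenizer).
    static List<String> lowerCaseTokens(String text) {
        return tokenize(text, Character::isLetter, Character::toLowerCase);
    }

    // Divides text at whitespace (the behavior described for WhitespaceTokenizer).
    static List<String> whitespaceTokens(String text) {
        return tokenize(text, cp -> !Character.isWhitespace(cp), cp -> cp);
    }
}
```

For the input `"Hello, World 42!"`, `letterTokens` yields `Hello` and `World`, `lowerCaseTokens` yields `hello` and `world`, and `whitespaceTokens` yields `Hello,`, `World`, and `42!`.
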
Modifier and Type | Class and Description |
---|---|
class | ArabicLetterTokenizer: Deprecated. (3.1) Use StandardTokenizer instead. |

Modifier and Type | Class and Description |
---|---|
class | IndicTokenizer: Deprecated. (3.6) Use StandardTokenizer instead. |

Modifier and Type | Class and Description |
---|---|
class | RussianLetterTokenizer: Deprecated. Use StandardTokenizer instead, which has the same functionality. This tokenizer will be removed in Lucene 5.0. |