Package | Description |
---|---|
org.apache.lucene.analysis.ar |
Analyzer for Arabic.
|
org.apache.lucene.analysis.cjk |
Analyzer for Chinese, Japanese, and Korean, which indexes bigrams.
|
org.apache.lucene.analysis.cn |
Analyzer for Chinese, which indexes unigrams (individual chinese characters).
|
org.apache.lucene.analysis.core |
Basic, general-purpose analysis components.
|
org.apache.lucene.analysis.ngram |
Character n-gram tokenizers and filters.
|
org.apache.lucene.analysis.path |
Analysis components for path-like strings such as filenames.
|
org.apache.lucene.analysis.pattern |
Set of components for pattern-based (regex) analysis.
|
org.apache.lucene.analysis.ru |
Analyzer for Russian.
|
org.apache.lucene.analysis.standard |
Fast, general-purpose grammar-based tokenizers.
|
org.apache.lucene.analysis.th |
Analyzer for Thai.
|
org.apache.lucene.analysis.util |
Utility functions for text analysis.
|
org.apache.lucene.analysis.wikipedia |
Tokenizer that is aware of Wikipedia syntax.
|
Modifier and Type | Class and Description |
---|---|
class |
ArabicLetterTokenizerFactory
Deprecated.
(3.1) Use StandardTokenizerFactory instead.
|
Modifier and Type | Class and Description |
---|---|
class |
CJKTokenizerFactory
Deprecated.
Use
CJKBigramFilterFactory instead. |
Modifier and Type | Class and Description |
---|---|
class |
ChineseTokenizerFactory
Deprecated.
Use
StandardTokenizerFactory instead. |
Modifier and Type | Class and Description |
---|---|
class |
KeywordTokenizerFactory
Factory for
KeywordTokenizer . |
class |
LetterTokenizerFactory
Factory for
LetterTokenizer . |
class |
LowerCaseTokenizerFactory
Factory for
LowerCaseTokenizer . |
class |
WhitespaceTokenizerFactory
Factory for
WhitespaceTokenizer . |
Modifier and Type | Class and Description |
---|---|
class |
EdgeNGramTokenizerFactory
Creates new instances of
EdgeNGramTokenizer . |
class |
NGramTokenizerFactory
Factory for
NGramTokenizer . |
Modifier and Type | Class and Description |
---|---|
class |
PathHierarchyTokenizerFactory
Factory for
PathHierarchyTokenizer . |
Modifier and Type | Class and Description |
---|---|
class |
PatternTokenizerFactory
Factory for
PatternTokenizer . |
Modifier and Type | Class and Description |
---|---|
class |
RussianLetterTokenizerFactory
Deprecated.
Use
StandardTokenizerFactory instead.
This tokenizer has no Russian-specific functionality. |
Modifier and Type | Class and Description |
---|---|
class |
ClassicTokenizerFactory
Factory for
ClassicTokenizer . |
class |
StandardTokenizerFactory
Factory for
StandardTokenizer . |
class |
UAX29URLEmailTokenizerFactory
Factory for
UAX29URLEmailTokenizer . |
Modifier and Type | Class and Description |
---|---|
class |
ThaiTokenizerFactory
Factory for
ThaiTokenizer . |
Modifier and Type | Method and Description |
---|---|
static TokenizerFactory |
TokenizerFactory.forName(String name,
Map<String,String> args)
looks up a tokenizer by name from context classpath
|
Modifier and Type | Method and Description |
---|---|
static Class<? extends TokenizerFactory> |
TokenizerFactory.lookupClass(String name)
looks up a tokenizer class by name from context classpath
|
Modifier and Type | Class and Description |
---|---|
class |
WikipediaTokenizerFactory
Factory for
WikipediaTokenizer . |
Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.