Package | Description |
---|---|
org.apache.lucene.analysis | API and code to convert text into indexable/searchable tokens. |
org.apache.lucene.analysis.standard | Standards-based analyzers implemented with JFlex. |

Modifier and Type | Class and Description |
---|---|
class | KeywordAnalyzer "Tokenizes" the entire stream as a single token. |
class | SimpleAnalyzer An Analyzer that filters LetterTokenizer with LowerCaseFilter. You must specify the required Version compatibility when creating CharTokenizer: as of 3.1, LowerCaseTokenizer uses an int-based API to normalize and detect token code points. |
class | StopAnalyzer |
class | StopwordAnalyzerBase Base class for Analyzers that need to make use of stopword sets. |
class | WhitespaceAnalyzer An Analyzer that uses WhitespaceTokenizer. |
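
The descriptions above differ mainly in where token boundaries fall. The following is a plain-Java approximation of that behavior, not Lucene code: the class `AnalyzerSketch` and its methods are hypothetical names introduced for illustration, and the real analyzers return a `TokenStream` rather than a list. It iterates by code point, in the spirit of the int-based API mentioned for `SimpleAnalyzer`.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical illustration only; these are not Lucene classes.
public class AnalyzerSketch {

    /** KeywordAnalyzer-like: the entire input becomes a single token. */
    public static List<String> keyword(String text) {
        return Collections.singletonList(text);
    }

    /** WhitespaceAnalyzer-like: split at whitespace, keep case and punctuation. */
    public static List<String> whitespace(String text) {
        return split(text, false);
    }

    /** SimpleAnalyzer-like: maximal runs of letters, lowercased. */
    public static List<String> simple(String text) {
        return split(text, true);
    }

    private static List<String> split(String text, boolean lettersOnly) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        // Walk the input by code point so supplementary characters stay intact.
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            boolean isTokenChar = lettersOnly
                    ? Character.isLetter(cp)
                    : !Character.isWhitespace(cp);
            if (isTokenChar) {
                current.appendCodePoint(lettersOnly ? Character.toLowerCase(cp) : cp);
            } else if (current.length() > 0) {
                // A non-token character closes the current token.
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        String text = "Hello, Lucene-3 World";
        System.out.println(keyword(text));
        System.out.println(whitespace(text));
        System.out.println(simple(text));
    }
}
```

For the sample input, the keyword variant keeps the whole string, the whitespace variant keeps `Hello,` and `Lucene-3` with punctuation and digits attached, and the simple variant yields `hello`, `lucene`, `world`.
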

Modifier and Type | Method and Description |
---|---|
protected static CharArraySet | StopwordAnalyzerBase.loadStopwordSet(boolean ignoreCase, Class<? extends ReusableAnalyzerBase> aClass, String resource, String comment) Creates a CharArraySet from a file resource associated with a class. |
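
As a rough illustration of what loading a stopword set from a class resource involves, here is a minimal plain-Java sketch. It is not the Lucene implementation: `StopwordLoaderSketch`, `fromReader`, and `fromResource` are hypothetical names, a `HashSet<String>` stands in for `CharArraySet`, lowercasing stands in for case-insensitive matching, and skipping blank lines is an assumption of this sketch.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration only; this is not a Lucene class.
public class StopwordLoaderSketch {

    /** Reads one word per line, skipping lines that start with the comment prefix. */
    public static Set<String> fromReader(Reader reader, String comment, boolean ignoreCase) {
        Set<String> words = new HashSet<>();
        try (BufferedReader br = new BufferedReader(reader)) {
            String line;
            while ((line = br.readLine()) != null) {
                if (line.startsWith(comment)) {
                    continue; // comment line
                }
                String word = line.trim();
                if (word.isEmpty()) {
                    continue; // assumption: blank lines are ignored
                }
                words.add(ignoreCase ? word.toLowerCase(Locale.ROOT) : word);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return words;
    }

    /** Resolves the resource relative to aClass, then delegates to fromReader. */
    public static Set<String> fromResource(Class<?> aClass, String resource,
                                           String comment, boolean ignoreCase) {
        InputStream in = aClass.getResourceAsStream(resource);
        if (in == null) {
            throw new IllegalArgumentException("Resource not found: " + resource);
        }
        return fromReader(new InputStreamReader(in, StandardCharsets.UTF_8), comment, ignoreCase);
    }

    public static void main(String[] args) {
        String file = "# sample stop words\nThe\nand\n\n# another comment\nOf\n";
        System.out.println(fromReader(new StringReader(file), "#", true));
    }
}
```

Resolving the file through a `Class` keeps the word list next to the analyzer that uses it on the classpath, which is presumably why the method above takes an `aClass` argument.
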

Modifier and Type | Class and Description |
---|---|
class | ClassicAnalyzer Filters ClassicTokenizer with ClassicFilter, LowerCaseFilter and StopFilter, using a list of English stop words. |
class | StandardAnalyzer Filters StandardTokenizer with StandardFilter, LowerCaseFilter and StopFilter, using a list of English stop words. |
class | UAX29URLEmailAnalyzer Filters UAX29URLEmailTokenizer with StandardFilter, LowerCaseFilter and StopFilter, using a list of English stop words. |
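
All three analyzers above share the same tail of the chain: lowercase each token, then drop stop words. The sketch below shows only that shared part in plain Java, taking already-split tokens as input because the three tokenizers differ. `StopFilterChainSketch` is a hypothetical name, and the stop list is an abbreviated stand-in, not Lucene's actual English set.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration only; this is not a Lucene class.
public class StopFilterChainSketch {

    // Abbreviated, made-up stop list for the example.
    static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("a", "an", "and", "of", "the"));

    /** Applies a LowerCaseFilter-like step followed by a StopFilter-like step. */
    public static List<String> analyze(List<String> tokens) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            String lower = token.toLowerCase(Locale.ROOT); // lowercase first
            if (!STOP_WORDS.contains(lower)) {             // then drop stop words
                out.add(lower);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(analyze(Arrays.asList("The", "Art", "of", "Indexing")));
    }
}
```

Lowercasing before the stop-word check is what lets `The` match the lowercase entry `the` in this sketch.
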