Package | Description |
---|---|
org.apache.lucene.analysis | API and code to convert text into indexable/searchable tokens. |
org.apache.lucene.analysis.cn.smart | Analyzer for Simplified Chinese, which indexes words. |
org.apache.lucene.analysis.compound | A filter that decomposes compound words, found in many Germanic languages, into their word parts. |
org.apache.lucene.analysis.ga | Analysis for Irish. |
org.apache.lucene.analysis.ja | Analyzer for Japanese. |
org.apache.lucene.analysis.pt | Analyzer for Portuguese. |
Modifier and Type | Field and Description |
---|---|
static CharArraySet | CharArraySet.EMPTY_SET |
protected CharArraySet | StopwordAnalyzerBase.stopwords: An immutable stopword set. |
Modifier and Type | Method and Description |
---|---|
static CharArraySet | CharArraySet.copy(Set<?> set): Deprecated. Use copy(Version, Set) instead. |
static CharArraySet | CharArraySet.copy(Version matchVersion, Set<?> set): Returns a copy of the given set as a CharArraySet. |
static CharArraySet | WordlistLoader.getSnowballWordSet(Reader reader, CharArraySet result): Reads stopwords from a stopword list in Snowball format. |
static CharArraySet | WordlistLoader.getSnowballWordSet(Reader reader, Version matchVersion): Reads stopwords from a stopword list in Snowball format. |
static CharArraySet | WordlistLoader.getWordSet(Reader reader, CharArraySet result): Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting leading and trailing whitespace). |
static CharArraySet | WordlistLoader.getWordSet(Reader reader, String comment, CharArraySet result): Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting leading and trailing whitespace). |
static CharArraySet | WordlistLoader.getWordSet(Reader reader, String comment, Version matchVersion): Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting leading and trailing whitespace). |
static CharArraySet | WordlistLoader.getWordSet(Reader reader, Version matchVersion): Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting leading and trailing whitespace). |
CharArraySet | CharArrayMap.keySet(): Returns a CharArraySet view on the map's keys. |
protected static CharArraySet | StopwordAnalyzerBase.loadStopwordSet(boolean ignoreCase, Class<? extends ReusableAnalyzerBase> aClass, String resource, String comment): Creates a CharArraySet from a file resource associated with a class. |
protected static CharArraySet | StopwordAnalyzerBase.loadStopwordSet(File stopwords, Version matchVersion): Creates a CharArraySet from a file. |
protected static CharArraySet | StopwordAnalyzerBase.loadStopwordSet(Reader stopwords, Version matchVersion): Creates a CharArraySet from a Reader. |
static CharArraySet | CharArraySet.unmodifiableSet(CharArraySet set): Returns an unmodifiable CharArraySet. |
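
The getWordSet descriptions above (one entry per line, surrounding whitespace dropped, comment lines skipped) can be illustrated with a small JDK-only sketch. It is not Lucene's implementation: the class `WordListSketch` and the method `readWordSet` are hypothetical names, a plain `Set<String>` stands in for CharArraySet, and treating a comment line as any line that starts with the comment string is an assumption.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical helper, not part of Lucene: it only mimics the behaviour
// described for WordlistLoader.getWordSet(Reader, String, ...).
final class WordListSketch {

    // Reads one entry per line, trimming whitespace and skipping comment lines.
    // Assumption: a line is a comment if it starts with the comment string.
    static Set<String> readWordSet(Reader reader, String comment) {
        Set<String> result = new LinkedHashSet<>();
        try (BufferedReader br = new BufferedReader(reader)) {
            String line;
            while ((line = br.readLine()) != null) {
                // A null comment string means every line is treated as an entry.
                if (comment != null && line.startsWith(comment)) {
                    continue;
                }
                result.add(line.trim());
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return result;
    }

    public static void main(String[] args) {
        String list = "# English stopwords\n  the \nand\n#skip me\nof";
        System.out.println(readWordSet(new StringReader(list), "#")); // [the, and, of]
    }
}
```

With the real API, the resulting set would typically be handed to an analyzer constructor that accepts a CharArraySet of stopwords.
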
Modifier and Type | Method and Description |
---|---|
static CharArraySet | WordlistLoader.getSnowballWordSet(Reader reader, CharArraySet result): Reads stopwords from a stopword list in Snowball format. |
static CharArraySet | WordlistLoader.getWordSet(Reader reader, CharArraySet result): Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting leading and trailing whitespace). |
static CharArraySet | WordlistLoader.getWordSet(Reader reader, String comment, CharArraySet result): Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting leading and trailing whitespace). |
static CharArraySet | CharArraySet.unmodifiableSet(CharArraySet set): Returns an unmodifiable CharArraySet. |
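
The table does not spell out what "Snowball format" means. The sketch below assumes the usual conventions of Snowball stopword lists: a vertical bar (`|`) starts a comment that runs to the end of the line, and a line may hold several whitespace-separated words. The class `SnowballListSketch` and the method `readSnowballWordSet` are hypothetical, use only the JDK, and are not Lucene's implementation.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical helper, not part of Lucene: parses a Snowball-style stopword
// list under the assumptions stated in the accompanying text.
final class SnowballListSketch {

    static Set<String> readSnowballWordSet(Reader reader) {
        Set<String> result = new LinkedHashSet<>();
        try (BufferedReader br = new BufferedReader(reader)) {
            String line;
            while ((line = br.readLine()) != null) {
                // Drop everything from the first '|' onward (trailing comment).
                int bar = line.indexOf('|');
                if (bar >= 0) {
                    line = line.substring(0, bar);
                }
                // The remainder may contain zero or more words.
                for (String word : line.trim().split("\\s+")) {
                    if (!word.isEmpty()) {
                        result.add(word);
                    }
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return result;
    }

    public static void main(String[] args) {
        String list = " | A Snowball-style list\nan    | indefinite article\nder die das\n";
        System.out.println(readSnowballWordSet(new StringReader(list))); // [an, der, die, das]
    }
}
```
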
Constructor and Description |
---|
KeywordMarkerFilter(TokenStream in, CharArraySet keywordSet): Creates a new KeywordMarkerFilter that marks the current token as a keyword, via the KeywordAttribute, if the token's term buffer is contained in the given set. |
MockAnalyzer(Random random, int pattern, boolean lowerCase, CharArraySet filter, boolean enablePositionIncrements): Creates a new MockAnalyzer. |
Modifier and Type | Method and Description |
---|---|
static CharArraySet | SmartChineseAnalyzer.getDefaultStopSet(): Returns an unmodifiable instance of the default stop-words set. |
Modifier and Type | Field and Description |
---|---|
protected CharArraySet | CompoundWordTokenFilterBase.dictionary |
Modifier and Type | Method and Description |
---|---|
static CharArraySet | CompoundWordTokenFilterBase.makeDictionary(Version matchVersion, String[] dictionary): Deprecated. Only available for backwards compatibility. |
Modifier and Type | Method and Description |
---|---|
static CharArraySet | IrishAnalyzer.getDefaultStopSet(): Returns an unmodifiable instance of the default stop words set. |
Constructor and Description |
---|
IrishAnalyzer(Version matchVersion, CharArraySet stopwords): Builds an analyzer with the given stop words. |
IrishAnalyzer(Version matchVersion, CharArraySet stopwords, CharArraySet stemExclusionSet): Builds an analyzer with the given stop words. |
Modifier and Type | Method and Description |
---|---|
static CharArraySet | JapaneseAnalyzer.getDefaultStopSet() |
Constructor and Description |
---|
JapaneseAnalyzer(Version matchVersion, UserDictionary userDict, JapaneseTokenizer.Mode mode, CharArraySet stopwords, Set<String> stoptags) |
Modifier and Type | Field and Description |
---|---|
protected CharArraySet | RSLPStemmerBase.RuleWithSetExceptions.exceptions |