|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.apache.lucene.analysis.Analyzer org.apache.lucene.analysis.standard.StandardAnalyzer
public class StandardAnalyzer
Filters StandardTokenizer
with StandardFilter
, LowerCaseFilter
and StopFilter
, using a list of
English stop words.
You must specify the required Version
compatibility when creating StandardAnalyzer:
Field Summary | |
---|---|
static int |
DEFAULT_MAX_TOKEN_LENGTH
Default maximum allowed token length |
static Set<?> |
STOP_WORDS_SET
An unmodifiable set containing some common English words that are usually not useful for searching. |
Fields inherited from class org.apache.lucene.analysis.Analyzer |
---|
overridesTokenStreamMethod |
Constructor Summary | |
---|---|
StandardAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words ( STOP_WORDS_SET ). |
|
StandardAnalyzer(Version matchVersion,
File stopwords)
Builds an analyzer with the stop words from the given file. |
|
StandardAnalyzer(Version matchVersion,
Reader stopwords)
Builds an analyzer with the stop words from the given reader. |
|
StandardAnalyzer(Version matchVersion,
Set<?> stopWords)
Builds an analyzer with the given stop words. |
Method Summary | |
---|---|
int |
getMaxTokenLength()
|
TokenStream |
reusableTokenStream(String fieldName,
Reader reader)
Creates a TokenStream that is allowed to be re-used from the previous time that the same thread called this method. |
void |
setMaxTokenLength(int length)
Set maximum allowed token length. |
TokenStream |
tokenStream(String fieldName,
Reader reader)
Constructs a StandardTokenizer filtered by a StandardFilter , a LowerCaseFilter and a StopFilter . |
Methods inherited from class org.apache.lucene.analysis.Analyzer |
---|
close, getOffsetGap, getPositionIncrementGap, getPreviousTokenStream, setOverridesTokenStreamMethod, setPreviousTokenStream |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final Set<?> STOP_WORDS_SET
public static final int DEFAULT_MAX_TOKEN_LENGTH
Constructor Detail |
---|
public StandardAnalyzer(Version matchVersion)
STOP_WORDS_SET
).
matchVersion
- Lucene version to match See abovepublic StandardAnalyzer(Version matchVersion, Set<?> stopWords)
matchVersion
- Lucene version to match See abovestopWords
- stop wordspublic StandardAnalyzer(Version matchVersion, File stopwords) throws IOException
matchVersion
- Lucene version to match See abovestopwords
- File to read stop words from
IOException
WordlistLoader.getWordSet(File)
public StandardAnalyzer(Version matchVersion, Reader stopwords) throws IOException
matchVersion
- Lucene version to match See abovestopwords
- Reader to read stop words from
IOException
WordlistLoader.getWordSet(Reader)
Method Detail |
---|
public TokenStream tokenStream(String fieldName, Reader reader)
StandardTokenizer
filtered by a StandardFilter
, a LowerCaseFilter
and a StopFilter
.
tokenStream
in class Analyzer
public void setMaxTokenLength(int length)
public int getMaxTokenLength()
setMaxTokenLength(int)
public TokenStream reusableTokenStream(String fieldName, Reader reader) throws IOException
Analyzer
reusableTokenStream
in class Analyzer
IOException
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |