java.lang.Object
  org.apache.lucene.analysis.Analyzer
    org.apache.lucene.analysis.ru.RussianAnalyzer
public final class RussianAnalyzer extends org.apache.lucene.analysis.Analyzer
Analyzer for the Russian language.
Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified.
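The "default set unless an alternative is specified" behaviour can be pictured with a small stand-in class. This is only a sketch in plain Java, not Lucene code: the class name StopwordSketch, the method indexable, and the three-word default list are all hypothetical, and the analyzer's real default list is not reproduced here.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Illustrative stand-in, NOT the Lucene class: models "default stop set unless one is supplied".
class StopwordSketch {
    // Hypothetical default list (Russian "i", "v", "na"), written as Unicode escapes
    // so the source stays ASCII-only. The real default list is an assumption left out here.
    static final Set<String> DEFAULT_STOPWORDS =
            new HashSet<String>(Arrays.asList("\u0438", "\u0432", "\u043d\u0430"));

    private final Set<String> stopwords;

    // No stop set given: fall back to the default set.
    StopwordSketch() {
        this(DEFAULT_STOPWORDS);
    }

    // Alternative stop set given: it replaces the default set entirely.
    StopwordSketch(Set<String> stopwords) {
        this.stopwords = stopwords;
    }

    // Returns the tokens that would still be indexed, in their original order.
    List<String> indexable(List<String> tokens) {
        List<String> kept = new ArrayList<String>();
        for (String token : tokens) {
            if (!stopwords.contains(token)) {
                kept.add(token);
            }
        }
        return kept;
    }
}
```

With the no-argument constructor, a token equal to one of the default stopwords is dropped; with a custom set, only the words in that set are dropped.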
Field Summary |
---|
Fields inherited from class org.apache.lucene.analysis.Analyzer |
---|
overridesTokenStreamMethod |
Constructor Summary | |
---|---|
RussianAnalyzer(org.apache.lucene.util.Version matchVersion) | |
RussianAnalyzer(org.apache.lucene.util.Version matchVersion, Map<?,?> stopwords) | Deprecated. Use RussianAnalyzer(Version, Set) instead. |
RussianAnalyzer(org.apache.lucene.util.Version matchVersion, Set<?> stopwords) | Builds an analyzer with the given stop words. |
RussianAnalyzer(org.apache.lucene.util.Version matchVersion, String... stopwords) | Deprecated. Use RussianAnalyzer(Version, Set) instead. |
Method Summary | |
---|---|
org.apache.lucene.analysis.TokenStream | reusableTokenStream(String fieldName, Reader reader): Returns a (possibly reused) TokenStream which tokenizes all the text in the provided Reader. |
org.apache.lucene.analysis.TokenStream | tokenStream(String fieldName, Reader reader): Creates a TokenStream which tokenizes all the text in the provided Reader. |
Methods inherited from class org.apache.lucene.analysis.Analyzer |
---|
close, getOffsetGap, getPositionIncrementGap, getPreviousTokenStream, setOverridesTokenStreamMethod, setPreviousTokenStream |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public RussianAnalyzer(org.apache.lucene.util.Version matchVersion)
public RussianAnalyzer(org.apache.lucene.util.Version matchVersion, String... stopwords)
Deprecated. Use RussianAnalyzer(Version, Set) instead.
public RussianAnalyzer(org.apache.lucene.util.Version matchVersion, Set<?> stopwords)
Builds an analyzer with the given stop words.
Parameters:
matchVersion - Lucene compatibility version
stopwords - a stopword set
public RussianAnalyzer(org.apache.lucene.util.Version matchVersion, Map<?,?> stopwords)
Deprecated. Use RussianAnalyzer(Version, Set) instead.
Method Detail |
---|
public org.apache.lucene.analysis.TokenStream tokenStream(String fieldName, Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader.
Specified by: tokenStream in class org.apache.lucene.analysis.Analyzer
Returns: a TokenStream built from a RussianLetterTokenizer filtered with RussianLowerCaseFilter, StopFilter, and RussianStemFilter
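To make the order of the stages concrete, here is a rough plain-Java sketch of the first three: split the text into runs of letters, lowercase each run, then drop stopwords. It is a simplified stand-in, not the Lucene classes. PipelineSketch and analyze are hypothetical names, Character.isLetter is assumed as the letter test (the real tokenizer's character rules may differ), and the stemming stage is left out entirely.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Simplified stand-in for tokenizer -> lowercase filter -> stop filter.
// NOT Lucene code; the stemming stage is deliberately omitted.
class PipelineSketch {
    static List<String> analyze(String text, Set<String> stopwords) {
        List<String> out = new ArrayList<String>();
        StringBuilder word = new StringBuilder();
        // Loop one position past the end so the final word is flushed too.
        for (int i = 0; i <= text.length(); i++) {
            boolean letter = i < text.length() && Character.isLetter(text.charAt(i));
            if (letter) {
                // Stages 1 and 2: accumulate a run of letters, lowercasing as we go.
                word.append(Character.toLowerCase(text.charAt(i)));
            } else if (word.length() > 0) {
                // Stage 3: emit the finished token unless it is a stopword.
                String token = word.toString();
                if (!stopwords.contains(token)) {
                    out.add(token);
                }
                word.setLength(0);
            }
        }
        return out;
    }
}
```

Because lowercasing happens before the stopword check, the stop set only needs to hold lowercase forms. That is the practical reason for placing the lowercase stage ahead of the stop stage.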
public org.apache.lucene.analysis.TokenStream reusableTokenStream(String fieldName, Reader reader) throws IOException
Returns a (possibly reused) TokenStream which tokenizes all the text in the provided Reader.
Overrides: reusableTokenStream in class org.apache.lucene.analysis.Analyzer
Returns: a TokenStream built from a RussianLetterTokenizer filtered with RussianLowerCaseFilter, StopFilter, and RussianStemFilter
Throws: IOException
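The difference between "creates" and "possibly reused" can be sketched as follows. This shows one plausible shape of reuse, assuming a single cached stream per thread that is re-pointed at each new Reader. It does not reproduce Lucene's implementation, and ReuseSketch, Stream, and all method names in it are hypothetical.

```java
import java.io.Reader;
import java.io.StringReader;

// Illustrative stand-in, NOT Lucene code: contrasts a fresh stream per call
// with a per-thread cached stream that is reset onto new input.
class ReuseSketch {
    // Hypothetical stream type that can be re-pointed at a new Reader.
    static final class Stream {
        private Reader source;

        Stream(Reader source) {
            this.source = source;
        }

        void reset(Reader source) {
            this.source = source;
        }

        Reader source() {
            return source;
        }
    }

    // One cached stream per thread (an assumption of this sketch).
    private final ThreadLocal<Stream> previous = new ThreadLocal<Stream>();

    // Always builds a new stream, like a "creates" style method.
    Stream stream(Reader reader) {
        return new Stream(reader);
    }

    // Builds a stream on first use in a thread, then resets and returns the same one.
    Stream reusableStream(Reader reader) {
        Stream cached = previous.get();
        if (cached == null) {
            cached = new Stream(reader);
            previous.set(cached);
        } else {
            cached.reset(reader);
        }
        return cached;
    }
}
```

Under this assumption a caller must finish with one reused stream before requesting the next on the same thread, because the second call redirects the same object to the new input.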