org.apache.lucene.analysis.ru
Class RussianAnalyzer

java.lang.Object
  extended by org.apache.lucene.analysis.Analyzer
      extended by org.apache.lucene.analysis.ru.RussianAnalyzer
All Implemented Interfaces:
Closeable

public final class RussianAnalyzer
extends Analyzer

Analyzer for Russian language.

Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified.


Field Summary
 
Fields inherited from class org.apache.lucene.analysis.Analyzer
overridesTokenStreamMethod
 
Constructor Summary
RussianAnalyzer(Version matchVersion)
           
RussianAnalyzer(Version matchVersion, Map<?,?> stopwords)
          Deprecated. use RussianAnalyzer(Version, Set) instead
RussianAnalyzer(Version matchVersion, Set<?> stopwords)
          Builds an analyzer with the given stop words
RussianAnalyzer(Version matchVersion, String... stopwords)
          Deprecated. use RussianAnalyzer(Version, Set) instead
 
Method Summary
 TokenStream reusableTokenStream(String fieldName, Reader reader)
          Returns a (possibly reused) TokenStream which tokenizes all the text in the provided Reader.
 TokenStream tokenStream(String fieldName, Reader reader)
          Creates a TokenStream which tokenizes all the text in the provided Reader.
 
Methods inherited from class org.apache.lucene.analysis.Analyzer
close, getOffsetGap, getPositionIncrementGap, getPreviousTokenStream, setOverridesTokenStreamMethod, setPreviousTokenStream
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

RussianAnalyzer

public RussianAnalyzer(Version matchVersion)

RussianAnalyzer

public RussianAnalyzer(Version matchVersion,
                       String... stopwords)
Deprecated. use RussianAnalyzer(Version, Set) instead

Builds an analyzer with the given stop words.


RussianAnalyzer

public RussianAnalyzer(Version matchVersion,
                       Set<?> stopwords)
Builds an analyzer with the given stop words

Parameters:
matchVersion - lucene compatibility version
stopwords - a stopword set

RussianAnalyzer

public RussianAnalyzer(Version matchVersion,
                       Map<?,?> stopwords)
Deprecated. use RussianAnalyzer(Version, Set) instead

Builds an analyzer with the given stop words. TODO: create a Set version of this ctor

Method Detail

tokenStream

public TokenStream tokenStream(String fieldName,
                               Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader.

Specified by:
tokenStream in class Analyzer
Returns:
A TokenStream built from a RussianLetterTokenizer filtered with RussianLowerCaseFilter, StopFilter, and RussianStemFilter

reusableTokenStream

public TokenStream reusableTokenStream(String fieldName,
                                       Reader reader)
                                throws IOException
Returns a (possibly reused) TokenStream which tokenizes all the text in the provided Reader.

Overrides:
reusableTokenStream in class Analyzer
Returns:
A TokenStream built from a RussianLetterTokenizer filtered with RussianLowerCaseFilter, StopFilter, and RussianStemFilter
Throws:
IOException


Copyright © 2000-2010 Apache Software Foundation. All Rights Reserved.