public final class HindiAnalyzer extends StopwordAnalyzerBase
Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents| Modifier and Type | Field and Description | 
|---|---|
| static String | DEFAULT_STOPWORD_FILEFile containing default Hindi stopwords. | 
stopwordsGLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY| Constructor and Description | 
|---|
| HindiAnalyzer()Builds an analyzer with the default stop words:
  DEFAULT_STOPWORD_FILE. | 
| HindiAnalyzer(CharArraySet stopwords)Builds an analyzer with the given stop words | 
| HindiAnalyzer(CharArraySet stopwords,
             CharArraySet stemExclusionSet)Builds an analyzer with the given stop words | 
| Modifier and Type | Method and Description | 
|---|---|
| protected Analyzer.TokenStreamComponents | createComponents(String fieldName)Creates
  Analyzer.TokenStreamComponentsused to tokenize all the text in the providedReader. | 
| static CharArraySet | getDefaultStopSet()Returns an unmodifiable instance of the default stop-words set. | 
| protected TokenStream | normalize(String fieldName,
         TokenStream in) | 
getStopwordSet, loadStopwordSet, loadStopwordSet, loadStopwordSetattributeFactory, close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, getVersion, initReader, initReaderForNormalization, normalize, setVersion, tokenStream, tokenStreampublic static final String DEFAULT_STOPWORD_FILE
public HindiAnalyzer(CharArraySet stopwords, CharArraySet stemExclusionSet)
stopwords - a stopword setstemExclusionSet - a stemming exclusion setpublic HindiAnalyzer(CharArraySet stopwords)
stopwords - a stopword setpublic HindiAnalyzer()
DEFAULT_STOPWORD_FILE.public static CharArraySet getDefaultStopSet()
protected Analyzer.TokenStreamComponents createComponents(String fieldName)
Analyzer.TokenStreamComponents
 used to tokenize all the text in the provided Reader.createComponents in class AnalyzerAnalyzer.TokenStreamComponents
         built from a StandardTokenizer filtered with
         LowerCaseFilter, DecimalDigitFilter, IndicNormalizationFilter,
         HindiNormalizationFilter, SetKeywordMarkerFilter
         if a stem exclusion set is provided, HindiStemFilter, and
         Hindi Stop wordsprotected TokenStream normalize(String fieldName, TokenStream in)
Copyright © 2000-2021 Apache Software Foundation. All Rights Reserved.