Package org.apache.lucene.analysis.ca
Class CatalanAnalyzer
- java.lang.Object
- 
- org.apache.lucene.analysis.Analyzer
- 
- org.apache.lucene.analysis.StopwordAnalyzerBase
- 
- org.apache.lucene.analysis.ca.CatalanAnalyzer
 
 
 
- 
- All Implemented Interfaces:
- Closeable,- AutoCloseable
 
 public final class CatalanAnalyzer extends StopwordAnalyzerBase Analyzerfor Catalan.- Since:
- 3.1
 
- 
- 
Nested Class Summary- 
Nested classes/interfaces inherited from class org.apache.lucene.analysis.AnalyzerAnalyzer.ReuseStrategy, Analyzer.TokenStreamComponents
 
- 
 - 
Field SummaryFields Modifier and Type Field Description static StringDEFAULT_STOPWORD_FILEFile containing default Catalan stopwords.- 
Fields inherited from class org.apache.lucene.analysis.StopwordAnalyzerBasestopwords
 - 
Fields inherited from class org.apache.lucene.analysis.AnalyzerGLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY
 
- 
 - 
Constructor SummaryConstructors Constructor Description CatalanAnalyzer()Builds an analyzer with the default stop words:DEFAULT_STOPWORD_FILE.CatalanAnalyzer(CharArraySet stopwords)Builds an analyzer with the given stop words.CatalanAnalyzer(CharArraySet stopwords, CharArraySet stemExclusionSet)Builds an analyzer with the given stop words.
 - 
Method SummaryAll Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description protected Analyzer.TokenStreamComponentscreateComponents(String fieldName)Creates aAnalyzer.TokenStreamComponentswhich tokenizes all the text in the providedReader.static CharArraySetgetDefaultStopSet()Returns an unmodifiable instance of the default stop words set.protected TokenStreamnormalize(String fieldName, TokenStream in)- 
Methods inherited from class org.apache.lucene.analysis.StopwordAnalyzerBasegetStopwordSet, loadStopwordSet, loadStopwordSet, loadStopwordSet
 - 
Methods inherited from class org.apache.lucene.analysis.AnalyzerattributeFactory, close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, initReader, initReaderForNormalization, normalize, tokenStream, tokenStream
 
- 
 
- 
- 
- 
Field Detail- 
DEFAULT_STOPWORD_FILEpublic static final String DEFAULT_STOPWORD_FILE File containing default Catalan stopwords.- See Also:
- Constant Field Values
 
 
- 
 - 
Constructor Detail- 
CatalanAnalyzerpublic CatalanAnalyzer() Builds an analyzer with the default stop words:DEFAULT_STOPWORD_FILE.
 - 
CatalanAnalyzerpublic CatalanAnalyzer(CharArraySet stopwords) Builds an analyzer with the given stop words.- Parameters:
- stopwords- a stopword set
 
 - 
CatalanAnalyzerpublic CatalanAnalyzer(CharArraySet stopwords, CharArraySet stemExclusionSet) Builds an analyzer with the given stop words. If a non-empty stem exclusion set is provided this analyzer will add aSetKeywordMarkerFilterbefore stemming.- Parameters:
- stopwords- a stopword set
- stemExclusionSet- a set of terms not to be stemmed
 
 
- 
 - 
Method Detail- 
getDefaultStopSetpublic static CharArraySet getDefaultStopSet() Returns an unmodifiable instance of the default stop words set.- Returns:
- default stop words set.
 
 - 
createComponentsprotected Analyzer.TokenStreamComponents createComponents(String fieldName) Creates aAnalyzer.TokenStreamComponentswhich tokenizes all the text in the providedReader.- Specified by:
- createComponentsin class- Analyzer
- Returns:
- A Analyzer.TokenStreamComponentsbuilt from anStandardTokenizerfiltered withElisionFilter,LowerCaseFilter,StopFilter,SetKeywordMarkerFilterif a stem exclusion set is provided andSnowballFilter.
 
 - 
normalizeprotected TokenStream normalize(String fieldName, TokenStream in) 
 
- 
 
-