public class MorfologikAnalyzer extends Analyzer
Analyzer using the Morfologik library.
Nested classes/interfaces inherited from class Analyzer: Analyzer.GlobalReuseStrategy, Analyzer.PerFieldReuseStrategy, Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents
Constructor and Description |
---|
MorfologikAnalyzer(Version vers) Builds an analyzer for an original MORFOLOGIK dictionary. |
MorfologikAnalyzer(Version vers, morfologik.stemming.PolishStemmer.DICTIONARY dict) Builds an analyzer for a given PolishStemmer.DICTIONARY enum. |
Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | createComponents(String field, Reader reader) Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |
Methods inherited from class Analyzer: close, getOffsetGap, getPositionIncrementGap, initReader, tokenStream
public MorfologikAnalyzer(Version vers, morfologik.stemming.PolishStemmer.DICTIONARY dict)
Parameters:
vers - Lucene compatibility version
dict - A constant specifying which dictionary to choose. See the Morfologik documentation for details or use the default.

public MorfologikAnalyzer(Version vers)
Parameters:
vers - Lucene compatibility version

protected Analyzer.TokenStreamComponents createComponents(String field, Reader reader)
Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader.
Specified by:
createComponents in class Analyzer
Parameters:
field - ignored field name
reader - source of tokens
Returns:
Analyzer.TokenStreamComponents built from a StandardTokenizer filtered with StandardFilter and MorfologikFilter
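The chain that createComponents builds (a tokenizer whose output passes through filters, the last of which replaces word forms using a dictionary) can be illustrated with a dependency-free sketch. This is not Lucene or Morfologik code: the class LemmaChainSketch, its method names, the three-entry toy dictionary, the lowercase lookup, and the choice to pass unknown words through unchanged are all assumptions made for illustration, and the real filter may behave differently (for example, by emitting several candidate lemmas per token).

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

/** Hypothetical stand-in for a tokenizer -> dictionary-filter chain; not Lucene code. */
final class LemmaChainSketch {

    // Toy dictionary mapping inflected Polish forms to lemmas (illustration only).
    private static final Map<String, String> LEMMAS = new HashMap<>();
    static {
        LEMMAS.put("psy", "pies");
        LEMMAS.put("kotami", "kot");
        LEMMAS.put("domu", "dom");
    }

    // Stage 1: split the input into runs of letters and digits (plays the tokenizer role).
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.split("[^\\p{L}\\p{Nd}]+")) {
            if (!piece.isEmpty()) {
                tokens.add(piece);
            }
        }
        return tokens;
    }

    // Stage 2: replace each token with its lemma when the toy dictionary knows it;
    // unknown tokens are kept unchanged (an assumption of this sketch).
    static List<String> lemmatize(List<String> tokens) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            String lemma = LEMMAS.get(token.toLowerCase(Locale.ROOT));
            out.add(lemma != null ? lemma : token);
        }
        return out;
    }

    // The full chain: the filter stage consumes what the tokenizer stage produces.
    static List<String> analyze(String text) {
        return lemmatize(tokenize(text));
    }

    public static void main(String[] args) {
        // prints [pies, kot, obok, dom]
        System.out.println(analyze("Psy, kotami; obok domu."));
    }
}
```

Keeping the stages as separate methods mirrors the composition described above: each stage only sees the output of the previous one, so a stage can be swapped without touching the others.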
Copyright © 2000-2013 Apache Software Foundation. All Rights Reserved.