public class MorfologikAnalyzer extends Analyzer
Analyzer using Morfologik library.Analyzer.GlobalReuseStrategy, Analyzer.PerFieldReuseStrategy, Analyzer.ReuseStrategy, Analyzer.TokenStreamComponentsGLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY| Constructor and Description |
|---|
MorfologikAnalyzer()
Builds an analyzer with the default Morfologik's Polish dictionary.
|
MorfologikAnalyzer(String dictionaryResource)
Builds an analyzer with an explicit dictionary resource.
|
MorfologikAnalyzer(Version version)
Deprecated.
|
MorfologikAnalyzer(Version version,
String dictionaryResource)
Deprecated.
|
| Modifier and Type | Method and Description |
|---|---|
protected Analyzer.TokenStreamComponents |
createComponents(String field,
Reader reader)
Creates a
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader. |
close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, getVersion, initReader, setVersion, tokenStream, tokenStreampublic MorfologikAnalyzer(String dictionaryResource)
dictionaryResource - A constant specifying which dictionary to choose. The
dictionary resource must be named morfologik/dictionaries/{dictionaryResource}.dict
and have an associated .info metadata file. See the Morfologik project
for details.@Deprecated public MorfologikAnalyzer(Version version, String dictionaryResource)
MorfologikAnalyzer(String)public MorfologikAnalyzer()
@Deprecated public MorfologikAnalyzer(Version version)
MorfologikAnalyzer()protected Analyzer.TokenStreamComponents createComponents(String field, Reader reader)
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader.createComponents in class Analyzerfield - ignored field namereader - source of tokensAnalyzer.TokenStreamComponents
built from an StandardTokenizer filtered with
StandardFilter and MorfologikFilter.Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.