public class MorfologikAnalyzer extends Analyzer
Analyzer using Morfologik library.Analyzer.ReuseStrategy, Analyzer.TokenStreamComponentsGLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY| Constructor and Description |
|---|
MorfologikAnalyzer()
Builds an analyzer with the default Morfologik's Polish dictionary.
|
MorfologikAnalyzer(String dictionaryResource)
Builds an analyzer with an explicit dictionary resource.
|
| Modifier and Type | Method and Description |
|---|---|
protected Analyzer.TokenStreamComponents |
createComponents(String field)
Creates a
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader. |
close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, getVersion, initReader, setVersion, tokenStream, tokenStreampublic MorfologikAnalyzer(String dictionaryResource)
dictionaryResource - A constant specifying which dictionary to choose. The
dictionary resource must be named morfologik/dictionaries/{dictionaryResource}.dict
and have an associated .info metadata file. See the Morfologik project
for details.public MorfologikAnalyzer()
protected Analyzer.TokenStreamComponents createComponents(String field)
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader.createComponents in class Analyzerfield - ignored field nameAnalyzer.TokenStreamComponents
built from an StandardTokenizer filtered with
StandardFilter and MorfologikFilter.Copyright © 2000-2015 Apache Software Foundation. All Rights Reserved.