public class MorfologikAnalyzer extends Analyzer
Analyzer
using Morfologik library.Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents
GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY
Constructor and Description |
---|
MorfologikAnalyzer()
Builds an analyzer with the default Morfologik's Polish dictionary.
|
MorfologikAnalyzer(String dictionaryResource)
Builds an analyzer with an explicit dictionary resource.
|
Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents |
createComponents(String field)
Creates a
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader . |
close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, getVersion, initReader, setVersion, tokenStream, tokenStream
public MorfologikAnalyzer(String dictionaryResource)
dictionaryResource
- A constant specifying which dictionary to choose. The
dictionary resource must be named morfologik/dictionaries/{dictionaryResource}.dict
and have an associated .info
metadata file. See the Morfologik project
for details.public MorfologikAnalyzer()
protected Analyzer.TokenStreamComponents createComponents(String field)
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
.createComponents
in class Analyzer
field
- ignored field nameAnalyzer.TokenStreamComponents
built from an StandardTokenizer
filtered with
StandardFilter
and MorfologikFilter
.Copyright © 2000-2015 Apache Software Foundation. All Rights Reserved.