public class MorfologikAnalyzer extends Analyzer
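Analyzer using the Morfologik library.
Nested classes/interfaces inherited from class Analyzer: Analyzer.GlobalReuseStrategy, Analyzer.PerFieldReuseStrategy, Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents
Fields inherited from class Analyzer: GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY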
Constructor and Description |
---|
MorfologikAnalyzer(Version version): Builds an analyzer with Morfologik's default Polish dictionary. |
MorfologikAnalyzer(Version version, String dictionaryResource): Builds an analyzer with an explicit dictionary resource. |
Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents | createComponents(String field, Reader reader): Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader. |
Methods inherited from class Analyzer: close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, initReader, tokenStream, tokenStream
public MorfologikAnalyzer(Version version, String dictionaryResource)
Builds an analyzer with an explicit dictionary resource.
Parameters:
version - Lucene compatibility version
dictionaryResource - A constant specifying which dictionary to choose. The dictionary resource must be named morfologik/dictionaries/{dictionaryResource}.dict and have an associated .info metadata file. See the Morfologik project for details.

public MorfologikAnalyzer(Version version)
Builds an analyzer with Morfologik's default Polish dictionary.
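
The naming rule for dictionaryResource in the two-argument constructor can be spelled out in a few lines of plain Java. This is a minimal sketch, not code from Lucene or Morfologik: the class DictionaryResourcePaths and its methods are hypothetical, the name "pl" is only an assumed example value, and the sketch assumes the .info metadata file shares the dictionary's base name.

```java
/**
 * Hypothetical helper (not part of Lucene or Morfologik) that spells out
 * the resource naming rule for a dictionaryResource value.
 */
final class DictionaryResourcePaths {
    // Directory prefix taken from the naming rule in the constructor documentation.
    static final String PREFIX = "morfologik/dictionaries/";

    /** Classpath location of the dictionary itself. */
    static String dictPath(String dictionaryResource) {
        return PREFIX + requireName(dictionaryResource) + ".dict";
    }

    /** Assumed location of the metadata file: same base name, ".info" extension. */
    static String infoPath(String dictionaryResource) {
        return PREFIX + requireName(dictionaryResource) + ".info";
    }

    /** True when both resources are visible to the given class loader. */
    static boolean isAvailable(String dictionaryResource, ClassLoader loader) {
        return loader.getResource(dictPath(dictionaryResource)) != null
                && loader.getResource(infoPath(dictionaryResource)) != null;
    }

    // Rejects null or empty names before any path is built.
    private static String requireName(String dictionaryResource) {
        if (dictionaryResource == null || dictionaryResource.isEmpty()) {
            throw new IllegalArgumentException("dictionaryResource must be a non-empty name");
        }
        return dictionaryResource;
    }

    public static void main(String[] args) {
        String name = "pl"; // assumed example name, used only for illustration
        System.out.println(dictPath(name)); // prints morfologik/dictionaries/pl.dict
        System.out.println(infoPath(name)); // prints morfologik/dictionaries/pl.info
        // Whether this is true depends on what is on the classpath at run time.
        System.out.println(isAvailable(name, ClassLoader.getSystemClassLoader()));
    }
}
```

A check like isAvailable can be run before constructing the analyzer to confirm that a custom dictionary and its metadata file are both on the classpath.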
protected Analyzer.TokenStreamComponents createComponents(String field, Reader reader)
Creates an Analyzer.TokenStreamComponents which tokenizes all the text in the provided Reader.
Specified by:
createComponents in class Analyzer
Parameters:
field - ignored field name
reader - source of tokens
Returns:
Analyzer.TokenStreamComponents built from a StandardTokenizer filtered with StandardFilter and MorfologikFilter.

Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.
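
The stage order used by createComponents (a tokenizer, then two filters applied in sequence) can be pictured with a toy model in plain Java. This sketch makes several assumptions: AnalysisChainSketch and its methods are hypothetical stand-ins rather than Lucene classes, the lemma map is illustrative data supplied by the caller rather than a Morfologik dictionary, and each token yields at most one lemma, which simplifies what a real morphological filter may emit.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/**
 * Toy model of a tokenizer followed by two filter stages.
 * None of these methods are Lucene classes; they only mirror the stage order.
 */
final class AnalysisChainSketch {
    /** Stage 1 (stand-in for a tokenizer): split on runs of non-letter, non-digit characters. */
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.split("[^\\p{L}\\p{Nd}]+")) {
            if (!piece.isEmpty()) {
                tokens.add(piece);
            }
        }
        return tokens;
    }

    /** Stage 2 (stand-in for a normalizing filter): passes tokens through unchanged here. */
    static List<String> normalize(List<String> tokens) {
        return new ArrayList<>(tokens);
    }

    /** Stage 3 (stand-in for dictionary lemmatization): replace known forms, keep the rest. */
    static List<String> lemmatize(List<String> tokens, Map<String, String> lemmas) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            out.add(lemmas.getOrDefault(token, token));
        }
        return out;
    }

    /** Runs the three stages in the same order as the documented chain. */
    static List<String> analyze(String text, Map<String, String> lemmas) {
        return lemmatize(normalize(tokenize(text)), lemmas);
    }

    public static void main(String[] args) {
        // Illustrative lemma data only; a real dictionary is far larger.
        Map<String, String> lemmas = new HashMap<>();
        lemmas.put("koty", "kot");
        lemmas.put("psy", "pies");
        System.out.println(analyze("psy i koty", lemmas)); // prints [pies, i, kot]
    }
}
```

Each stage consumes the output of the previous one, so the lemmatizing stage only ever sees tokens that have already been split and normalized.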