public class MorfologikAnalyzer extends Analyzer
Analyzer
using Morfologik library.Analyzer.GlobalReuseStrategy, Analyzer.PerFieldReuseStrategy, Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents
GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY
Constructor and Description |
---|
MorfologikAnalyzer()
Builds an analyzer with the default Morfologik's Polish dictionary.
|
MorfologikAnalyzer(String dictionaryResource)
Builds an analyzer with an explicit dictionary resource.
|
MorfologikAnalyzer(Version version)
Deprecated.
|
MorfologikAnalyzer(Version version,
String dictionaryResource)
Deprecated.
|
Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents |
createComponents(String field,
Reader reader)
Creates a
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader . |
close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, getVersion, initReader, setVersion, tokenStream, tokenStream
public MorfologikAnalyzer(String dictionaryResource)
dictionaryResource
- A constant specifying which dictionary to choose. The
dictionary resource must be named morfologik/dictionaries/{dictionaryResource}.dict
and have an associated .info
metadata file. See the Morfologik project
for details.@Deprecated public MorfologikAnalyzer(Version version, String dictionaryResource)
MorfologikAnalyzer(String)
public MorfologikAnalyzer()
@Deprecated public MorfologikAnalyzer(Version version)
MorfologikAnalyzer()
protected Analyzer.TokenStreamComponents createComponents(String field, Reader reader)
Analyzer.TokenStreamComponents
which tokenizes all the text in the provided Reader
.createComponents
in class Analyzer
field
- ignored field namereader
- source of tokensAnalyzer.TokenStreamComponents
built from an StandardTokenizer
filtered with
StandardFilter
and MorfologikFilter
.Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.