Package org.apache.lucene.analysis.ko
Class KoreanAnalyzer
java.lang.Object
org.apache.lucene.analysis.Analyzer
org.apache.lucene.analysis.ko.KoreanAnalyzer
- All Implemented Interfaces:
Closeable
,AutoCloseable
Analyzer for Korean that uses morphological analysis.
- Since:
- 7.4.0
- See Also:
- WARNING: This API is experimental and might change in incompatible ways in the next release.
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.lucene.analysis.Analyzer
Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents
-
Field Summary
Fields inherited from class org.apache.lucene.analysis.Analyzer
GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY
-
Constructor Summary
ConstructorDescriptionCreates a new KoreanAnalyzer.KoreanAnalyzer
(UserDictionary userDict, KoreanTokenizer.DecompoundMode mode, Set<POS.Tag> stopTags, boolean outputUnknownUnigrams) Creates a new KoreanAnalyzer. -
Method Summary
Modifier and TypeMethodDescriptionprotected Analyzer.TokenStreamComponents
createComponents
(String fieldName) protected TokenStream
normalize
(String fieldName, TokenStream in) Methods inherited from class org.apache.lucene.analysis.Analyzer
attributeFactory, close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, initReader, initReaderForNormalization, normalize, tokenStream, tokenStream
-
Constructor Details
-
KoreanAnalyzer
public KoreanAnalyzer()Creates a new KoreanAnalyzer. -
KoreanAnalyzer
public KoreanAnalyzer(UserDictionary userDict, KoreanTokenizer.DecompoundMode mode, Set<POS.Tag> stopTags, boolean outputUnknownUnigrams) Creates a new KoreanAnalyzer.- Parameters:
userDict
- Optional: if non-null, user dictionary.mode
- Decompound mode.stopTags
- The set of part of speech that should be filtered.outputUnknownUnigrams
- If true outputs unigrams for unknown words.
-
-
Method Details
-
createComponents
- Specified by:
createComponents
in classAnalyzer
-
normalize
-