KoreanAnalyzer (Lucene 8.8.2 API)

Skip navigation links

All Classes

Summary:
Nested |
Field |
Constr |
Method

Detail:
Field |
Constr |
Method

java.lang.Object
- org.apache.lucene.analysis.Analyzer
- - org.apache.lucene.analysis.ko.KoreanAnalyzer

All Implemented Interfaces:

Closeable, AutoCloseable
```
public class KoreanAnalyzer
extends Analyzer
```
Analyzer for Korean that uses morphological analysis.

Since:

7.4.0

See Also:

KoreanTokenizer

WARNING: This API is experimental and might change in incompatible ways in the next release.

Nested Class Summary
- Nested classes/interfaces inherited from class org.apache.lucene.analysis.Analyzer
  Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents

Field Summary
- Fields inherited from class org.apache.lucene.analysis.Analyzer
  GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY

Constructor Summary

Constructors
Constructor and Description
`KoreanAnalyzer()` Creates a new KoreanAnalyzer.
`KoreanAnalyzer(UserDictionary userDict, KoreanTokenizer.DecompoundMode mode, Set<POS.Tag> stopTags, boolean outputUnknownUnigrams)` Creates a new KoreanAnalyzer.

Method Summary

All Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`protected Analyzer.TokenStreamComponents`	`createComponents(String fieldName)`
`protected TokenStream`	`normalize(String fieldName, TokenStream in)`

Methods inherited from class org.apache.lucene.analysis.Analyzer
attributeFactory, close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, getVersion, initReader, initReaderForNormalization, normalize, setVersion, tokenStream, tokenStream

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Constructor Detail
- KoreanAnalyzer
```
public KoreanAnalyzer()
```
  Creates a new KoreanAnalyzer.
- KoreanAnalyzer
```
public KoreanAnalyzer(UserDictionary userDict,
                      KoreanTokenizer.DecompoundMode mode,
                      Set<POS.Tag> stopTags,
                      boolean outputUnknownUnigrams)
```
  Creates a new KoreanAnalyzer.
  
  Parameters:
  
  userDict - Optional: if non-null, user dictionary.
  
  mode - Decompound mode.
  
  stopTags - The set of part of speech that should be filtered.
  
  outputUnknownUnigrams - If true outputs unigrams for unknown words.

Method Detail

createComponents

protected Analyzer.TokenStreamComponents createComponents(String fieldName)

Specified by:: createComponents in class Analyzer

normalize

protected TokenStream normalize(String fieldName,
                                TokenStream in)

Overrides:: normalize in class Analyzer

Skip navigation links

All Classes

Summary:
Nested |
Field |
Constr |
Method

Detail:
Field |
Constr |
Method

Copyright © 2000-2021 Apache Software Foundation. All Rights Reserved.