Package org.apache.lucene.analysis.ko
package org.apache.lucene.analysis.ko
Analyzer for Korean.
-
ClassDescriptionA token that was generated from a compound.A token stored in a
Dictionary
.Outputs the dot (graphviz) string for the viterbi lattice.Analyzer for Korean that uses morphological analysis.ATokenFilter
that normalizes Korean numbers to regular Arabic decimal numbers in half-width characters.Buffer that holds a Korean number string and a position index used as a parsed-to markerFactory forKoreanNumberFilter
.Removes tokens that match a set of part-of-speech tags.Factory forKoreanPartOfSpeechStopFilter
.Replaces term text with theReadingAttribute
which is the Hangul transcription of Hanja characters.Factory forKoreanReadingFormFilter
.Tokenizer for Korean that uses morphological analysis.Decompound mode: this determines how the tokenizer handlesPOS.Type.COMPOUND
,POS.Type.INFLECT
andPOS.Type.PREANALYSIS
tokens.Token type reflecting the original source of this tokenFactory forKoreanTokenizer
.Part of speech classification for Korean based on Sejong corpus classification.Part of speech tag for Korean based on Sejong corpus classification.The type of the token.Analyzed token with morphological data.