Analyzer for Korean.
Class Summary Class Description DecompoundTokenA token that was generated from a compound. DictionaryTokenA token stored in a
GraphvizFormatterOutputs the dot (graphviz) string for the viterbi lattice. KoreanAnalyzerAnalyzer for Korean that uses morphological analysis. KoreanPartOfSpeechStopFilterRemoves tokens that match a set of part-of-speech tags. KoreanPartOfSpeechStopFilterFactoryFactory for
KoreanReadingFormFilterReplaces term text with the
ReadingAttributewhich is the Hangul transcription of Hanja characters.
KoreanTokenizerTokenizer for Korean that uses morphological analysis. KoreanTokenizerFactoryFactory for
POSPart of speech classification for Korean based on Sejong corpus classification. TokenAnalyzed token with morphological data.
Enum Summary Enum Description KoreanTokenizer.DecompoundMode KoreanTokenizer.TypeToken type reflecting the original source of this token POS.TagPart of speech tag for Korean based on Sejong corpus classification. POS.TypeThe type of the token.