All Classes
-
All Classes Interface Summary Class Summary Enum Summary Class Description BinaryDictionary Base class for a binary-encoded in-memory dictionary.BinaryDictionary.ResourceScheme Deprecated, for removal: This API element is subject to removal in a future version. CharacterDefinition Character category data.ConnectionCosts n-gram connection cost dataDecompoundToken A token that was generated from a compound.Dictionary Dictionary interface for retrieving morphological data by id.Dictionary.Morpheme A morpheme extracted from a compound token.DictionaryBuilder Tool to build dictionaries.DictionaryToken A token stored in aDictionary
.GraphvizFormatter Outputs the dot (graphviz) string for the viterbi lattice.KoreanAnalyzer Analyzer for Korean that uses morphological analysis.KoreanNumberFilter ATokenFilter
that normalizes Korean numbers to regular Arabic decimal numbers in half-width characters.KoreanNumberFilter.NumberBuffer Buffer that holds a Korean number string and a position index used as a parsed-to markerKoreanNumberFilterFactory Factory forKoreanNumberFilter
.KoreanPartOfSpeechStopFilter Removes tokens that match a set of part-of-speech tags.KoreanPartOfSpeechStopFilterFactory Factory forKoreanPartOfSpeechStopFilter
.KoreanReadingFormFilter Replaces term text with theReadingAttribute
which is the Hangul transcription of Hanja characters.KoreanReadingFormFilterFactory Factory forKoreanReadingFormFilter
.KoreanTokenizer Tokenizer for Korean that uses morphological analysis.KoreanTokenizer.DecompoundMode Decompound mode: this determines how the tokenizer handlesPOS.Type.COMPOUND
,POS.Type.INFLECT
andPOS.Type.PREANALYSIS
tokens.KoreanTokenizer.Type Token type reflecting the original source of this tokenKoreanTokenizerFactory Factory forKoreanTokenizer
.PartOfSpeechAttribute Part of Speech attributes for Korean.PartOfSpeechAttributeImpl Part of Speech attributes for Korean.POS Part of speech classification for Korean based on Sejong corpus classification.POS.Tag Part of speech tag for Korean based on Sejong corpus classification.POS.Type The type of the token.ReadingAttribute Attribute for Korean reading dataReadingAttributeImpl Attribute for Korean reading dataToken Analyzed token with morphological data.TokenInfoDictionary Binary dictionary implementation for a known-word dictionary model: Words are encoded into an FST mapping to a list of wordIDs.TokenInfoFST Thin wrapper around an FST with root-arc caching for Hangul syllables (11,172 arcs).UnknownDictionary Dictionary for unknown-word handling.UserDictionary Class for building a User Dictionary.