Outputs the dot (graphviz) string for the viterbi lattice.
Analyzer for Japanese that uses morphological analysis.
Replaces term text with the
TokenFilter that normalizes common katakana spelling variations
ending in a long sound character by removing this character (U+30FC).
Removes tokens that match a set of part-of-speech tags.
TokenFilter that replaces the term
attribute with the reading of a token in either katakana or romaji form.
Tokenizer for Japanese that uses morphological analysis.
Analyzed token with morphological data from its dictionary.
Tokenization mode: this determines how the tokenizer handles compound and unknown words.