All Classes and Interfaces (Lucene 10.0.0 kuromoji API)

Class

Description

BaseFormAttribute

Attribute for Token.getBaseForm().

BaseFormAttributeImpl

Attribute for Token.getBaseForm().

CharacterDefinition

Character category data.

CharSequenceUtils

Utility functions for JapaneseCompletionFilter

ConnectionCosts

n-gram connection cost data

DictionaryBuilder

Tool to build dictionaries.

DictionaryBuilder.DictionaryFormat

Format of the dictionary.

InflectionAttribute

Attribute for Kuromoji inflection data.

InflectionAttributeImpl

Attribute for Kuromoji inflection data.

JaMorphData

Represents Japanese morphological information.

JapaneseAnalyzer

Analyzer for Japanese that uses morphological analysis.

JapaneseBaseFormFilter

Replaces term text with the BaseFormAttribute.

JapaneseBaseFormFilterFactory

Factory for JapaneseBaseFormFilter.

JapaneseCompletionAnalyzer

Analyzer for Japanese completion suggester.

JapaneseCompletionFilter

A TokenFilter that adds Japanese romanized tokens to the term attribute.

JapaneseCompletionFilter.Mode

Completion mode

JapaneseCompletionFilterFactory

Factory for JapaneseCompletionFilter.

JapaneseHiraganaUppercaseFilter

A TokenFilter that normalizes small letters (捨て仮名) in hiragana into normal letters.

JapaneseHiraganaUppercaseFilterFactory

Factory for JapaneseHiraganaUppercaseFilter.

JapaneseIterationMarkCharFilter

Normalizes Japanese horizontal iteration marks (odoriji) to their expanded form.

JapaneseIterationMarkCharFilterFactory

Factory for JapaneseIterationMarkCharFilter.

JapaneseKatakanaStemFilter

A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC).

JapaneseKatakanaStemFilterFactory

Factory for JapaneseKatakanaStemFilter.

JapaneseKatakanaUppercaseFilter

A TokenFilter that normalizes small letters (捨て仮名) in katakana into normal letters.

JapaneseKatakanaUppercaseFilterFactory

Factory for JapaneseKatakanaUppercaseFilter.

JapaneseNumberFilter

A TokenFilter that normalizes Japanese numbers (kansūji) to regular Arabic decimal numbers in half-width characters.

JapaneseNumberFilter.NumberBuffer

Buffer that holds a Japanese number string and a position index used as a parsed-to marker

JapaneseNumberFilterFactory

Factory for JapaneseNumberFilter.

JapanesePartOfSpeechStopFilter

Removes tokens that match a set of part-of-speech tags.

JapanesePartOfSpeechStopFilterFactory

Factory for JapanesePartOfSpeechStopFilter.

JapaneseReadingFormFilter

A TokenFilter that replaces the term attribute with the reading of a token in either katakana or romaji form.

JapaneseReadingFormFilterFactory

Factory for JapaneseReadingFormFilter.

JapaneseTokenizer

Tokenizer for Japanese that uses morphological analysis.

JapaneseTokenizer.Mode

Tokenization mode: this determines how the tokenizer handles compound and unknown words.

JapaneseTokenizerFactory

Factory for JapaneseTokenizer.

KatakanaRomanizer

Converts a Katakana string to Romaji using the pre-defined Katakana-Romaji mapping rules.

PartOfSpeechAttribute

Attribute for Token.getPartOfSpeech().

PartOfSpeechAttributeImpl

Attribute for Token.getPartOfSpeech().

ReadingAttribute

Attribute for Kuromoji reading data

ReadingAttributeImpl

Attribute for Kuromoji reading data

Token

Analyzed token with morphological data from its dictionary.

TokenInfoDictionary

Binary dictionary implementation for a known-word dictionary model: Words are encoded into an FST mapping to a list of wordIDs.

TokenInfoFST

Thin wrapper around an FST with root-arc caching for Japanese.

ToStringUtil

Utility class for english translations of morphological data, used only for debugging.

UnknownDictionary

Dictionary for unknown-word handling.

UserDictionary

Class for building a User Dictionary.