Index (Lucene 9.10.0 kuromoji API)

A B C D E F G H I J K L M N O P Q R S T U V W
All Classes All Packages

A

accept() - Method in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilter
advance() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
ALPHA - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition

B

BaseFormAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes: Attribute for Token.getBaseForm().
BaseFormAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes: Attribute for Token.getBaseForm().
BaseFormAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
BinaryDictionary - Class in org.apache.lucene.analysis.ja.dict: Base class for a binary-encoded in-memory dictionary.
BinaryDictionary(IOSupplier<InputStream>, IOSupplier<InputStream>, IOSupplier<InputStream>) - Constructor for class org.apache.lucene.analysis.ja.dict.BinaryDictionary
BinaryDictionary.ResourceScheme - Enum in org.apache.lucene.analysis.ja.dict: Deprecated, for removal: This API element is subject to removal in a future version.
build(DictionaryBuilder.DictionaryFormat, Path, Path, String, boolean) - Static method in class org.apache.lucene.analysis.ja.util.DictionaryBuilder

C

calcNBestCost(String) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
CharacterDefinition - Class in org.apache.lucene.analysis.ja.dict: Character category data.
charAt(int) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
CharSequenceUtils - Class in org.apache.lucene.analysis.ja.completion: Utility functions for JapaneseCompletionFilter
CLASS_COUNT - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
CLASSPATH - org.apache.lucene.analysis.ja.dict.BinaryDictionary.ResourceScheme: Deprecated.
clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
close() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
ConnectionCosts - Class in org.apache.lucene.analysis.ja.dict: n-gram connection cost data
ConnectionCosts(URL) - Constructor for class org.apache.lucene.analysis.ja.dict.ConnectionCosts: Create a ConnectionCosts from an external resource URL (e.g.
ConnectionCosts(Path) - Constructor for class org.apache.lucene.analysis.ja.dict.ConnectionCosts: Create a ConnectionCosts from an external resource path.
ConnectionCosts(BinaryDictionary.ResourceScheme, String) - Constructor for class org.apache.lucene.analysis.ja.dict.ConnectionCosts: Deprecated, for removal: This API element is subject to removal in a future version.
replaced by ConnectionCosts(Path) for files and ConnectionCosts(URL) for classpath/module resources.
copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
correct(int) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
create(Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory
create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory
create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory
create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory
create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory
create(AttributeFactory) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
createComponents(String) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
createComponents(String) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
CYRILLIC - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition

D

DEFAULT - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
DEFAULT_MINIMUM_LENGTH - Static variable in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
DEFAULT_MODE - Static variable in class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
DEFAULT_MODE - Static variable in class org.apache.lucene.analysis.ja.JapaneseTokenizer: Default tokenization mode.
DICT_FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
DICT_HEADER - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
Dictionary - Interface in org.apache.lucene.analysis.ja.dict: Dictionary interface for retrieving morphological data by id.
DictionaryBuilder - Class in org.apache.lucene.analysis.ja.util: Tool to build dictionaries.
DictionaryBuilder.DictionaryFormat - Enum in org.apache.lucene.analysis.ja.util: Format of the dictionary.

E

end() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
EXTENDED - org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode: Extended mode outputs unigrams for unknown words.

F

FILE - org.apache.lucene.analysis.ja.dict.BinaryDictionary.ResourceScheme: Deprecated.
FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.ConnectionCosts
findTargetArc(int, FST.Arc<Long>, FST.Arc<Long>, boolean, FST.BytesReader) - Method in class org.apache.lucene.analysis.ja.dict.TokenInfoFST
finish() - Method in class org.apache.lucene.analysis.ja.GraphvizFormatter
FST_FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary

G

get(int, int) - Method in class org.apache.lucene.analysis.ja.dict.ConnectionCosts
getBaseForm() - Method in class org.apache.lucene.analysis.ja.Token
getBaseForm() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttribute
getBaseForm() - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
getBaseForm(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
getBaseForm(int, char[], int, int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary: Get base form of word
getBaseForm(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
getBytesReader() - Method in class org.apache.lucene.analysis.ja.dict.TokenInfoFST
getCharacterClass(char) - Method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
getCharacterDefinition() - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
getDefaultStopTags() - Static method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
getFirstArc(FST.Arc<Long>) - Method in class org.apache.lucene.analysis.ja.dict.TokenInfoFST
getFST() - Method in class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
getFST() - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
getInflectedFormTranslation(String) - Static method in class org.apache.lucene.analysis.ja.util.ToStringUtil: Get the english form of inflected form
getInflectionForm() - Method in class org.apache.lucene.analysis.ja.Token
getInflectionForm() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.InflectionAttribute
getInflectionForm() - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
getInflectionForm(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
getInflectionForm(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary: Get inflection form of tokens
getInflectionForm(int) - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
getInflectionForm(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
getInflectionType() - Method in class org.apache.lucene.analysis.ja.Token
getInflectionType() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.InflectionAttribute
getInflectionType() - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
getInflectionType(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
getInflectionType(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary: Get inflection type of tokens
getInflectionType(int) - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
getInflectionType(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
getInflectionTypeTranslation(String) - Static method in class org.apache.lucene.analysis.ja.util.ToStringUtil: Get the english form of inflection type
getInstance() - Static method in class org.apache.lucene.analysis.ja.completion.KatakanaRomanizer: Returns the singleton instance of KatakanaRomenizer
getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.ConnectionCosts
getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
getLeftId(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
getLeftId(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary: Get left id of specified word
getLeftId(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
getLength() - Method in class org.apache.lucene.analysis.ja.Token
getOffset() - Method in class org.apache.lucene.analysis.ja.Token
getPartOfSpeech() - Method in class org.apache.lucene.analysis.ja.Token
getPartOfSpeech() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttribute
getPartOfSpeech() - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
getPartOfSpeech(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
getPartOfSpeech(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary: Get Part-Of-Speech of tokens
getPartOfSpeech(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
getPosition() - Method in class org.apache.lucene.analysis.ja.Token: Get index of this token in input text
getPositionLength() - Method in class org.apache.lucene.analysis.ja.Token: Get the length (in tokens) of this token.
getPOSTranslation(String) - Static method in class org.apache.lucene.analysis.ja.util.ToStringUtil: Get the english form of a POS tag
getPronunciation() - Method in class org.apache.lucene.analysis.ja.Token
getPronunciation() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.ReadingAttribute
getPronunciation() - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
getPronunciation(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
getPronunciation(int, char[], int, int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary: Get pronunciation of tokens
getPronunciation(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
getReading() - Method in class org.apache.lucene.analysis.ja.Token
getReading() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.ReadingAttribute
getReading() - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
getReading(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
getReading(int, char[], int, int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary: Get reading of tokens
getReading(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
getReading(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
getResource(BinaryDictionary.ResourceScheme, String) - Static method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary: Deprecated, for removal: This API element is subject to removal in a future version.
getRightId(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
getRightId(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary: Get right id of specified word
getRightId(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
getRomanization(Appendable, CharSequence) - Static method in class org.apache.lucene.analysis.ja.util.ToStringUtil: Romanize katakana with modified hepburn
getRomanization(String) - Static method in class org.apache.lucene.analysis.ja.util.ToStringUtil: Romanize katakana with modified hepburn
getSurfaceForm() - Method in class org.apache.lucene.analysis.ja.Token
getSurfaceFormString() - Method in class org.apache.lucene.analysis.ja.Token
getType() - Method in class org.apache.lucene.analysis.ja.Token: Returns the type of this token
getWordCost(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
getWordCost(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary: Get word cost of specified word
getWordCost(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
GraphvizFormatter - Class in org.apache.lucene.analysis.ja: Outputs the dot (graphviz) string for the viterbi lattice.
GraphvizFormatter(ConnectionCosts) - Constructor for class org.apache.lucene.analysis.ja.GraphvizFormatter
GREEK - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition

H

HAS_BASEFORM - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary: flag that the entry has baseform data.
HAS_PRONUNCIATION - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary: flag that the entry has pronunciation data.
HAS_READING - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary: flag that the entry has reading data.
HEADER - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
HEADER - Static variable in class org.apache.lucene.analysis.ja.dict.ConnectionCosts
HIRAGANA - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition

I

incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseBaseFormFilter
incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseReadingFormFilter
incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
INDEX - org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode: Simple romanization.
InflectionAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes: Attribute for Kuromoji inflection data.
InflectionAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes: Attribute for Kuromoji inflection data.
InflectionAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
inform(ResourceLoader) - Method in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
inform(ResourceLoader) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
initReader(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
initReader(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
initReaderForNormalization(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
initReaderForNormalization(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
INTERNAL_SEPARATOR - Static variable in interface org.apache.lucene.analysis.ja.dict.Dictionary
IPADIC - org.apache.lucene.analysis.ja.util.DictionaryBuilder.DictionaryFormat: IPADIC format
isArabicNumeral(char) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter: Arabic numeral predicate.
isFullWidthLowercaseAlphabet(char) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils: Checks if a char is a full-width lowercase alphabet
isGroup(char) - Method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
isInvoke(char) - Method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
isKana(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils: Checks if a char sequence is composed only of Katakana or hiragana
isKanji(char) - Method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
isKatakanaOrHWAlphabets(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils: Checks if a char sequence is composed only of Katakana or lowercase alphabets
isKnown() - Method in class org.apache.lucene.analysis.ja.Token: Returns true if this token is known word
isLowercaseAlphabets(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils: Checks if a char sequence is composed only of lowercase alphabets
isNumeral(char) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter: Numeral predicate
isNumeral(String) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter: Numeral predicate
isNumeralPunctuation(char) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter: Numeral punctuation predicate
isNumeralPunctuation(String) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter: Numeral punctuation predicate
isUnknown() - Method in class org.apache.lucene.analysis.ja.Token: Returns true if this token is unknown word
isUser() - Method in class org.apache.lucene.analysis.ja.Token: Returns true if this token is defined in user dictionary

J

JapaneseAnalyzer - Class in org.apache.lucene.analysis.ja: Analyzer for Japanese that uses morphological analysis.
JapaneseAnalyzer() - Constructor for class org.apache.lucene.analysis.ja.JapaneseAnalyzer
JapaneseAnalyzer(UserDictionary, JapaneseTokenizer.Mode, CharArraySet, Set<String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseAnalyzer
JapaneseBaseFormFilter - Class in org.apache.lucene.analysis.ja: Replaces term text with the BaseFormAttribute.
JapaneseBaseFormFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseBaseFormFilter
JapaneseBaseFormFilterFactory - Class in org.apache.lucene.analysis.ja: Factory for JapaneseBaseFormFilter.
JapaneseBaseFormFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory: Default ctor for compatibility with SPI
JapaneseBaseFormFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory: Creates a new JapaneseBaseFormFilterFactory
JapaneseCompletionAnalyzer - Class in org.apache.lucene.analysis.ja: Analyzer for Japanese completion suggester.
JapaneseCompletionAnalyzer() - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer: Creates a new JapaneseCompletionAnalyzer with default configurations
JapaneseCompletionAnalyzer(UserDictionary, JapaneseCompletionFilter.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer: Creates a new JapaneseCompletionAnalyzer
JapaneseCompletionFilter - Class in org.apache.lucene.analysis.ja: A TokenFilter that adds Japanese romanized tokens to the term attribute.
JapaneseCompletionFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilter: Creates a new JapaneseCompletionFilter with default configurations
JapaneseCompletionFilter(TokenStream, JapaneseCompletionFilter.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilter: Creates a new JapaneseCompletionFilter
JapaneseCompletionFilter.Mode - Enum in org.apache.lucene.analysis.ja: Completion mode
JapaneseCompletionFilterFactory - Class in org.apache.lucene.analysis.ja: Factory for JapaneseCompletionFilter.
JapaneseCompletionFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory: Default ctor for compatibility with SPI
JapaneseCompletionFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory: Creates a new JapaneseCompletionFilterFactory
JapaneseIterationMarkCharFilter - Class in org.apache.lucene.analysis.ja: Normalizes Japanese horizontal iteration marks (odoriji) to their expanded form.
JapaneseIterationMarkCharFilter(Reader) - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter: Constructor.
JapaneseIterationMarkCharFilter(Reader, boolean, boolean) - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter: Constructor
JapaneseIterationMarkCharFilterFactory - Class in org.apache.lucene.analysis.ja: Factory for JapaneseIterationMarkCharFilter.
JapaneseIterationMarkCharFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory: Default ctor for compatibility with SPI
JapaneseIterationMarkCharFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory: Creates a new JapaneseIterationMarkCharFilterFactory
JapaneseKatakanaStemFilter - Class in org.apache.lucene.analysis.ja: A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC).
JapaneseKatakanaStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
JapaneseKatakanaStemFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
JapaneseKatakanaStemFilterFactory - Class in org.apache.lucene.analysis.ja: Factory for JapaneseKatakanaStemFilter.
JapaneseKatakanaStemFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory: Default ctor for compatibility with SPI
JapaneseKatakanaStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory: Creates a new JapaneseKatakanaStemFilterFactory
JapaneseNumberFilter - Class in org.apache.lucene.analysis.ja: A TokenFilter that normalizes Japanese numbers (kansūji) to regular Arabic decimal numbers in half-width characters.
JapaneseNumberFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilter
JapaneseNumberFilter.NumberBuffer - Class in org.apache.lucene.analysis.ja: Buffer that holds a Japanese number string and a position index used as a parsed-to marker
JapaneseNumberFilterFactory - Class in org.apache.lucene.analysis.ja: Factory for JapaneseNumberFilter.
JapaneseNumberFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory: Default ctor for compatibility with SPI
JapaneseNumberFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory
JapanesePartOfSpeechStopFilter - Class in org.apache.lucene.analysis.ja: Removes tokens that match a set of part-of-speech tags.
JapanesePartOfSpeechStopFilter(TokenStream, Set<String>) - Constructor for class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilter: Create a new JapanesePartOfSpeechStopFilter.
JapanesePartOfSpeechStopFilterFactory - Class in org.apache.lucene.analysis.ja: Factory for JapanesePartOfSpeechStopFilter.
JapanesePartOfSpeechStopFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory: Default ctor for compatibility with SPI
JapanesePartOfSpeechStopFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory: Creates a new JapanesePartOfSpeechStopFilterFactory
JapaneseReadingFormFilter - Class in org.apache.lucene.analysis.ja: A TokenFilter that replaces the term attribute with the reading of a token in either katakana or romaji form.
JapaneseReadingFormFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilter
JapaneseReadingFormFilter(TokenStream, boolean) - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilter
JapaneseReadingFormFilterFactory - Class in org.apache.lucene.analysis.ja: Factory for JapaneseReadingFormFilter.
JapaneseReadingFormFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory: Default ctor for compatibility with SPI
JapaneseReadingFormFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory: Creates a new JapaneseReadingFormFilterFactory
JapaneseTokenizer - Class in org.apache.lucene.analysis.ja: Tokenizer for Japanese that uses morphological analysis.
JapaneseTokenizer(UserDictionary, boolean, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer: Create a new JapaneseTokenizer.
JapaneseTokenizer(UserDictionary, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer: Create a new JapaneseTokenizer.
JapaneseTokenizer(AttributeFactory, TokenInfoDictionary, UnknownDictionary, ConnectionCosts, UserDictionary, boolean, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer: Create a new JapaneseTokenizer, supplying a custom system dictionary and unknown dictionary.
JapaneseTokenizer(AttributeFactory, UserDictionary, boolean, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer: Create a new JapaneseTokenizer using the system and unknown dictionaries shipped with Lucene.
JapaneseTokenizer(AttributeFactory, UserDictionary, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer: Create a new JapaneseTokenizer using the system and unknown dictionaries shipped with Lucene.
JapaneseTokenizer.Mode - Enum in org.apache.lucene.analysis.ja: Tokenization mode: this determines how the tokenizer handles compound and unknown words.
JapaneseTokenizer.Type - Enum in org.apache.lucene.analysis.ja: Token type reflecting the original source of this token
JapaneseTokenizerFactory - Class in org.apache.lucene.analysis.ja: Factory for JapaneseTokenizer.
JapaneseTokenizerFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory: Default ctor for compatibility with SPI
JapaneseTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory: Creates a new JapaneseTokenizerFactory

K

KANJI - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
KANJINUMERIC - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
KATAKANA - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
KatakanaRomanizer - Class in org.apache.lucene.analysis.ja.completion: Converts a Katakana string to Romaji using the pre-defined Katakana-Romaji mapping rules.
KNOWN - org.apache.lucene.analysis.ja.JapaneseTokenizer.Type: Known words from the system dictionary.

L

LEFT_ID - Static variable in class org.apache.lucene.analysis.ja.dict.UserDictionary
length() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
lookup(char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
lookup(char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary: Lookup words in text
lookupCharacterClass(String) - Static method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
lookupSegmentation(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
lookupWordIds(int, IntsRef) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary

M

main(String[]) - Static method in class org.apache.lucene.analysis.ja.util.DictionaryBuilder

N

NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory: SPI name
NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory: SPI name
NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory: SPI name
NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory: SPI name
NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory: SPI name
NAME - Static variable in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory: SPI name
NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory: SPI name
NGRAM - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
NO_OUTPUT - Variable in class org.apache.lucene.analysis.ja.dict.TokenInfoFST
NORMAL - org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode: Ordinary segmentation: no decomposition for compounds,
normalize(Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
NORMALIZE_KANA_DEFAULT - Static variable in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter: Normalize kana iteration marks by default
NORMALIZE_KANJI_DEFAULT - Static variable in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter: Normalize kanji iteration marks by default
normalizeNumber(String) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter: Normalizes a Japanese number
NumberBuffer(String) - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
NUMERIC - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition

O

open(Reader) - Static method in class org.apache.lucene.analysis.ja.dict.UserDictionary
org.apache.lucene.analysis.ja - package org.apache.lucene.analysis.ja: Analyzer for Japanese.
org.apache.lucene.analysis.ja.completion - package org.apache.lucene.analysis.ja.completion: Utilities for JapaneseCompletionFilter
org.apache.lucene.analysis.ja.dict - package org.apache.lucene.analysis.ja.dict: Kuromoji dictionary implementation.
org.apache.lucene.analysis.ja.tokenattributes - package org.apache.lucene.analysis.ja.tokenattributes: Additional Kuromoji-specific Attributes for text analysis.
org.apache.lucene.analysis.ja.util - package org.apache.lucene.analysis.ja.util: Kuromoji utility classes.

P

parseLargeKanjiNumeral(JapaneseNumberFilter.NumberBuffer) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter: Parse large kanji numerals (ten thousands or larger)
parseMediumKanjiNumeral(JapaneseNumberFilter.NumberBuffer) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter: Parse medium kanji numerals (tens, hundreds or thousands)
PartOfSpeechAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes: Attribute for Token.getPartOfSpeech().
PartOfSpeechAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes: Attribute for Token.getPartOfSpeech().
PartOfSpeechAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
POSDICT_FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
POSDICT_HEADER - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
position() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer

Q

QUERY - org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode: Input Method aware romanization.

R

read() - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
read(char[], int, int) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
ReadingAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes: Attribute for Kuromoji reading data
ReadingAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes: Attribute for Kuromoji reading data
ReadingAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
reset() - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
reset() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
reset() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
RIGHT_ID - Static variable in class org.apache.lucene.analysis.ja.dict.UserDictionary
romanize(CharsRef) - Method in class org.apache.lucene.analysis.ja.completion.KatakanaRomanizer: Translates a sequence of katakana to romaji.

S

SEARCH - org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode: Segmentation geared towards search: this includes a decompounding process for long nouns, also including the full compound token as a synonym.
setGraphvizFormatter(GraphvizFormatter) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer: Expert: set this to produce graphviz (dot) output of the Viterbi lattice
setNBestCost(int) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
setPositionLength(int) - Method in class org.apache.lucene.analysis.ja.Token: Set the position length (in tokens) of this token.
setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttribute
setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.InflectionAttribute
setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttribute
setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.ReadingAttribute
setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
SPACE - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
SYMBOL - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition

T

TARGETMAP_FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
TARGETMAP_HEADER - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
toKatakana(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils: Convert all hiragana in a string into kanataka
Token - Class in org.apache.lucene.analysis.ja: Analyzed token with morphological data from its dictionary.
Token(int, char[], int, int, JapaneseTokenizer.Type, int, Dictionary) - Constructor for class org.apache.lucene.analysis.ja.Token
TokenInfoDictionary - Class in org.apache.lucene.analysis.ja.dict: Binary dictionary implementation for a known-word dictionary model: Words are encoded into an FST mapping to a list of wordIDs.
TokenInfoDictionary(URL, URL, URL, URL) - Constructor for class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary: Create a TokenInfoDictionary from an external resource URL (e.g.
TokenInfoDictionary(Path, Path, Path, Path) - Constructor for class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary: Create a TokenInfoDictionary from an external resource path.
TokenInfoDictionary(BinaryDictionary.ResourceScheme, String) - Constructor for class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary: Deprecated, for removal: This API element is subject to removal in a future version.
replaced by TokenInfoDictionary(Path, Path, Path, Path) for files and TokenInfoDictionary(URL, URL, URL, URL) for classpath/module resources
TokenInfoFST - Class in org.apache.lucene.analysis.ja.dict: Thin wrapper around an FST with root-arc caching for Japanese.
TokenInfoFST(FST<Long>, boolean) - Constructor for class org.apache.lucene.analysis.ja.dict.TokenInfoFST
toString() - Method in class org.apache.lucene.analysis.ja.Token
ToStringUtil - Class in org.apache.lucene.analysis.ja.util: Utility class for english translations of morphological data, used only for debugging.
ToStringUtil() - Constructor for class org.apache.lucene.analysis.ja.util.ToStringUtil

U

UNIDIC - org.apache.lucene.analysis.ja.util.DictionaryBuilder.DictionaryFormat: UNIDIC format
UNKNOWN - org.apache.lucene.analysis.ja.JapaneseTokenizer.Type: Unknown words (heuristically segmented).
UnknownDictionary - Class in org.apache.lucene.analysis.ja.dict: Dictionary for unknown-word handling.
UnknownDictionary(URL, URL, URL) - Constructor for class org.apache.lucene.analysis.ja.dict.UnknownDictionary: Create a UnknownDictionary from an external resource URL (e.g.
UnknownDictionary(Path, Path, Path) - Constructor for class org.apache.lucene.analysis.ja.dict.UnknownDictionary: Create a UnknownDictionary from an external resource path.
UnknownDictionary(BinaryDictionary.ResourceScheme, String) - Constructor for class org.apache.lucene.analysis.ja.dict.UnknownDictionary: Deprecated, for removal: This API element is subject to removal in a future version.
replaced by UnknownDictionary(Path, Path, Path) for files and UnknownDictionary(URL, URL, URL) for classpath/module resources
USER - org.apache.lucene.analysis.ja.JapaneseTokenizer.Type: Known words from the user dictionary.
UserDictionary - Class in org.apache.lucene.analysis.ja.dict: Class for building a User Dictionary.

V

valueOf(String) - Static method in enum org.apache.lucene.analysis.ja.dict.BinaryDictionary.ResourceScheme: Deprecated.

Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode: Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode: Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.lucene.analysis.ja.JapaneseTokenizer.Type: Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.lucene.analysis.ja.util.DictionaryBuilder.DictionaryFormat: Returns the enum constant of this type with the specified name.
values() - Static method in enum org.apache.lucene.analysis.ja.dict.BinaryDictionary.ResourceScheme: Deprecated.

Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode: Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode: Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.lucene.analysis.ja.JapaneseTokenizer.Type: Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.lucene.analysis.ja.util.DictionaryBuilder.DictionaryFormat: Returns an array containing the constants of this enum type, in the order they are declared.
VERSION - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
VERSION - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
VERSION - Static variable in class org.apache.lucene.analysis.ja.dict.ConnectionCosts

W

WORD_COST - Static variable in class org.apache.lucene.analysis.ja.dict.UserDictionary

A B C D E F G H I J K L M N O P Q R S T U V W
All Classes All Packages