Index
All Classes and Interfaces|All Packages|Constant Field Values
A
- accept() - Method in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilter
- advance() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
- ALPHA - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
B
- BaseFormAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for Token.getBaseForm().
- BaseFormAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for Token.getBaseForm().
- BaseFormAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
- build(DictionaryBuilder.DictionaryFormat, Path, Path, String, boolean) - Static method in class org.apache.lucene.analysis.ja.dict.DictionaryBuilder
C
- calcNBestCost(String) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
- CharacterDefinition - Class in org.apache.lucene.analysis.ja.dict
-
Character category data.
- charAt(int) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
- CharSequenceUtils - Class in org.apache.lucene.analysis.ja.completion
-
Utility functions for JapaneseCompletionFilter
- CLASS_COUNT - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
- clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
- clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
- clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
- close() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
- ConnectionCosts - Class in org.apache.lucene.analysis.ja.dict
-
n-gram connection cost data
- ConnectionCosts(URL) - Constructor for class org.apache.lucene.analysis.ja.dict.ConnectionCosts
-
Create a ConnectionCosts from an external resource URL (e.g.
- ConnectionCosts(Path) - Constructor for class org.apache.lucene.analysis.ja.dict.ConnectionCosts
-
Create a ConnectionCosts from an external resource path.
- copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
- copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
- copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
- copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
- correct(int) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
- create(Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseHiraganaUppercaseFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseKatakanaUppercaseFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory
- create(AttributeFactory) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
- createComponents(String) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- createComponents(String) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
- CYRILLIC - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
D
- DEFAULT - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- DEFAULT_MINIMUM_LENGTH - Static variable in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
- DEFAULT_MODE - Static variable in class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
- DEFAULT_MODE - Static variable in class org.apache.lucene.analysis.ja.JapaneseTokenizer
-
Default tokenization mode.
- DictionaryBuilder - Class in org.apache.lucene.analysis.ja.dict
-
Tool to build dictionaries.
- DictionaryBuilder.DictionaryFormat - Enum Class in org.apache.lucene.analysis.ja.dict
-
Format of the dictionary.
E
- end() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
- EXTENDED - Enum constant in enum class org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode
-
Extended mode outputs unigrams for unknown words.
F
- FST_FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
G
- getBaseForm() - Method in class org.apache.lucene.analysis.ja.Token
- getBaseForm() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttribute
- getBaseForm() - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
- getBaseForm(int, char[], int, int) - Method in interface org.apache.lucene.analysis.ja.dict.JaMorphData
-
Get base form of word
- getCharacterDefinition() - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- getDefaultStopTags() - Static method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- getFST() - Method in class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
- getFST() - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
- getInflectedFormTranslation(String) - Static method in class org.apache.lucene.analysis.ja.dict.ToStringUtil
-
Get the English form of an inflected form
- getInflectionForm() - Method in class org.apache.lucene.analysis.ja.Token
- getInflectionForm() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.InflectionAttribute
- getInflectionForm() - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
- getInflectionForm(int) - Method in interface org.apache.lucene.analysis.ja.dict.JaMorphData
-
Get inflection form of tokens
- getInflectionType() - Method in class org.apache.lucene.analysis.ja.Token
- getInflectionType() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.InflectionAttribute
- getInflectionType() - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
- getInflectionType(int) - Method in interface org.apache.lucene.analysis.ja.dict.JaMorphData
-
Get inflection type of tokens
- getInflectionTypeTranslation(String) - Static method in class org.apache.lucene.analysis.ja.dict.ToStringUtil
-
Get the English form of an inflection type
- getInstance() - Static method in class org.apache.lucene.analysis.ja.completion.KatakanaRomanizer
-
Returns the singleton instance of KatakanaRomanizer
- getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.ConnectionCosts
- getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
- getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
- getMorphAttributes() - Method in class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
- getMorphAttributes() - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
- getMorphAttributes() - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
- getPartOfSpeech() - Method in class org.apache.lucene.analysis.ja.Token
- getPartOfSpeech() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttribute
- getPartOfSpeech() - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
- getPartOfSpeech(int) - Method in interface org.apache.lucene.analysis.ja.dict.JaMorphData
-
Get Part-Of-Speech of tokens
- getPOSTranslation(String) - Static method in class org.apache.lucene.analysis.ja.dict.ToStringUtil
-
Get the English form of a POS tag
- getPronunciation() - Method in class org.apache.lucene.analysis.ja.Token
- getPronunciation() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.ReadingAttribute
- getPronunciation() - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
- getPronunciation(int, char[], int, int) - Method in interface org.apache.lucene.analysis.ja.dict.JaMorphData
-
Get pronunciation of tokens
- getReading() - Method in class org.apache.lucene.analysis.ja.Token
- getReading() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.ReadingAttribute
- getReading() - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
- getReading(int, char[], int, int) - Method in interface org.apache.lucene.analysis.ja.dict.JaMorphData
-
Get reading of tokens
- getRomanization(Appendable, CharSequence) - Static method in class org.apache.lucene.analysis.ja.dict.ToStringUtil
-
Romanize katakana with modified Hepburn
- getRomanization(String) - Static method in class org.apache.lucene.analysis.ja.dict.ToStringUtil
-
Romanize katakana with modified Hepburn
- GREEK - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
H
- HIRAGANA - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
I
- incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseBaseFormFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseHiraganaUppercaseFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseKatakanaUppercaseFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseReadingFormFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
- INDEX - Enum constant in enum class org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode
-
Simple romanization.
- InflectionAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for Kuromoji inflection data.
- InflectionAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for Kuromoji inflection data.
- InflectionAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
- initReader(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- initReader(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
- initReaderForNormalization(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- initReaderForNormalization(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
- INTERNAL_SEPARATOR - Static variable in class org.apache.lucene.analysis.ja.dict.UserDictionary
- IPADIC - Enum constant in enum class org.apache.lucene.analysis.ja.dict.DictionaryBuilder.DictionaryFormat
-
IPADIC format
- isArabicNumeral(char) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Arabic numeral predicate.
- isFullWidthLowercaseAlphabet(char) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils
-
Checks if a char is a full-width lowercase alphabet
- isKana(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils
-
Checks if a char sequence is composed only of Katakana or hiragana
- isKanji(char) - Method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- isKatakanaOrHWAlphabets(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils
-
Checks if a char sequence is composed only of Katakana or lowercase alphabets
- isKnown() - Method in class org.apache.lucene.analysis.ja.Token
-
Returns true if this token is a known word
- isLowercaseAlphabets(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils
-
Checks if a char sequence is composed only of lowercase alphabets
- isNumeral(char) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Numeral predicate
- isNumeral(String) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Numeral predicate
- isNumeralPunctuation(char) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Numeral punctuation predicate
- isNumeralPunctuation(String) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Numeral punctuation predicate
- isUnknown() - Method in class org.apache.lucene.analysis.ja.Token
-
Returns true if this token is an unknown word
- isUser() - Method in class org.apache.lucene.analysis.ja.Token
-
Returns true if this token is defined in the user dictionary
J
- JaMorphData - Interface in org.apache.lucene.analysis.ja.dict
-
Represents Japanese morphological information.
- JapaneseAnalyzer - Class in org.apache.lucene.analysis.ja
-
Analyzer for Japanese that uses morphological analysis.
- JapaneseAnalyzer() - Constructor for class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- JapaneseAnalyzer(UserDictionary, JapaneseTokenizer.Mode, CharArraySet, Set<String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- JapaneseBaseFormFilter - Class in org.apache.lucene.analysis.ja
-
Replaces term text with the BaseFormAttribute.
- JapaneseBaseFormFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseBaseFormFilter
- JapaneseBaseFormFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for JapaneseBaseFormFilter.
- JapaneseBaseFormFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory
-
Default ctor for compatibility with SPI
- JapaneseBaseFormFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory
-
Creates a new JapaneseBaseFormFilterFactory
- JapaneseCompletionAnalyzer - Class in org.apache.lucene.analysis.ja
-
Analyzer for Japanese completion suggester.
- JapaneseCompletionAnalyzer() - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
-
Creates a new JapaneseCompletionAnalyzer with default configurations
- JapaneseCompletionAnalyzer(UserDictionary, JapaneseCompletionFilter.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
-
Creates a new JapaneseCompletionAnalyzer
- JapaneseCompletionFilter - Class in org.apache.lucene.analysis.ja
-
A TokenFilter that adds Japanese romanized tokens to the term attribute.
- JapaneseCompletionFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
-
Creates a new JapaneseCompletionFilter with default configurations
- JapaneseCompletionFilter(TokenStream, JapaneseCompletionFilter.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
-
Creates a new JapaneseCompletionFilter
- JapaneseCompletionFilter.Mode - Enum Class in org.apache.lucene.analysis.ja
-
Completion mode
- JapaneseCompletionFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for JapaneseCompletionFilter.
- JapaneseCompletionFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory
-
Default ctor for compatibility with SPI
- JapaneseCompletionFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory
-
Creates a new JapaneseCompletionFilterFactory
- JapaneseHiraganaUppercaseFilter - Class in org.apache.lucene.analysis.ja
-
A TokenFilter that normalizes small letters (捨て仮名) in hiragana into normal letters.
- JapaneseHiraganaUppercaseFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseHiraganaUppercaseFilter
- JapaneseHiraganaUppercaseFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for JapaneseHiraganaUppercaseFilter.
- JapaneseHiraganaUppercaseFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseHiraganaUppercaseFilterFactory
-
Default ctor for compatibility with SPI
- JapaneseHiraganaUppercaseFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseHiraganaUppercaseFilterFactory
- JapaneseIterationMarkCharFilter - Class in org.apache.lucene.analysis.ja
-
Normalizes Japanese horizontal iteration marks (odoriji) to their expanded form.
- JapaneseIterationMarkCharFilter(Reader) - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
-
Constructor.
- JapaneseIterationMarkCharFilter(Reader, boolean, boolean) - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
-
Constructor
- JapaneseIterationMarkCharFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for JapaneseIterationMarkCharFilter.
- JapaneseIterationMarkCharFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
-
Default ctor for compatibility with SPI
- JapaneseIterationMarkCharFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
-
Creates a new JapaneseIterationMarkCharFilterFactory
- JapaneseKatakanaStemFilter - Class in org.apache.lucene.analysis.ja
-
A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC).
- JapaneseKatakanaStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
- JapaneseKatakanaStemFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
- JapaneseKatakanaStemFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for JapaneseKatakanaStemFilter.
- JapaneseKatakanaStemFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory
-
Default ctor for compatibility with SPI
- JapaneseKatakanaStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory
-
Creates a new JapaneseKatakanaStemFilterFactory
- JapaneseKatakanaUppercaseFilter - Class in org.apache.lucene.analysis.ja
-
A TokenFilter that normalizes small letters (捨て仮名) in katakana into normal letters.
- JapaneseKatakanaUppercaseFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaUppercaseFilter
- JapaneseKatakanaUppercaseFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for JapaneseKatakanaUppercaseFilter.
- JapaneseKatakanaUppercaseFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaUppercaseFilterFactory
-
Default ctor for compatibility with SPI
- JapaneseKatakanaUppercaseFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaUppercaseFilterFactory
- JapaneseNumberFilter - Class in org.apache.lucene.analysis.ja
-
A TokenFilter that normalizes Japanese numbers (kansūji) to regular Arabic decimal numbers in half-width characters.
- JapaneseNumberFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilter
- JapaneseNumberFilter.NumberBuffer - Class in org.apache.lucene.analysis.ja
-
Buffer that holds a Japanese number string and a position index used as a parsed-to marker
- JapaneseNumberFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for JapaneseNumberFilter.
- JapaneseNumberFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory
-
Default ctor for compatibility with SPI
- JapaneseNumberFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory
- JapanesePartOfSpeechStopFilter - Class in org.apache.lucene.analysis.ja
-
Removes tokens that match a set of part-of-speech tags.
- JapanesePartOfSpeechStopFilter(TokenStream, Set<String>) - Constructor for class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilter
-
Create a new JapanesePartOfSpeechStopFilter.
- JapanesePartOfSpeechStopFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for JapanesePartOfSpeechStopFilter.
- JapanesePartOfSpeechStopFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
-
Default ctor for compatibility with SPI
- JapanesePartOfSpeechStopFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
-
Creates a new JapanesePartOfSpeechStopFilterFactory
- JapaneseReadingFormFilter - Class in org.apache.lucene.analysis.ja
-
A TokenFilter that replaces the term attribute with the reading of a token in either katakana or romaji form.
- JapaneseReadingFormFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilter
- JapaneseReadingFormFilter(TokenStream, boolean) - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilter
- JapaneseReadingFormFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for JapaneseReadingFormFilter.
- JapaneseReadingFormFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory
-
Default ctor for compatibility with SPI
- JapaneseReadingFormFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory
-
Creates a new JapaneseReadingFormFilterFactory
- JapaneseTokenizer - Class in org.apache.lucene.analysis.ja
-
Tokenizer for Japanese that uses morphological analysis.
- JapaneseTokenizer(UserDictionary, boolean, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer
-
Create a new JapaneseTokenizer.
- JapaneseTokenizer(UserDictionary, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer
-
Create a new JapaneseTokenizer.
- JapaneseTokenizer(AttributeFactory, TokenInfoDictionary, UnknownDictionary, ConnectionCosts, UserDictionary, boolean, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer
-
Create a new JapaneseTokenizer, supplying a custom system dictionary and unknown dictionary.
- JapaneseTokenizer(AttributeFactory, UserDictionary, boolean, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer
-
Create a new JapaneseTokenizer using the system and unknown dictionaries shipped with Lucene.
- JapaneseTokenizer(AttributeFactory, UserDictionary, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer
-
Create a new JapaneseTokenizer using the system and unknown dictionaries shipped with Lucene.
- JapaneseTokenizer.Mode - Enum Class in org.apache.lucene.analysis.ja
-
Tokenization mode: this determines how the tokenizer handles compound and unknown words.
- JapaneseTokenizerFactory - Class in org.apache.lucene.analysis.ja
-
Factory for JapaneseTokenizer.
- JapaneseTokenizerFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
-
Default ctor for compatibility with SPI
- JapaneseTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
-
Creates a new JapaneseTokenizerFactory
K
- KANJI - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- KANJINUMERIC - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- KATAKANA - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- KatakanaRomanizer - Class in org.apache.lucene.analysis.ja.completion
-
Converts a Katakana string to Romaji using the pre-defined Katakana-Romaji mapping rules.
L
- length() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
- lookup(char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
- lookup(char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
-
Lookup words in text
- lookupCharacterClass(String) - Static method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- lookupSegmentation(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
M
- main(String[]) - Static method in class org.apache.lucene.analysis.ja.dict.DictionaryBuilder
N
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseHiraganaUppercaseFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseKatakanaUppercaseFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
-
SPI name
- NGRAM - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- NORMAL - Enum constant in enum class org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode
-
Ordinary segmentation: no decomposition for compounds,
- normalize(Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- NORMALIZE_KANA_DEFAULT - Static variable in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
-
Normalize kana iteration marks by default
- NORMALIZE_KANJI_DEFAULT - Static variable in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
-
Normalize kanji iteration marks by default
- normalizeNumber(String) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Normalizes a Japanese number
- NumberBuffer(String) - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
- NUMERIC - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
O
- open(Reader) - Static method in class org.apache.lucene.analysis.ja.dict.UserDictionary
- org.apache.lucene.analysis.ja - package org.apache.lucene.analysis.ja
-
Analyzer for Japanese.
- org.apache.lucene.analysis.ja.completion - package org.apache.lucene.analysis.ja.completion
-
Utilities for JapaneseCompletionFilter
- org.apache.lucene.analysis.ja.dict - package org.apache.lucene.analysis.ja.dict
-
Kuromoji dictionary implementation.
- org.apache.lucene.analysis.ja.tokenattributes - package org.apache.lucene.analysis.ja.tokenattributes
-
Additional Kuromoji-specific Attributes for text analysis.
P
- parseLargeKanjiNumeral(JapaneseNumberFilter.NumberBuffer) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Parse large kanji numerals (ten thousands or larger)
- parseMediumKanjiNumeral(JapaneseNumberFilter.NumberBuffer) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Parse medium kanji numerals (tens, hundreds or thousands)
- PartOfSpeechAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for Token.getPartOfSpeech().
- PartOfSpeechAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for Token.getPartOfSpeech().
- PartOfSpeechAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
- position() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
Q
- QUERY - Enum constant in enum class org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode
-
Input Method aware romanization.
R
- read() - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
- read(char[], int, int) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
- ReadingAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for Kuromoji reading data
- ReadingAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for Kuromoji reading data
- ReadingAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
- reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
- reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
- reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
- reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
- reset() - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
- reset() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
- reset() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
- romanize(CharsRef) - Method in class org.apache.lucene.analysis.ja.completion.KatakanaRomanizer
-
Translates a sequence of katakana to romaji.
S
- SEARCH - Enum constant in enum class org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode
-
Segmentation geared towards search: this includes a decompounding process for long nouns, also including the full compound token as a synonym.
- setGraphvizFormatter(GraphvizFormatter<JaMorphData>) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
-
Expert: set this to produce graphviz (dot) output of the Viterbi lattice
- setNBestCost(int) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
- setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttribute
- setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
- setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.InflectionAttribute
- setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
- setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttribute
- setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
- setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.ReadingAttribute
- setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
- SPACE - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- SYMBOL - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
T
- toKatakana(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils
-
Convert all hiragana in a string into Katakana
- Token - Class in org.apache.lucene.analysis.ja
-
Analyzed token with morphological data from its dictionary.
- Token(char[], int, int, int, int, int, TokenType, JaMorphData) - Constructor for class org.apache.lucene.analysis.ja.Token
- TokenInfoDictionary - Class in org.apache.lucene.analysis.ja.dict
-
Binary dictionary implementation for a known-word dictionary model: Words are encoded into an FST mapping to a list of wordIDs.
- TokenInfoDictionary(URL, URL, URL, URL) - Constructor for class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
-
Create a TokenInfoDictionary from an external resource URL (e.g.
- TokenInfoDictionary(Path, Path, Path, Path) - Constructor for class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
-
Create a TokenInfoDictionary from an external resource path.
- TokenInfoFST - Class in org.apache.lucene.analysis.ja.dict
-
Thin wrapper around an FST with root-arc caching for Japanese.
- TokenInfoFST(FST<Long>, boolean) - Constructor for class org.apache.lucene.analysis.ja.dict.TokenInfoFST
- toString() - Method in class org.apache.lucene.analysis.ja.Token
- ToStringUtil - Class in org.apache.lucene.analysis.ja.dict
-
Utility class for English translations of morphological data, used only for debugging.
- ToStringUtil() - Constructor for class org.apache.lucene.analysis.ja.dict.ToStringUtil
U
- UNIDIC - Enum constant in enum class org.apache.lucene.analysis.ja.dict.DictionaryBuilder.DictionaryFormat
-
UNIDIC format
- UnknownDictionary - Class in org.apache.lucene.analysis.ja.dict
-
Dictionary for unknown-word handling.
- UnknownDictionary(URL, URL, URL) - Constructor for class org.apache.lucene.analysis.ja.dict.UnknownDictionary
-
Create an UnknownDictionary from an external resource URL (e.g.
- UnknownDictionary(Path, Path, Path) - Constructor for class org.apache.lucene.analysis.ja.dict.UnknownDictionary
-
Create an UnknownDictionary from an external resource path.
- UserDictionary - Class in org.apache.lucene.analysis.ja.dict
-
Class for building a User Dictionary.
V
- valueOf(String) - Static method in enum class org.apache.lucene.analysis.ja.dict.DictionaryBuilder.DictionaryFormat
-
Returns the enum constant of this class with the specified name.
- valueOf(String) - Static method in enum class org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode
-
Returns the enum constant of this class with the specified name.
- valueOf(String) - Static method in enum class org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode
-
Returns the enum constant of this class with the specified name.
- values() - Static method in enum class org.apache.lucene.analysis.ja.dict.DictionaryBuilder.DictionaryFormat
-
Returns an array containing the constants of this enum class, in the order they are declared.
- values() - Static method in enum class org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode
-
Returns an array containing the constants of this enum class, in the order they are declared.
- values() - Static method in enum class org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode
-
Returns an array containing the constants of this enum class, in the order they are declared.