A B C D E F G H I J K L M N O P Q R S T U V W
All Classes All Packages
All Classes All Packages
All Classes All Packages
A
- accept() - Method in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilter
- advance() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
- ALPHA - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
B
- BaseFormAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for
Token.getBaseForm()
. - BaseFormAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for
Token.getBaseForm()
. - BaseFormAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
- BinaryDictionary - Class in org.apache.lucene.analysis.ja.dict
-
Base class for a binary-encoded in-memory dictionary.
- BinaryDictionary(IOSupplier<InputStream>, IOSupplier<InputStream>, IOSupplier<InputStream>) - Constructor for class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- BinaryDictionary.ResourceScheme - Enum in org.apache.lucene.analysis.ja.dict
-
Deprecated, for removal: This API element is subject to removal in a future version.
- build(DictionaryBuilder.DictionaryFormat, Path, Path, String, boolean) - Static method in class org.apache.lucene.analysis.ja.util.DictionaryBuilder
C
- calcNBestCost(String) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
- CharacterDefinition - Class in org.apache.lucene.analysis.ja.dict
-
Character category data.
- charAt(int) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
- CharSequenceUtils - Class in org.apache.lucene.analysis.ja.completion
-
Utility functions for
JapaneseCompletionFilter
- CLASS_COUNT - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- CLASSPATH - org.apache.lucene.analysis.ja.dict.BinaryDictionary.ResourceScheme
-
Deprecated.
- clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
- clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
- clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
- clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
- close() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
- ConnectionCosts - Class in org.apache.lucene.analysis.ja.dict
-
n-gram connection cost data
- ConnectionCosts(URL) - Constructor for class org.apache.lucene.analysis.ja.dict.ConnectionCosts
-
Create a
ConnectionCosts
from an external resource URL (e.g. - ConnectionCosts(Path) - Constructor for class org.apache.lucene.analysis.ja.dict.ConnectionCosts
-
Create a
ConnectionCosts
from an external resource path. - ConnectionCosts(BinaryDictionary.ResourceScheme, String) - Constructor for class org.apache.lucene.analysis.ja.dict.ConnectionCosts
-
Deprecated, for removal: This API element is subject to removal in a future version.replaced by
ConnectionCosts(Path)
for files andConnectionCosts(URL)
for classpath/module resources. - copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
- copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
- copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
- copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
- correct(int) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
- create(Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
- create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory
- create(AttributeFactory) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
- createComponents(String) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- createComponents(String) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
- CSVUtil - Class in org.apache.lucene.analysis.ja.util
-
Utility class for parsing CSV text
- CYRILLIC - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
D
- DEFAULT - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- DEFAULT_MINIMUM_LENGTH - Static variable in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
- DEFAULT_MODE - Static variable in class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
- DEFAULT_MODE - Static variable in class org.apache.lucene.analysis.ja.JapaneseTokenizer
-
Default tokenization mode.
- DICT_FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- DICT_HEADER - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- Dictionary - Interface in org.apache.lucene.analysis.ja.dict
-
Dictionary interface for retrieving morphological data by id.
- DictionaryBuilder - Class in org.apache.lucene.analysis.ja.util
-
Tool to build dictionaries.
- DictionaryBuilder.DictionaryFormat - Enum in org.apache.lucene.analysis.ja.util
-
Format of the dictionary.
E
- end() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
- EXTENDED - org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode
-
Extended mode outputs unigrams for unknown words.
F
- FILE - org.apache.lucene.analysis.ja.dict.BinaryDictionary.ResourceScheme
-
Deprecated.
- FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.ConnectionCosts
- findTargetArc(int, FST.Arc<Long>, FST.Arc<Long>, boolean, FST.BytesReader) - Method in class org.apache.lucene.analysis.ja.dict.TokenInfoFST
- finish() - Method in class org.apache.lucene.analysis.ja.GraphvizFormatter
- FST_FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
G
- get(int, int) - Method in class org.apache.lucene.analysis.ja.dict.ConnectionCosts
- getBaseForm() - Method in class org.apache.lucene.analysis.ja.Token
- getBaseForm() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttribute
- getBaseForm() - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
- getBaseForm(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- getBaseForm(int, char[], int, int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
-
Get base form of word
- getBaseForm(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
- getBytesReader() - Method in class org.apache.lucene.analysis.ja.dict.TokenInfoFST
- getCharacterClass(char) - Method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- getCharacterDefinition() - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- getDefaultStopTags() - Static method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- getFirstArc(FST.Arc<Long>) - Method in class org.apache.lucene.analysis.ja.dict.TokenInfoFST
- getFST() - Method in class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
- getFST() - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
- getInflectedFormTranslation(String) - Static method in class org.apache.lucene.analysis.ja.util.ToStringUtil
-
Get the english form of inflected form
- getInflectionForm() - Method in class org.apache.lucene.analysis.ja.Token
- getInflectionForm() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.InflectionAttribute
- getInflectionForm() - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
- getInflectionForm(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- getInflectionForm(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
-
Get inflection form of tokens
- getInflectionForm(int) - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
- getInflectionForm(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
- getInflectionType() - Method in class org.apache.lucene.analysis.ja.Token
- getInflectionType() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.InflectionAttribute
- getInflectionType() - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
- getInflectionType(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- getInflectionType(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
-
Get inflection type of tokens
- getInflectionType(int) - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
- getInflectionType(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
- getInflectionTypeTranslation(String) - Static method in class org.apache.lucene.analysis.ja.util.ToStringUtil
-
Get the english form of inflection type
- getInstance() - Static method in class org.apache.lucene.analysis.ja.completion.KatakanaRomanizer
-
Returns the singleton instance of
KatakanaRomenizer
- getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.ConnectionCosts
- getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
- getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
- getLeftId(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- getLeftId(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
-
Get left id of specified word
- getLeftId(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
- getLength() - Method in class org.apache.lucene.analysis.ja.Token
- getOffset() - Method in class org.apache.lucene.analysis.ja.Token
- getPartOfSpeech() - Method in class org.apache.lucene.analysis.ja.Token
- getPartOfSpeech() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttribute
- getPartOfSpeech() - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
- getPartOfSpeech(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- getPartOfSpeech(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
-
Get Part-Of-Speech of tokens
- getPartOfSpeech(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
- getPosition() - Method in class org.apache.lucene.analysis.ja.Token
-
Get index of this token in input text
- getPositionLength() - Method in class org.apache.lucene.analysis.ja.Token
-
Get the length (in tokens) of this token.
- getPOSTranslation(String) - Static method in class org.apache.lucene.analysis.ja.util.ToStringUtil
-
Get the english form of a POS tag
- getPronunciation() - Method in class org.apache.lucene.analysis.ja.Token
- getPronunciation() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.ReadingAttribute
- getPronunciation() - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
- getPronunciation(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- getPronunciation(int, char[], int, int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
-
Get pronunciation of tokens
- getPronunciation(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
- getReading() - Method in class org.apache.lucene.analysis.ja.Token
- getReading() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.ReadingAttribute
- getReading() - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
- getReading(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- getReading(int, char[], int, int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
-
Get reading of tokens
- getReading(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
- getReading(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
- getResource(BinaryDictionary.ResourceScheme, String) - Static method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
-
Deprecated, for removal: This API element is subject to removal in a future version.
- getRightId(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- getRightId(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
-
Get right id of specified word
- getRightId(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
- getRomanization(Appendable, CharSequence) - Static method in class org.apache.lucene.analysis.ja.util.ToStringUtil
-
Romanize katakana with modified hepburn
- getRomanization(String) - Static method in class org.apache.lucene.analysis.ja.util.ToStringUtil
-
Romanize katakana with modified hepburn
- getSurfaceForm() - Method in class org.apache.lucene.analysis.ja.Token
- getSurfaceFormString() - Method in class org.apache.lucene.analysis.ja.Token
- getType() - Method in class org.apache.lucene.analysis.ja.Token
-
Returns the type of this token
- getWordCost(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- getWordCost(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
-
Get word cost of specified word
- getWordCost(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
- GraphvizFormatter - Class in org.apache.lucene.analysis.ja
-
Outputs the dot (graphviz) string for the viterbi lattice.
- GraphvizFormatter(ConnectionCosts) - Constructor for class org.apache.lucene.analysis.ja.GraphvizFormatter
- GREEK - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
H
- HAS_BASEFORM - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
-
flag that the entry has baseform data.
- HAS_PRONUNCIATION - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
-
flag that the entry has pronunciation data.
- HAS_READING - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
-
flag that the entry has reading data.
- HEADER - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- HEADER - Static variable in class org.apache.lucene.analysis.ja.dict.ConnectionCosts
- HIRAGANA - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
I
- incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseBaseFormFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseReadingFormFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
- INDEX - org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode
-
Simple romanization.
- InflectionAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for Kuromoji inflection data.
- InflectionAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for Kuromoji inflection data.
- InflectionAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
- inform(ResourceLoader) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
- initReader(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- initReader(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
- initReaderForNormalization(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- initReaderForNormalization(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
- INTERNAL_SEPARATOR - Static variable in interface org.apache.lucene.analysis.ja.dict.Dictionary
- IPADIC - org.apache.lucene.analysis.ja.util.DictionaryBuilder.DictionaryFormat
-
IPADIC format
- isArabicNumeral(char) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Arabic numeral predicate.
- isFullWidthLowercaseAlphabet(char) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils
-
Checks if a char is a full-width lowercase alphabet
- isGroup(char) - Method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- isInvoke(char) - Method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- isKana(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils
-
Checks if a char sequence is composed only of Katakana or hiragana
- isKanji(char) - Method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- isKatakanaOrHWAlphabets(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils
-
Checks if a char sequence is composed only of Katakana or lowercase alphabets
- isKnown() - Method in class org.apache.lucene.analysis.ja.Token
-
Returns true if this token is known word
- isLowercaseAlphabets(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils
-
Checks if a char sequence is composed only of lowercase alphabets
- isNumeral(char) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Numeral predicate
- isNumeral(String) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Numeral predicate
- isNumeralPunctuation(char) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Numeral punctuation predicate
- isNumeralPunctuation(String) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Numeral punctuation predicate
- isUnknown() - Method in class org.apache.lucene.analysis.ja.Token
-
Returns true if this token is unknown word
- isUser() - Method in class org.apache.lucene.analysis.ja.Token
-
Returns true if this token is defined in user dictionary
J
- JapaneseAnalyzer - Class in org.apache.lucene.analysis.ja
-
Analyzer for Japanese that uses morphological analysis.
- JapaneseAnalyzer() - Constructor for class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- JapaneseAnalyzer(UserDictionary, JapaneseTokenizer.Mode, CharArraySet, Set<String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- JapaneseBaseFormFilter - Class in org.apache.lucene.analysis.ja
-
Replaces term text with the
BaseFormAttribute
. - JapaneseBaseFormFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseBaseFormFilter
- JapaneseBaseFormFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for
JapaneseBaseFormFilter
. - JapaneseBaseFormFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory
-
Default ctor for compatibility with SPI
- JapaneseBaseFormFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory
-
Creates a new JapaneseBaseFormFilterFactory
- JapaneseCompletionAnalyzer - Class in org.apache.lucene.analysis.ja
-
Analyzer for Japanese completion suggester.
- JapaneseCompletionAnalyzer() - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
-
Creates a new
JapaneseCompletionAnalyzer
with default configurations - JapaneseCompletionAnalyzer(UserDictionary, JapaneseCompletionFilter.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
-
Creates a new
JapaneseCompletionAnalyzer
- JapaneseCompletionFilter - Class in org.apache.lucene.analysis.ja
-
A
TokenFilter
that adds Japanese romanized tokens to the term attribute. - JapaneseCompletionFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
-
Creates a new
JapaneseCompletionFilter
with default configurations - JapaneseCompletionFilter(TokenStream, JapaneseCompletionFilter.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
-
Creates a new
JapaneseCompletionFilter
- JapaneseCompletionFilter.Mode - Enum in org.apache.lucene.analysis.ja
-
Completion mode
- JapaneseCompletionFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for
JapaneseCompletionFilter
. - JapaneseCompletionFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory
-
Default ctor for compatibility with SPI
- JapaneseCompletionFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory
-
Creates a new
JapaneseCompletionFilterFactory
- JapaneseIterationMarkCharFilter - Class in org.apache.lucene.analysis.ja
-
Normalizes Japanese horizontal iteration marks (odoriji) to their expanded form.
- JapaneseIterationMarkCharFilter(Reader) - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
-
Constructor.
- JapaneseIterationMarkCharFilter(Reader, boolean, boolean) - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
-
Constructor
- JapaneseIterationMarkCharFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for
JapaneseIterationMarkCharFilter
. - JapaneseIterationMarkCharFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
-
Default ctor for compatibility with SPI
- JapaneseIterationMarkCharFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
-
Creates a new JapaneseIterationMarkCharFilterFactory
- JapaneseKatakanaStemFilter - Class in org.apache.lucene.analysis.ja
-
A
TokenFilter
that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC). - JapaneseKatakanaStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
- JapaneseKatakanaStemFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
- JapaneseKatakanaStemFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for
JapaneseKatakanaStemFilter
. - JapaneseKatakanaStemFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory
-
Default ctor for compatibility with SPI
- JapaneseKatakanaStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory
-
Creates a new JapaneseKatakanaStemFilterFactory
- JapaneseNumberFilter - Class in org.apache.lucene.analysis.ja
-
A
TokenFilter
that normalizes Japanese numbers (kansūji) to regular Arabic decimal numbers in half-width characters. - JapaneseNumberFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilter
- JapaneseNumberFilter.NumberBuffer - Class in org.apache.lucene.analysis.ja
-
Buffer that holds a Japanese number string and a position index used as a parsed-to marker
- JapaneseNumberFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for
JapaneseNumberFilter
. - JapaneseNumberFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory
-
Default ctor for compatibility with SPI
- JapaneseNumberFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory
- JapanesePartOfSpeechStopFilter - Class in org.apache.lucene.analysis.ja
-
Removes tokens that match a set of part-of-speech tags.
- JapanesePartOfSpeechStopFilter(TokenStream, Set<String>) - Constructor for class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilter
-
Create a new
JapanesePartOfSpeechStopFilter
. - JapanesePartOfSpeechStopFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for
JapanesePartOfSpeechStopFilter
. - JapanesePartOfSpeechStopFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
-
Default ctor for compatibility with SPI
- JapanesePartOfSpeechStopFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
-
Creates a new JapanesePartOfSpeechStopFilterFactory
- JapaneseReadingFormFilter - Class in org.apache.lucene.analysis.ja
-
A
TokenFilter
that replaces the term attribute with the reading of a token in either katakana or romaji form. - JapaneseReadingFormFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilter
- JapaneseReadingFormFilter(TokenStream, boolean) - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilter
- JapaneseReadingFormFilterFactory - Class in org.apache.lucene.analysis.ja
-
Factory for
JapaneseReadingFormFilter
. - JapaneseReadingFormFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory
-
Default ctor for compatibility with SPI
- JapaneseReadingFormFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory
-
Creates a new JapaneseReadingFormFilterFactory
- JapaneseTokenizer - Class in org.apache.lucene.analysis.ja
-
Tokenizer for Japanese that uses morphological analysis.
- JapaneseTokenizer(UserDictionary, boolean, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer
-
Create a new JapaneseTokenizer.
- JapaneseTokenizer(UserDictionary, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer
-
Create a new JapaneseTokenizer.
- JapaneseTokenizer(AttributeFactory, TokenInfoDictionary, UnknownDictionary, ConnectionCosts, UserDictionary, boolean, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer
-
Create a new JapaneseTokenizer, supplying a custom system dictionary and unknown dictionary.
- JapaneseTokenizer(AttributeFactory, UserDictionary, boolean, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer
-
Create a new JapaneseTokenizer using the system and unknown dictionaries shipped with Lucene.
- JapaneseTokenizer(AttributeFactory, UserDictionary, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer
-
Create a new JapaneseTokenizer using the system and unknown dictionaries shipped with Lucene.
- JapaneseTokenizer.Mode - Enum in org.apache.lucene.analysis.ja
-
Tokenization mode: this determines how the tokenizer handles compound and unknown words.
- JapaneseTokenizer.Type - Enum in org.apache.lucene.analysis.ja
-
Token type reflecting the original source of this token
- JapaneseTokenizerFactory - Class in org.apache.lucene.analysis.ja
-
Factory for
JapaneseTokenizer
. - JapaneseTokenizerFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
-
Default ctor for compatibility with SPI
- JapaneseTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
-
Creates a new JapaneseTokenizerFactory
K
- KANJI - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- KANJINUMERIC - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- KATAKANA - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- KatakanaRomanizer - Class in org.apache.lucene.analysis.ja.completion
-
Converts a Katakana string to Romaji using the pre-defined Katakana-Romaji mapping rules.
- KNOWN - org.apache.lucene.analysis.ja.JapaneseTokenizer.Type
-
Known words from the system dictionary.
L
- LEFT_ID - Static variable in class org.apache.lucene.analysis.ja.dict.UserDictionary
- length() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
- lookup(char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
- lookup(char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
-
Lookup words in text
- lookupCharacterClass(String) - Static method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- lookupSegmentation(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
- lookupWordIds(int, IntsRef) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
M
- main(String[]) - Static method in class org.apache.lucene.analysis.ja.util.DictionaryBuilder
N
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory
-
SPI name
- NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
-
SPI name
- NGRAM - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- NO_OUTPUT - Variable in class org.apache.lucene.analysis.ja.dict.TokenInfoFST
- NORMAL - org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode
-
Ordinary segmentation: no decomposition for compounds,
- normalize(Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
- normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
- NORMALIZE_KANA_DEFAULT - Static variable in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
-
Normalize kana iteration marks by default
- NORMALIZE_KANJI_DEFAULT - Static variable in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
-
Normalize kanji iteration marks by default
- normalizeNumber(String) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Normalizes a Japanese number
- NumberBuffer(String) - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
- NUMERIC - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
O
- open(Reader) - Static method in class org.apache.lucene.analysis.ja.dict.UserDictionary
- org.apache.lucene.analysis.ja - package org.apache.lucene.analysis.ja
-
Analyzer for Japanese.
- org.apache.lucene.analysis.ja.completion - package org.apache.lucene.analysis.ja.completion
-
Utilities for
JapaneseCompletionFilter
- org.apache.lucene.analysis.ja.dict - package org.apache.lucene.analysis.ja.dict
-
Kuromoji dictionary implementation.
- org.apache.lucene.analysis.ja.tokenattributes - package org.apache.lucene.analysis.ja.tokenattributes
-
Additional Kuromoji-specific Attributes for text analysis.
- org.apache.lucene.analysis.ja.util - package org.apache.lucene.analysis.ja.util
-
Kuromoji utility classes.
P
- parse(String) - Static method in class org.apache.lucene.analysis.ja.util.CSVUtil
-
Parse CSV line
- parseLargeKanjiNumeral(JapaneseNumberFilter.NumberBuffer) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Parse large kanji numerals (ten thousands or larger)
- parseMediumKanjiNumeral(JapaneseNumberFilter.NumberBuffer) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
-
Parse medium kanji numerals (tens, hundreds or thousands)
- PartOfSpeechAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for
Token.getPartOfSpeech()
. - PartOfSpeechAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for
Token.getPartOfSpeech()
. - PartOfSpeechAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
- POSDICT_FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- POSDICT_HEADER - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- position() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
Q
- QUERY - org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode
-
Input Method aware romanization.
- quoteEscape(String) - Static method in class org.apache.lucene.analysis.ja.util.CSVUtil
-
Quote and escape input value for CSV
R
- read() - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
- read(char[], int, int) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
- ReadingAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for Kuromoji reading data
- ReadingAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes
-
Attribute for Kuromoji reading data
- ReadingAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
- reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
- reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
- reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
- reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
- reset() - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
- reset() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
- reset() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
- RIGHT_ID - Static variable in class org.apache.lucene.analysis.ja.dict.UserDictionary
- romanize(CharsRef) - Method in class org.apache.lucene.analysis.ja.completion.KatakanaRomanizer
-
Translates a sequence of katakana to romaji.
S
- SEARCH - org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode
-
Segmentation geared towards search: this includes a decompounding process for long nouns, also including the full compound token as a synonym.
- setGraphvizFormatter(GraphvizFormatter) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
-
Expert: set this to produce graphviz (dot) output of the Viterbi lattice
- setNBestCost(int) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
- setPositionLength(int) - Method in class org.apache.lucene.analysis.ja.Token
-
Set the position length (in tokens) of this token.
- setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttribute
- setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
- setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.InflectionAttribute
- setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
- setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttribute
- setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
- setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.ReadingAttribute
- setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
- SPACE - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- SYMBOL - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
T
- TARGETMAP_FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- TARGETMAP_HEADER - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- toKatakana(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils
-
Convert all hiragana in a string into kanataka
- Token - Class in org.apache.lucene.analysis.ja
-
Analyzed token with morphological data from its dictionary.
- Token(int, char[], int, int, JapaneseTokenizer.Type, int, Dictionary) - Constructor for class org.apache.lucene.analysis.ja.Token
- TokenInfoDictionary - Class in org.apache.lucene.analysis.ja.dict
-
Binary dictionary implementation for a known-word dictionary model: Words are encoded into an FST mapping to a list of wordIDs.
- TokenInfoDictionary(URL, URL, URL, URL) - Constructor for class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
-
Create a
TokenInfoDictionary
from an external resource URL (e.g. - TokenInfoDictionary(Path, Path, Path, Path) - Constructor for class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
-
Create a
TokenInfoDictionary
from an external resource path. - TokenInfoDictionary(BinaryDictionary.ResourceScheme, String) - Constructor for class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
-
Deprecated, for removal: This API element is subject to removal in a future version.replaced by
TokenInfoDictionary(Path, Path, Path, Path)
for files andTokenInfoDictionary(URL, URL, URL, URL)
for classpath/module resources - TokenInfoFST - Class in org.apache.lucene.analysis.ja.dict
-
Thin wrapper around an FST with root-arc caching for Japanese.
- TokenInfoFST(FST<Long>, boolean) - Constructor for class org.apache.lucene.analysis.ja.dict.TokenInfoFST
- toString() - Method in class org.apache.lucene.analysis.ja.Token
- ToStringUtil - Class in org.apache.lucene.analysis.ja.util
-
Utility class for english translations of morphological data, used only for debugging.
- ToStringUtil() - Constructor for class org.apache.lucene.analysis.ja.util.ToStringUtil
U
- UNIDIC - org.apache.lucene.analysis.ja.util.DictionaryBuilder.DictionaryFormat
-
UNIDIC format
- UNKNOWN - org.apache.lucene.analysis.ja.JapaneseTokenizer.Type
-
Unknown words (heuristically segmented).
- UnknownDictionary - Class in org.apache.lucene.analysis.ja.dict
-
Dictionary for unknown-word handling.
- UnknownDictionary(URL, URL, URL) - Constructor for class org.apache.lucene.analysis.ja.dict.UnknownDictionary
-
Create a
UnknownDictionary
from an external resource URL (e.g. - UnknownDictionary(Path, Path, Path) - Constructor for class org.apache.lucene.analysis.ja.dict.UnknownDictionary
-
Create a
UnknownDictionary
from an external resource path. - UnknownDictionary(BinaryDictionary.ResourceScheme, String) - Constructor for class org.apache.lucene.analysis.ja.dict.UnknownDictionary
-
Deprecated, for removal: This API element is subject to removal in a future version.replaced by
UnknownDictionary(Path, Path, Path)
for files andUnknownDictionary(URL, URL, URL)
for classpath/module resources - USER - org.apache.lucene.analysis.ja.JapaneseTokenizer.Type
-
Known words from the user dictionary.
- UserDictionary - Class in org.apache.lucene.analysis.ja.dict
-
Class for building a User Dictionary.
V
- valueOf(String) - Static method in enum org.apache.lucene.analysis.ja.dict.BinaryDictionary.ResourceScheme
-
Deprecated.Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.apache.lucene.analysis.ja.JapaneseTokenizer.Type
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.apache.lucene.analysis.ja.util.DictionaryBuilder.DictionaryFormat
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum org.apache.lucene.analysis.ja.dict.BinaryDictionary.ResourceScheme
-
Deprecated.Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.apache.lucene.analysis.ja.JapaneseTokenizer.Type
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.apache.lucene.analysis.ja.util.DictionaryBuilder.DictionaryFormat
-
Returns an array containing the constants of this enum type, in the order they are declared.
- VERSION - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
- VERSION - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
- VERSION - Static variable in class org.apache.lucene.analysis.ja.dict.ConnectionCosts
W
- WORD_COST - Static variable in class org.apache.lucene.analysis.ja.dict.UserDictionary
All Classes All Packages