A B C D E F G H I J K L M N O P Q R S T U V W 
All Classes All Packages

A

accept() - Method in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilter
 
advance() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
 
ALPHA - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 

B

BaseFormAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes
Attribute for Token.getBaseForm().
BaseFormAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes
Attribute for Token.getBaseForm().
BaseFormAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
 
BinaryDictionary - Class in org.apache.lucene.analysis.ja.dict
Base class for a binary-encoded in-memory dictionary.
BinaryDictionary(IOSupplier<InputStream>, IOSupplier<InputStream>, IOSupplier<InputStream>) - Constructor for class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
BinaryDictionary.ResourceScheme - Enum in org.apache.lucene.analysis.ja.dict
Deprecated, for removal: This API element is subject to removal in a future version.
build(DictionaryBuilder.DictionaryFormat, Path, Path, String, boolean) - Static method in class org.apache.lucene.analysis.ja.util.DictionaryBuilder
 

C

calcNBestCost(String) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
 
CharacterDefinition - Class in org.apache.lucene.analysis.ja.dict
Character category data.
charAt(int) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
 
CharSequenceUtils - Class in org.apache.lucene.analysis.ja.completion
Utility functions for JapaneseCompletionFilter
CLASS_COUNT - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
CLASSPATH - org.apache.lucene.analysis.ja.dict.BinaryDictionary.ResourceScheme
Deprecated.
 
clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
 
clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
 
clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
 
clear() - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
 
close() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
 
ConnectionCosts - Class in org.apache.lucene.analysis.ja.dict
n-gram connection cost data
ConnectionCosts(URL) - Constructor for class org.apache.lucene.analysis.ja.dict.ConnectionCosts
Create a ConnectionCosts from an external resource URL (e.g.
ConnectionCosts(Path) - Constructor for class org.apache.lucene.analysis.ja.dict.ConnectionCosts
Create a ConnectionCosts from an external resource path.
ConnectionCosts(BinaryDictionary.ResourceScheme, String) - Constructor for class org.apache.lucene.analysis.ja.dict.ConnectionCosts
Deprecated, for removal: This API element is subject to removal in a future version.
replaced by ConnectionCosts(Path) for files and ConnectionCosts(URL) for classpath/module resources.
copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
 
copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
 
copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
 
copyTo(AttributeImpl) - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
 
correct(int) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
 
create(Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
 
create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory
 
create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory
 
create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory
 
create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory
 
create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
 
create(TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory
 
create(AttributeFactory) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
 
createComponents(String) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
 
createComponents(String) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
 
CYRILLIC - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 

D

DEFAULT - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
DEFAULT_MINIMUM_LENGTH - Static variable in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
 
DEFAULT_MODE - Static variable in class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
 
DEFAULT_MODE - Static variable in class org.apache.lucene.analysis.ja.JapaneseTokenizer
Default tokenization mode.
DICT_FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
DICT_HEADER - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
Dictionary - Interface in org.apache.lucene.analysis.ja.dict
Dictionary interface for retrieving morphological data by id.
DictionaryBuilder - Class in org.apache.lucene.analysis.ja.util
Tool to build dictionaries.
DictionaryBuilder.DictionaryFormat - Enum in org.apache.lucene.analysis.ja.util
Format of the dictionary.

E

end() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
 
EXTENDED - org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode
Extended mode outputs unigrams for unknown words.

F

FILE - org.apache.lucene.analysis.ja.dict.BinaryDictionary.ResourceScheme
Deprecated.
 
FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.ConnectionCosts
 
findTargetArc(int, FST.Arc<Long>, FST.Arc<Long>, boolean, FST.BytesReader) - Method in class org.apache.lucene.analysis.ja.dict.TokenInfoFST
 
finish() - Method in class org.apache.lucene.analysis.ja.GraphvizFormatter
 
FST_FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
 

G

get(int, int) - Method in class org.apache.lucene.analysis.ja.dict.ConnectionCosts
 
getBaseForm() - Method in class org.apache.lucene.analysis.ja.Token
 
getBaseForm() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttribute
 
getBaseForm() - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
 
getBaseForm(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
getBaseForm(int, char[], int, int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
Get base form of word
getBaseForm(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
 
getBytesReader() - Method in class org.apache.lucene.analysis.ja.dict.TokenInfoFST
 
getCharacterClass(char) - Method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
getCharacterDefinition() - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
 
getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
 
getDefaultStopTags() - Static method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
 
getFirstArc(FST.Arc<Long>) - Method in class org.apache.lucene.analysis.ja.dict.TokenInfoFST
 
getFST() - Method in class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
 
getFST() - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
 
getInflectedFormTranslation(String) - Static method in class org.apache.lucene.analysis.ja.util.ToStringUtil
Get the english form of inflected form
getInflectionForm() - Method in class org.apache.lucene.analysis.ja.Token
 
getInflectionForm() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.InflectionAttribute
 
getInflectionForm() - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
 
getInflectionForm(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
getInflectionForm(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
Get inflection form of tokens
getInflectionForm(int) - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
 
getInflectionForm(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
 
getInflectionType() - Method in class org.apache.lucene.analysis.ja.Token
 
getInflectionType() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.InflectionAttribute
 
getInflectionType() - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
 
getInflectionType(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
getInflectionType(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
Get inflection type of tokens
getInflectionType(int) - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
 
getInflectionType(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
 
getInflectionTypeTranslation(String) - Static method in class org.apache.lucene.analysis.ja.util.ToStringUtil
Get the english form of inflection type
getInstance() - Static method in class org.apache.lucene.analysis.ja.completion.KatakanaRomanizer
Returns the singleton instance of KatakanaRomenizer
getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.ConnectionCosts
 
getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
 
getInstance() - Static method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
 
getLeftId(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
getLeftId(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
Get left id of specified word
getLeftId(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
 
getLength() - Method in class org.apache.lucene.analysis.ja.Token
 
getOffset() - Method in class org.apache.lucene.analysis.ja.Token
 
getPartOfSpeech() - Method in class org.apache.lucene.analysis.ja.Token
 
getPartOfSpeech() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttribute
 
getPartOfSpeech() - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
 
getPartOfSpeech(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
getPartOfSpeech(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
Get Part-Of-Speech of tokens
getPartOfSpeech(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
 
getPosition() - Method in class org.apache.lucene.analysis.ja.Token
Get index of this token in input text
getPositionLength() - Method in class org.apache.lucene.analysis.ja.Token
Get the length (in tokens) of this token.
getPOSTranslation(String) - Static method in class org.apache.lucene.analysis.ja.util.ToStringUtil
Get the english form of a POS tag
getPronunciation() - Method in class org.apache.lucene.analysis.ja.Token
 
getPronunciation() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.ReadingAttribute
 
getPronunciation() - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
 
getPronunciation(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
getPronunciation(int, char[], int, int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
Get pronunciation of tokens
getPronunciation(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
 
getReading() - Method in class org.apache.lucene.analysis.ja.Token
 
getReading() - Method in interface org.apache.lucene.analysis.ja.tokenattributes.ReadingAttribute
 
getReading() - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
 
getReading(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
getReading(int, char[], int, int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
Get reading of tokens
getReading(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
 
getReading(int, char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
 
getResource(BinaryDictionary.ResourceScheme, String) - Static method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
Deprecated, for removal: This API element is subject to removal in a future version.
getRightId(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
getRightId(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
Get right id of specified word
getRightId(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
 
getRomanization(Appendable, CharSequence) - Static method in class org.apache.lucene.analysis.ja.util.ToStringUtil
Romanize katakana with modified hepburn
getRomanization(String) - Static method in class org.apache.lucene.analysis.ja.util.ToStringUtil
Romanize katakana with modified hepburn
getSurfaceForm() - Method in class org.apache.lucene.analysis.ja.Token
 
getSurfaceFormString() - Method in class org.apache.lucene.analysis.ja.Token
 
getType() - Method in class org.apache.lucene.analysis.ja.Token
Returns the type of this token
getWordCost(int) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
getWordCost(int) - Method in interface org.apache.lucene.analysis.ja.dict.Dictionary
Get word cost of specified word
getWordCost(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
 
GraphvizFormatter - Class in org.apache.lucene.analysis.ja
Outputs the dot (graphviz) string for the viterbi lattice.
GraphvizFormatter(ConnectionCosts) - Constructor for class org.apache.lucene.analysis.ja.GraphvizFormatter
 
GREEK - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 

H

HAS_BASEFORM - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
flag that the entry has baseform data.
HAS_PRONUNCIATION - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
flag that the entry has pronunciation data.
HAS_READING - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
flag that the entry has reading data.
HEADER - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
HEADER - Static variable in class org.apache.lucene.analysis.ja.dict.ConnectionCosts
 
HIRAGANA - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 

I

incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseBaseFormFilter
 
incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
 
incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
 
incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
 
incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseReadingFormFilter
 
incrementToken() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
 
INDEX - org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode
Simple romanization.
InflectionAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes
Attribute for Kuromoji inflection data.
InflectionAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes
Attribute for Kuromoji inflection data.
InflectionAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
 
inform(ResourceLoader) - Method in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
 
inform(ResourceLoader) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
 
initReader(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
 
initReader(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
 
initReaderForNormalization(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
 
initReaderForNormalization(String, Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
 
INTERNAL_SEPARATOR - Static variable in interface org.apache.lucene.analysis.ja.dict.Dictionary
 
IPADIC - org.apache.lucene.analysis.ja.util.DictionaryBuilder.DictionaryFormat
IPADIC format
isArabicNumeral(char) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
Arabic numeral predicate.
isFullWidthLowercaseAlphabet(char) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils
Checks if a char is a full-width lowercase alphabet
isGroup(char) - Method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
isInvoke(char) - Method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
isKana(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils
Checks if a char sequence is composed only of Katakana or hiragana
isKanji(char) - Method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
isKatakanaOrHWAlphabets(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils
Checks if a char sequence is composed only of Katakana or lowercase alphabets
isKnown() - Method in class org.apache.lucene.analysis.ja.Token
Returns true if this token is known word
isLowercaseAlphabets(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils
Checks if a char sequence is composed only of lowercase alphabets
isNumeral(char) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
Numeral predicate
isNumeral(String) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
Numeral predicate
isNumeralPunctuation(char) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
Numeral punctuation predicate
isNumeralPunctuation(String) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
Numeral punctuation predicate
isUnknown() - Method in class org.apache.lucene.analysis.ja.Token
Returns true if this token is unknown word
isUser() - Method in class org.apache.lucene.analysis.ja.Token
Returns true if this token is defined in user dictionary

J

JapaneseAnalyzer - Class in org.apache.lucene.analysis.ja
Analyzer for Japanese that uses morphological analysis.
JapaneseAnalyzer() - Constructor for class org.apache.lucene.analysis.ja.JapaneseAnalyzer
 
JapaneseAnalyzer(UserDictionary, JapaneseTokenizer.Mode, CharArraySet, Set<String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseAnalyzer
 
JapaneseBaseFormFilter - Class in org.apache.lucene.analysis.ja
Replaces term text with the BaseFormAttribute.
JapaneseBaseFormFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseBaseFormFilter
 
JapaneseBaseFormFilterFactory - Class in org.apache.lucene.analysis.ja
JapaneseBaseFormFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory
Default ctor for compatibility with SPI
JapaneseBaseFormFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory
Creates a new JapaneseBaseFormFilterFactory
JapaneseCompletionAnalyzer - Class in org.apache.lucene.analysis.ja
Analyzer for Japanese completion suggester.
JapaneseCompletionAnalyzer() - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
Creates a new JapaneseCompletionAnalyzer with default configurations
JapaneseCompletionAnalyzer(UserDictionary, JapaneseCompletionFilter.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionAnalyzer
Creates a new JapaneseCompletionAnalyzer
JapaneseCompletionFilter - Class in org.apache.lucene.analysis.ja
A TokenFilter that adds Japanese romanized tokens to the term attribute.
JapaneseCompletionFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
Creates a new JapaneseCompletionFilter with default configurations
JapaneseCompletionFilter(TokenStream, JapaneseCompletionFilter.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
Creates a new JapaneseCompletionFilter
JapaneseCompletionFilter.Mode - Enum in org.apache.lucene.analysis.ja
Completion mode
JapaneseCompletionFilterFactory - Class in org.apache.lucene.analysis.ja
JapaneseCompletionFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory
Default ctor for compatibility with SPI
JapaneseCompletionFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory
Creates a new JapaneseCompletionFilterFactory
JapaneseIterationMarkCharFilter - Class in org.apache.lucene.analysis.ja
Normalizes Japanese horizontal iteration marks (odoriji) to their expanded form.
JapaneseIterationMarkCharFilter(Reader) - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
Constructor.
JapaneseIterationMarkCharFilter(Reader, boolean, boolean) - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
Constructor
JapaneseIterationMarkCharFilterFactory - Class in org.apache.lucene.analysis.ja
JapaneseIterationMarkCharFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
Default ctor for compatibility with SPI
JapaneseIterationMarkCharFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
Creates a new JapaneseIterationMarkCharFilterFactory
JapaneseKatakanaStemFilter - Class in org.apache.lucene.analysis.ja
A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC).
JapaneseKatakanaStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
 
JapaneseKatakanaStemFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilter
 
JapaneseKatakanaStemFilterFactory - Class in org.apache.lucene.analysis.ja
JapaneseKatakanaStemFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory
Default ctor for compatibility with SPI
JapaneseKatakanaStemFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory
Creates a new JapaneseKatakanaStemFilterFactory
JapaneseNumberFilter - Class in org.apache.lucene.analysis.ja
A TokenFilter that normalizes Japanese numbers (kansūji) to regular Arabic decimal numbers in half-width characters.
JapaneseNumberFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilter
 
JapaneseNumberFilter.NumberBuffer - Class in org.apache.lucene.analysis.ja
Buffer that holds a Japanese number string and a position index used as a parsed-to marker
JapaneseNumberFilterFactory - Class in org.apache.lucene.analysis.ja
JapaneseNumberFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory
Default ctor for compatibility with SPI
JapaneseNumberFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory
 
JapanesePartOfSpeechStopFilter - Class in org.apache.lucene.analysis.ja
Removes tokens that match a set of part-of-speech tags.
JapanesePartOfSpeechStopFilter(TokenStream, Set<String>) - Constructor for class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilter
JapanesePartOfSpeechStopFilterFactory - Class in org.apache.lucene.analysis.ja
JapanesePartOfSpeechStopFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
Default ctor for compatibility with SPI
JapanesePartOfSpeechStopFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
Creates a new JapanesePartOfSpeechStopFilterFactory
JapaneseReadingFormFilter - Class in org.apache.lucene.analysis.ja
A TokenFilter that replaces the term attribute with the reading of a token in either katakana or romaji form.
JapaneseReadingFormFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilter
 
JapaneseReadingFormFilter(TokenStream, boolean) - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilter
 
JapaneseReadingFormFilterFactory - Class in org.apache.lucene.analysis.ja
JapaneseReadingFormFilterFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory
Default ctor for compatibility with SPI
JapaneseReadingFormFilterFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory
Creates a new JapaneseReadingFormFilterFactory
JapaneseTokenizer - Class in org.apache.lucene.analysis.ja
Tokenizer for Japanese that uses morphological analysis.
JapaneseTokenizer(UserDictionary, boolean, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer
Create a new JapaneseTokenizer.
JapaneseTokenizer(UserDictionary, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer
Create a new JapaneseTokenizer.
JapaneseTokenizer(AttributeFactory, TokenInfoDictionary, UnknownDictionary, ConnectionCosts, UserDictionary, boolean, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer
Create a new JapaneseTokenizer, supplying a custom system dictionary and unknown dictionary.
JapaneseTokenizer(AttributeFactory, UserDictionary, boolean, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer
Create a new JapaneseTokenizer using the system and unknown dictionaries shipped with Lucene.
JapaneseTokenizer(AttributeFactory, UserDictionary, boolean, JapaneseTokenizer.Mode) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizer
Create a new JapaneseTokenizer using the system and unknown dictionaries shipped with Lucene.
JapaneseTokenizer.Mode - Enum in org.apache.lucene.analysis.ja
Tokenization mode: this determines how the tokenizer handles compound and unknown words.
JapaneseTokenizer.Type - Enum in org.apache.lucene.analysis.ja
Token type reflecting the original source of this token
JapaneseTokenizerFactory - Class in org.apache.lucene.analysis.ja
Factory for JapaneseTokenizer.
JapaneseTokenizerFactory() - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
Default ctor for compatibility with SPI
JapaneseTokenizerFactory(Map<String, String>) - Constructor for class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
Creates a new JapaneseTokenizerFactory

K

KANJI - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
KANJINUMERIC - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
KATAKANA - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
KatakanaRomanizer - Class in org.apache.lucene.analysis.ja.completion
Converts a Katakana string to Romaji using the pre-defined Katakana-Romaji mapping rules.
KNOWN - org.apache.lucene.analysis.ja.JapaneseTokenizer.Type
Known words from the system dictionary.

L

LEFT_ID - Static variable in class org.apache.lucene.analysis.ja.dict.UserDictionary
 
length() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
 
lookup(char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UnknownDictionary
 
lookup(char[], int, int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
Lookup words in text
lookupCharacterClass(String) - Static method in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
lookupSegmentation(int) - Method in class org.apache.lucene.analysis.ja.dict.UserDictionary
 
lookupWordIds(int, IntsRef) - Method in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 

M

main(String[]) - Static method in class org.apache.lucene.analysis.ja.util.DictionaryBuilder
 

N

NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseBaseFormFilterFactory
SPI name
NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseCompletionFilterFactory
SPI name
NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
SPI name
NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseKatakanaStemFilterFactory
SPI name
NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseNumberFilterFactory
SPI name
NAME - Static variable in class org.apache.lucene.analysis.ja.JapanesePartOfSpeechStopFilterFactory
 
NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseReadingFormFilterFactory
SPI name
NAME - Static variable in class org.apache.lucene.analysis.ja.JapaneseTokenizerFactory
SPI name
NGRAM - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
NO_OUTPUT - Variable in class org.apache.lucene.analysis.ja.dict.TokenInfoFST
 
NORMAL - org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode
Ordinary segmentation: no decomposition for compounds,
normalize(Reader) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilterFactory
 
normalize(String, TokenStream) - Method in class org.apache.lucene.analysis.ja.JapaneseAnalyzer
 
NORMALIZE_KANA_DEFAULT - Static variable in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
Normalize kana iteration marks by default
NORMALIZE_KANJI_DEFAULT - Static variable in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
Normalize kanji iteration marks by default
normalizeNumber(String) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
Normalizes a Japanese number
NumberBuffer(String) - Constructor for class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
 
NUMERIC - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 

O

open(Reader) - Static method in class org.apache.lucene.analysis.ja.dict.UserDictionary
 
org.apache.lucene.analysis.ja - package org.apache.lucene.analysis.ja
Analyzer for Japanese.
org.apache.lucene.analysis.ja.completion - package org.apache.lucene.analysis.ja.completion
org.apache.lucene.analysis.ja.dict - package org.apache.lucene.analysis.ja.dict
Kuromoji dictionary implementation.
org.apache.lucene.analysis.ja.tokenattributes - package org.apache.lucene.analysis.ja.tokenattributes
Additional Kuromoji-specific Attributes for text analysis.
org.apache.lucene.analysis.ja.util - package org.apache.lucene.analysis.ja.util
Kuromoji utility classes.

P

parseLargeKanjiNumeral(JapaneseNumberFilter.NumberBuffer) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
Parse large kanji numerals (ten thousands or larger)
parseMediumKanjiNumeral(JapaneseNumberFilter.NumberBuffer) - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
Parse medium kanji numerals (tens, hundreds or thousands)
PartOfSpeechAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes
PartOfSpeechAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes
PartOfSpeechAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
 
POSDICT_FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
POSDICT_HEADER - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
position() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter.NumberBuffer
 

Q

QUERY - org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode
Input Method aware romanization.

R

read() - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
 
read(char[], int, int) - Method in class org.apache.lucene.analysis.ja.JapaneseIterationMarkCharFilter
 
ReadingAttribute - Interface in org.apache.lucene.analysis.ja.tokenattributes
Attribute for Kuromoji reading data
ReadingAttributeImpl - Class in org.apache.lucene.analysis.ja.tokenattributes
Attribute for Kuromoji reading data
ReadingAttributeImpl() - Constructor for class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
 
reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
 
reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
 
reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
 
reflectWith(AttributeReflector) - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
 
reset() - Method in class org.apache.lucene.analysis.ja.JapaneseCompletionFilter
 
reset() - Method in class org.apache.lucene.analysis.ja.JapaneseNumberFilter
 
reset() - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
 
RIGHT_ID - Static variable in class org.apache.lucene.analysis.ja.dict.UserDictionary
 
romanize(CharsRef) - Method in class org.apache.lucene.analysis.ja.completion.KatakanaRomanizer
Translates a sequence of katakana to romaji.

S

SEARCH - org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode
Segmentation geared towards search: this includes a decompounding process for long nouns, also including the full compound token as a synonym.
setGraphvizFormatter(GraphvizFormatter) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
Expert: set this to produce graphviz (dot) output of the Viterbi lattice
setNBestCost(int) - Method in class org.apache.lucene.analysis.ja.JapaneseTokenizer
 
setPositionLength(int) - Method in class org.apache.lucene.analysis.ja.Token
Set the position length (in tokens) of this token.
setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttribute
 
setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.BaseFormAttributeImpl
 
setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.InflectionAttribute
 
setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.InflectionAttributeImpl
 
setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttribute
 
setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.PartOfSpeechAttributeImpl
 
setToken(Token) - Method in interface org.apache.lucene.analysis.ja.tokenattributes.ReadingAttribute
 
setToken(Token) - Method in class org.apache.lucene.analysis.ja.tokenattributes.ReadingAttributeImpl
 
SPACE - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
SYMBOL - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 

T

TARGETMAP_FILENAME_SUFFIX - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
TARGETMAP_HEADER - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
toKatakana(CharSequence) - Static method in class org.apache.lucene.analysis.ja.completion.CharSequenceUtils
Convert all hiragana in a string into kanataka
Token - Class in org.apache.lucene.analysis.ja
Analyzed token with morphological data from its dictionary.
Token(int, char[], int, int, JapaneseTokenizer.Type, int, Dictionary) - Constructor for class org.apache.lucene.analysis.ja.Token
 
TokenInfoDictionary - Class in org.apache.lucene.analysis.ja.dict
Binary dictionary implementation for a known-word dictionary model: Words are encoded into an FST mapping to a list of wordIDs.
TokenInfoDictionary(URL, URL, URL, URL) - Constructor for class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
Create a TokenInfoDictionary from an external resource URL (e.g.
TokenInfoDictionary(Path, Path, Path, Path) - Constructor for class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
Create a TokenInfoDictionary from an external resource path.
TokenInfoDictionary(BinaryDictionary.ResourceScheme, String) - Constructor for class org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
Deprecated, for removal: This API element is subject to removal in a future version.
replaced by TokenInfoDictionary(Path, Path, Path, Path) for files and TokenInfoDictionary(URL, URL, URL, URL) for classpath/module resources
TokenInfoFST - Class in org.apache.lucene.analysis.ja.dict
Thin wrapper around an FST with root-arc caching for Japanese.
TokenInfoFST(FST<Long>, boolean) - Constructor for class org.apache.lucene.analysis.ja.dict.TokenInfoFST
 
toString() - Method in class org.apache.lucene.analysis.ja.Token
 
ToStringUtil - Class in org.apache.lucene.analysis.ja.util
Utility class for english translations of morphological data, used only for debugging.
ToStringUtil() - Constructor for class org.apache.lucene.analysis.ja.util.ToStringUtil
 

U

UNIDIC - org.apache.lucene.analysis.ja.util.DictionaryBuilder.DictionaryFormat
UNIDIC format
UNKNOWN - org.apache.lucene.analysis.ja.JapaneseTokenizer.Type
Unknown words (heuristically segmented).
UnknownDictionary - Class in org.apache.lucene.analysis.ja.dict
Dictionary for unknown-word handling.
UnknownDictionary(URL, URL, URL) - Constructor for class org.apache.lucene.analysis.ja.dict.UnknownDictionary
Create a UnknownDictionary from an external resource URL (e.g.
UnknownDictionary(Path, Path, Path) - Constructor for class org.apache.lucene.analysis.ja.dict.UnknownDictionary
Create a UnknownDictionary from an external resource path.
UnknownDictionary(BinaryDictionary.ResourceScheme, String) - Constructor for class org.apache.lucene.analysis.ja.dict.UnknownDictionary
Deprecated, for removal: This API element is subject to removal in a future version.
replaced by UnknownDictionary(Path, Path, Path) for files and UnknownDictionary(URL, URL, URL) for classpath/module resources
USER - org.apache.lucene.analysis.ja.JapaneseTokenizer.Type
Known words from the user dictionary.
UserDictionary - Class in org.apache.lucene.analysis.ja.dict
Class for building a User Dictionary.

V

valueOf(String) - Static method in enum org.apache.lucene.analysis.ja.dict.BinaryDictionary.ResourceScheme
Deprecated.
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.lucene.analysis.ja.JapaneseTokenizer.Type
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.apache.lucene.analysis.ja.util.DictionaryBuilder.DictionaryFormat
Returns the enum constant of this type with the specified name.
values() - Static method in enum org.apache.lucene.analysis.ja.dict.BinaryDictionary.ResourceScheme
Deprecated.
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.lucene.analysis.ja.JapaneseCompletionFilter.Mode
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.lucene.analysis.ja.JapaneseTokenizer.Mode
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.lucene.analysis.ja.JapaneseTokenizer.Type
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.apache.lucene.analysis.ja.util.DictionaryBuilder.DictionaryFormat
Returns an array containing the constants of this enum type, in the order they are declared.
VERSION - Static variable in class org.apache.lucene.analysis.ja.dict.BinaryDictionary
 
VERSION - Static variable in class org.apache.lucene.analysis.ja.dict.CharacterDefinition
 
VERSION - Static variable in class org.apache.lucene.analysis.ja.dict.ConnectionCosts
 

W

WORD_COST - Static variable in class org.apache.lucene.analysis.ja.dict.UserDictionary
 
A B C D E F G H I J K L M N O P Q R S T U V W 
All Classes All Packages