Class UnknownDictionary
java.lang.Object
org.apache.lucene.analysis.ja.dict.BinaryDictionary
org.apache.lucene.analysis.ja.dict.UnknownDictionary
- All Implemented Interfaces:
Dictionary
Dictionary for unknown-word handling.
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.lucene.analysis.ja.dict.BinaryDictionary
BinaryDictionary.ResourceScheme
-
Field Summary
Fields inherited from class org.apache.lucene.analysis.ja.dict.BinaryDictionary
DICT_FILENAME_SUFFIX, DICT_HEADER, HAS_BASEFORM, HAS_PRONUNCIATION, HAS_READING, POSDICT_FILENAME_SUFFIX, POSDICT_HEADER, TARGETMAP_FILENAME_SUFFIX, TARGETMAP_HEADER, VERSION
Fields inherited from interface org.apache.lucene.analysis.ja.dict.Dictionary
INTERNAL_SEPARATOR
-
Constructor Summary
Constructor    Description
UnknownDictionary(Path targetMapFile, Path posDictFile, Path dictFile)    Create an UnknownDictionary from an external resource path.
UnknownDictionary(BinaryDictionary.ResourceScheme scheme, String path)    Deprecated, for removal: This API element is subject to removal in a future version.
-
Method Summary
Modifier and Type    Method    Description
CharacterDefinition    getCharacterDefinition()
String    getInflectionForm(int wordId)    Get inflection form of tokens
String    getInflectionType(int wordId)    Get inflection type of tokens
static UnknownDictionary    getInstance()
String    getReading(int wordId, char[] surface, int off, int len)    Get reading of tokens
int    lookup(char[] text, int offset, int len)
Methods inherited from class org.apache.lucene.analysis.ja.dict.BinaryDictionary
getBaseForm, getLeftId, getPartOfSpeech, getPronunciation, getResource, getRightId, getWordCost, lookupWordIds
-
Constructor Details
-
UnknownDictionary
@Deprecated(forRemoval=true, since="9.1")
public UnknownDictionary(BinaryDictionary.ResourceScheme scheme, String path) throws IOException
Deprecated, for removal: This API element is subject to removal in a future version. Replaced by UnknownDictionary(Path, Path, Path).
- Parameters:
scheme - scheme for loading resources (FILE or CLASSPATH).
path - where to load resources from; a path, including the file base name without extension; this is used to match multiple files with the same base name.
- Throws:
IOException
-
UnknownDictionary
public UnknownDictionary(Path targetMapFile, Path posDictFile, Path dictFile) throws IOException
Create an UnknownDictionary from an external resource path.
- Parameters:
targetMapFile - where to load target map resource
posDictFile - where to load POS dictionary resource
dictFile - where to load dictionary entries resource
- Throws:
IOException - if resource was not found or broken
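When moving from the deprecated (scheme, path) constructor to the three-Path constructor, the single base path has to be expanded into one path per resource file. The following is a minimal sketch of that step, not Lucene code: the class UnknownDictionaryPaths and the method withSuffix are hypothetical names, the suffix strings are placeholders, and it assumes each resource file is named as the base name followed directly by its suffix. In real use the suffixes would come from the TARGETMAP_FILENAME_SUFFIX, POSDICT_FILENAME_SUFFIX and DICT_FILENAME_SUFFIX fields inherited from BinaryDictionary.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

/** Hypothetical helper, not part of Lucene: derives per-file paths from a shared base name. */
class UnknownDictionaryPaths {

  /**
   * Appends a suffix to the last element of basePath and keeps its parent directory.
   * Assumption: each resource file is named "<base name><suffix>".
   */
  static Path withSuffix(Path basePath, String suffix) {
    Path fileName = basePath.getFileName();
    if (fileName == null) {
      throw new IllegalArgumentException("basePath has no file name: " + basePath);
    }
    return basePath.resolveSibling(fileName.toString() + suffix);
  }

  public static void main(String[] args) {
    // Placeholder location and suffixes, for illustration only.
    Path base = Paths.get("dict", "UnknownDictionary");
    Path targetMapFile = withSuffix(base, "-targetMap.example");
    Path posDictFile = withSuffix(base, "-posDict.example");
    Path dictFile = withSuffix(base, "-dict.example");

    System.out.println(targetMapFile);
    System.out.println(posDictFile);
    System.out.println(dictFile);

    // With Lucene on the classpath, the three paths would then be passed on:
    // new UnknownDictionary(targetMapFile, posDictFile, dictFile);
  }
}
```

Taking the suffix as a parameter keeps the helper independent of the actual suffix values, so the same method can be called once per inherited constant.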
-
-
Method Details
-
lookup
public int lookup(char[] text, int offset, int len)
-
getCharacterDefinition
public CharacterDefinition getCharacterDefinition()
-
getReading
public String getReading(int wordId, char[] surface, int off, int len)
Description copied from interface: Dictionary
Get reading of tokens
- Specified by:
getReading in interface Dictionary
- Overrides:
getReading in class BinaryDictionary
- Parameters:
wordId - word ID of token
- Returns:
Reading of the token
-
getInflectionType
public String getInflectionType(int wordId)
Description copied from interface: Dictionary
Get inflection type of tokens
- Specified by:
getInflectionType in interface Dictionary
- Overrides:
getInflectionType in class BinaryDictionary
- Parameters:
wordId - word ID of token
- Returns:
inflection type, or null
-
getInflectionForm
public String getInflectionForm(int wordId)
Description copied from interface: Dictionary
Get inflection form of tokens
- Specified by:
getInflectionForm in interface Dictionary
- Overrides:
getInflectionForm in class BinaryDictionary
- Parameters:
wordId - word ID of token
- Returns:
inflection form, or null
-
getInstance
public static UnknownDictionary getInstance()
-