Class UnknownDictionary
java.lang.Object
org.apache.lucene.analysis.ja.dict.BinaryDictionary
org.apache.lucene.analysis.ja.dict.UnknownDictionary
- All Implemented Interfaces:
Dictionary
Dictionary for unknown-word handling.
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.lucene.analysis.ja.dict.BinaryDictionary
BinaryDictionary.ResourceScheme
-
Field Summary
Fields inherited from class org.apache.lucene.analysis.ja.dict.BinaryDictionary
DICT_FILENAME_SUFFIX, DICT_HEADER, HAS_BASEFORM, HAS_PRONUNCIATION, HAS_READING, POSDICT_FILENAME_SUFFIX, POSDICT_HEADER, TARGETMAP_FILENAME_SUFFIX, TARGETMAP_HEADER, VERSION
Fields inherited from interface org.apache.lucene.analysis.ja.dict.Dictionary
INTERNAL_SEPARATOR
-
Constructor Summary
Constructor and Description
UnknownDictionary(URL targetMapUrl, URL posDictUrl, URL dictUrl)
Create an UnknownDictionary from an external resource URL (e.g. from Classpath with ClassLoader.getResource(String)).
UnknownDictionary(Path targetMapFile, Path posDictFile, Path dictFile)
Create an UnknownDictionary from an external resource path.
UnknownDictionary(BinaryDictionary.ResourceScheme scheme, String path)
Deprecated, for removal: This API element is subject to removal in a future version.
-
Method Summary
Modifier and Type, Method, Description
CharacterDefinition getCharacterDefinition()
String getInflectionForm(int wordId)
Get inflection form of tokens
String getInflectionType(int wordId)
Get inflection type of tokens
static UnknownDictionary getInstance()
String getReading(int wordId, char[] surface, int off, int len)
Get reading of tokens
int lookup(char[] text, int offset, int len)
Methods inherited from class org.apache.lucene.analysis.ja.dict.BinaryDictionary
getBaseForm, getLeftId, getPartOfSpeech, getPronunciation, getResource, getRightId, getWordCost, lookupWordIds
-
Constructor Details
-
UnknownDictionary
@Deprecated(forRemoval=true, since="9.1")
public UnknownDictionary(BinaryDictionary.ResourceScheme scheme, String path) throws IOException
Deprecated, for removal: This API element is subject to removal in a future version. Replaced by UnknownDictionary(Path, Path, Path) for files and UnknownDictionary(URL, URL, URL) for classpath/module resources.
- Parameters:
scheme - scheme for loading resources (FILE or CLASSPATH).
path - where to load resources from; a path, including the file base name without extension; this is used to match multiple files with the same base name.
- Throws:
IOException
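When migrating away from the deprecated constructor, the single base-name path has to be expanded into three explicit paths. The following is a minimal sketch of that expansion using only java.nio.file. DictionaryPaths and fromBaseName are hypothetical names, not part of Lucene, and the suffix strings are left to the caller on the assumption that real code would take them from the inherited TARGETMAP_FILENAME_SUFFIX, POSDICT_FILENAME_SUFFIX and DICT_FILENAME_SUFFIX constants.

```java
import java.nio.file.Path;

/** Hypothetical helper (not part of Lucene): expands one base name into three sibling files. */
final class DictionaryPaths {
  final Path targetMap;
  final Path posDict;
  final Path dict;

  private DictionaryPaths(Path targetMap, Path posDict, Path dict) {
    this.targetMap = targetMap;
    this.posDict = posDict;
    this.dict = dict;
  }

  /**
   * Appends each suffix to the file base name (no extension) of basePath.
   * The suffix values are assumptions supplied by the caller.
   */
  static DictionaryPaths fromBaseName(
      Path basePath, String targetMapSuffix, String posDictSuffix, String dictSuffix) {
    Path fileName = basePath.getFileName();
    if (fileName == null) {
      // A root or empty path has no file name to extend.
      throw new IllegalArgumentException("basePath has no file name: " + basePath);
    }
    String baseName = fileName.toString();
    return new DictionaryPaths(
        basePath.resolveSibling(baseName + targetMapSuffix),
        basePath.resolveSibling(baseName + posDictSuffix),
        basePath.resolveSibling(baseName + dictSuffix));
  }
}
```

The three resulting paths would then be passed, in the order targetMap, posDict, dict, to UnknownDictionary(Path, Path, Path).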
-
UnknownDictionary
public UnknownDictionary(Path targetMapFile, Path posDictFile, Path dictFile) throws IOException
Create an UnknownDictionary from an external resource path.
- Parameters:
targetMapFile - where to load target map resource
posDictFile - where to load POS dictionary resource
dictFile - where to load dictionary entries resource
- Throws:
IOException - if resource was not found or broken
-
UnknownDictionary
public UnknownDictionary(URL targetMapUrl, URL posDictUrl, URL dictUrl) throws IOException
Create an UnknownDictionary from an external resource URL (e.g. from Classpath with ClassLoader.getResource(String)).
- Parameters:
targetMapUrl - where to load target map resource
posDictUrl - where to load POS dictionary resource
dictUrl - where to load dictionary entries resource
- Throws:
IOException - if resource was not found or broken
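ClassLoader.getResource(String) returns null rather than throwing when a resource is missing, so it can help to check each URL before handing it to this constructor. The following is a minimal sketch of such a check. ResourceUrls and require are hypothetical names, not part of Lucene, and the resource names a caller would pass are an assumption that depends on how the dictionary files are packaged.

```java
import java.io.IOException;
import java.net.URL;

/** Hypothetical helper (not part of Lucene): resolves classpath resources to URLs. */
final class ResourceUrls {
  private ResourceUrls() {}

  /**
   * Looks up resourceName through the given class loader and fails early
   * with an IOException, because getResource returns null when nothing matches.
   */
  static URL require(ClassLoader loader, String resourceName) throws IOException {
    URL url = loader.getResource(resourceName);
    if (url == null) {
      throw new IOException("Resource not found on classpath: " + resourceName);
    }
    return url;
  }
}
```

Three URLs resolved this way (target map, POS dictionary, dictionary entries) would be passed in that order to UnknownDictionary(URL, URL, URL).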
-
-
Method Details
-
lookup
public int lookup(char[] text, int offset, int len)
-
getCharacterDefinition
public CharacterDefinition getCharacterDefinition()
-
getReading
public String getReading(int wordId, char[] surface, int off, int len)
Description copied from interface: Dictionary
Get reading of tokens
- Specified by:
getReading in interface Dictionary
- Overrides:
getReading in class BinaryDictionary
- Parameters:
wordId - word ID of token
- Returns:
Reading of the token
-
getInflectionType
public String getInflectionType(int wordId)
Description copied from interface: Dictionary
Get inflection type of tokens
- Specified by:
getInflectionType in interface Dictionary
- Overrides:
getInflectionType in class BinaryDictionary
- Parameters:
wordId - word ID of token
- Returns:
inflection type, or null
-
getInflectionForm
public String getInflectionForm(int wordId)
Description copied from interface: Dictionary
Get inflection form of tokens
- Specified by:
getInflectionForm in interface Dictionary
- Overrides:
getInflectionForm in class BinaryDictionary
- Parameters:
wordId - word ID of token
- Returns:
inflection form, or null
-
getInstance
public static UnknownDictionary getInstance()
-