Class TokenInfoDictionary
- java.lang.Object
-
- org.apache.lucene.analysis.ja.dict.BinaryDictionary
-
- org.apache.lucene.analysis.ja.dict.TokenInfoDictionary
-
- All Implemented Interfaces:
Dictionary
public final class TokenInfoDictionary extends BinaryDictionary
Binary dictionary implementation for a known-word dictionary model: Words are encoded into an FST mapping to a list of wordIDs.
-
-
Nested Class Summary
-
Nested classes/interfaces inherited from class org.apache.lucene.analysis.ja.dict.BinaryDictionary
BinaryDictionary.ResourceScheme
-
-
Field Summary
Fields Modifier and Type Field Description static String
FST_FILENAME_SUFFIX
-
Fields inherited from class org.apache.lucene.analysis.ja.dict.BinaryDictionary
DICT_FILENAME_SUFFIX, DICT_HEADER, HAS_BASEFORM, HAS_PRONUNCIATION, HAS_READING, POSDICT_FILENAME_SUFFIX, POSDICT_HEADER, TARGETMAP_FILENAME_SUFFIX, TARGETMAP_HEADER, VERSION
-
Fields inherited from interface org.apache.lucene.analysis.ja.dict.Dictionary
INTERNAL_SEPARATOR
-
-
Constructor Summary
Constructors Constructor Description TokenInfoDictionary(URL targetMapUrl, URL posDictUrl, URL dictUrl, URL fstUrl)
Create aTokenInfoDictionary
from an external resource URL (e.g.TokenInfoDictionary(Path targetMapFile, Path posDictFile, Path dictFile, Path fstFile)
Create aTokenInfoDictionary
from an external resource path.TokenInfoDictionary(BinaryDictionary.ResourceScheme resourceScheme, String resourcePath)
Deprecated, for removal: This API element is subject to removal in a future version.replaced byTokenInfoDictionary(Path, Path, Path, Path)
for files andTokenInfoDictionary(URL, URL, URL, URL)
for classpath/module resources
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description TokenInfoFST
getFST()
static TokenInfoDictionary
getInstance()
-
Methods inherited from class org.apache.lucene.analysis.ja.dict.BinaryDictionary
getBaseForm, getInflectionForm, getInflectionType, getLeftId, getPartOfSpeech, getPronunciation, getReading, getResource, getRightId, getWordCost, lookupWordIds
-
-
-
-
Field Detail
-
FST_FILENAME_SUFFIX
public static final String FST_FILENAME_SUFFIX
- See Also:
- Constant Field Values
-
-
Constructor Detail
-
TokenInfoDictionary
@Deprecated(forRemoval=true, since="9.1") public TokenInfoDictionary(BinaryDictionary.ResourceScheme resourceScheme, String resourcePath) throws IOException
Deprecated, for removal: This API element is subject to removal in a future version.replaced byTokenInfoDictionary(Path, Path, Path, Path)
for files andTokenInfoDictionary(URL, URL, URL, URL)
for classpath/module resources- Parameters:
resourceScheme
- - scheme for loading resources (FILE or CLASSPATH).resourcePath
- - where to load resources (dictionaries) from.- Throws:
IOException
-
TokenInfoDictionary
public TokenInfoDictionary(Path targetMapFile, Path posDictFile, Path dictFile, Path fstFile) throws IOException
Create aTokenInfoDictionary
from an external resource path.- Parameters:
targetMapFile
- where to load target map resourceposDictFile
- where to load POS dictionary resourcedictFile
- where to load dictionary entries resourcefstFile
- where to load encoded FST data resource- Throws:
IOException
- if resource was not found or broken
-
TokenInfoDictionary
public TokenInfoDictionary(URL targetMapUrl, URL posDictUrl, URL dictUrl, URL fstUrl) throws IOException
Create aTokenInfoDictionary
from an external resource URL (e.g. from Classpath withClassLoader.getResource(String)
).- Parameters:
targetMapUrl
- where to load target map resourceposDictUrl
- where to load POS dictionary resourcedictUrl
- where to load dictionary entries resourcefstUrl
- where to load encoded FST data resource- Throws:
IOException
- if resource was not found or broken
-
-
Method Detail
-
getFST
public TokenInfoFST getFST()
-
getInstance
public static TokenInfoDictionary getInstance()
-
-