Class BinaryDictionary
- java.lang.Object
-
- org.apache.lucene.analysis.ko.dict.BinaryDictionary
-
- All Implemented Interfaces:
Dictionary
- Direct Known Subclasses:
TokenInfoDictionary,UnknownDictionary
public abstract class BinaryDictionary extends Object implements Dictionary
Base class for a binary-encoded in-memory dictionary.
-
-
Nested Class Summary
-
Nested classes/interfaces inherited from interface org.apache.lucene.analysis.ko.dict.Dictionary
Dictionary.Morpheme
-
-
Field Summary
Fields Modifier and Type Field Description static StringDICT_FILENAME_SUFFIXstatic StringDICT_HEADERstatic intHAS_READINGflag that the entry has reading data.static intHAS_SINGLE_POSflag that the entry has a single part of speech (leftPOS)static StringPOSDICT_FILENAME_SUFFIXstatic StringPOSDICT_HEADERstatic StringTARGETMAP_FILENAME_SUFFIXstatic StringTARGETMAP_HEADERstatic intVERSION
-
Constructor Summary
Constructors Modifier Constructor Description protectedBinaryDictionary()
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description static InputStreamgetClassResource(Class<?> clazz, String suffix)intgetLeftId(int wordId)Get left id of specified wordPOS.TaggetLeftPOS(int wordId)Get the leftPOS.Tagof specfied word.Dictionary.Morpheme[]getMorphemes(int wordId, char[] surfaceForm, int off, int len)Get the morphemes of specified word (e.g.POS.TypegetPOSType(int wordId)Get thePOS.Typeof specified word (morpheme, compound, inflect or pre-analysis)StringgetReading(int wordId)Get the reading of specified word (mainly used for Hanja to Hangul conversion).protected InputStreamgetResource(String suffix)intgetRightId(int wordId)Get right id of specified wordPOS.TaggetRightPOS(int wordId)Get the rightPOS.Tagof specfied word.intgetWordCost(int wordId)Get word cost of specified wordvoidlookupWordIds(int sourceId, IntsRef ref)
-
-
-
Field Detail
-
TARGETMAP_FILENAME_SUFFIX
public static final String TARGETMAP_FILENAME_SUFFIX
- See Also:
- Constant Field Values
-
DICT_FILENAME_SUFFIX
public static final String DICT_FILENAME_SUFFIX
- See Also:
- Constant Field Values
-
POSDICT_FILENAME_SUFFIX
public static final String POSDICT_FILENAME_SUFFIX
- See Also:
- Constant Field Values
-
DICT_HEADER
public static final String DICT_HEADER
- See Also:
- Constant Field Values
-
TARGETMAP_HEADER
public static final String TARGETMAP_HEADER
- See Also:
- Constant Field Values
-
POSDICT_HEADER
public static final String POSDICT_HEADER
- See Also:
- Constant Field Values
-
VERSION
public static final int VERSION
- See Also:
- Constant Field Values
-
HAS_SINGLE_POS
public static final int HAS_SINGLE_POS
flag that the entry has a single part of speech (leftPOS)- See Also:
- Constant Field Values
-
HAS_READING
public static final int HAS_READING
flag that the entry has reading data. otherwise reading is surface form- See Also:
- Constant Field Values
-
-
Constructor Detail
-
BinaryDictionary
protected BinaryDictionary() throws IOException- Throws:
IOException
-
-
Method Detail
-
getResource
protected final InputStream getResource(String suffix) throws IOException
- Throws:
IOException
-
getClassResource
public static final InputStream getClassResource(Class<?> clazz, String suffix) throws IOException
- Throws:
IOException
-
lookupWordIds
public void lookupWordIds(int sourceId, IntsRef ref)
-
getLeftId
public int getLeftId(int wordId)
Description copied from interface:DictionaryGet left id of specified word- Specified by:
getLeftIdin interfaceDictionary
-
getRightId
public int getRightId(int wordId)
Description copied from interface:DictionaryGet right id of specified word- Specified by:
getRightIdin interfaceDictionary
-
getWordCost
public int getWordCost(int wordId)
Description copied from interface:DictionaryGet word cost of specified word- Specified by:
getWordCostin interfaceDictionary
-
getPOSType
public POS.Type getPOSType(int wordId)
Description copied from interface:DictionaryGet thePOS.Typeof specified word (morpheme, compound, inflect or pre-analysis)- Specified by:
getPOSTypein interfaceDictionary
-
getLeftPOS
public POS.Tag getLeftPOS(int wordId)
Description copied from interface:DictionaryGet the leftPOS.Tagof specfied word. ForPOS.Type.MORPHEMEandPOS.Type.COMPOUNDthe left and right POS are the same.- Specified by:
getLeftPOSin interfaceDictionary
-
getRightPOS
public POS.Tag getRightPOS(int wordId)
Description copied from interface:DictionaryGet the rightPOS.Tagof specfied word. ForPOS.Type.MORPHEMEandPOS.Type.COMPOUNDthe left and right POS are the same.- Specified by:
getRightPOSin interfaceDictionary
-
getReading
public String getReading(int wordId)
Description copied from interface:DictionaryGet the reading of specified word (mainly used for Hanja to Hangul conversion).- Specified by:
getReadingin interfaceDictionary
-
getMorphemes
public Dictionary.Morpheme[] getMorphemes(int wordId, char[] surfaceForm, int off, int len)
Description copied from interface:DictionaryGet the morphemes of specified word (e.g. 가깝으나: 가깝 + 으나).- Specified by:
getMorphemesin interfaceDictionary
-
-