Class SegToken
java.lang.Object
org.apache.lucene.analysis.cn.smart.hhmm.SegToken
SmartChineseAnalyzer internal token
- WARNING: This API is experimental and might change in incompatible ways in the next release.
-
Field Summary
Modifier and TypeFieldDescriptionchar[]
Character array containing token textint
end offset into original sentenceint
during segmentation, this is used to store the index of the token in the token list tableint
start offset into original sentenceint
word frequencyint
WordType
of the text -
Constructor Summary
ConstructorDescriptionSegToken
(char[] idArray, int start, int end, int wordType, int weight) Create a new SegToken from a character array. -
Method Summary
-
Field Details
-
charArray
public char[] charArrayCharacter array containing token text -
startOffset
public int startOffsetstart offset into original sentence -
endOffset
public int endOffsetend offset into original sentence -
wordType
public int wordTypeWordType
of the text -
weight
public int weightword frequency -
index
public int indexduring segmentation, this is used to store the index of the token in the token list table
-
-
Constructor Details
-
SegToken
public SegToken(char[] idArray, int start, int end, int wordType, int weight) Create a new SegToken from a character array.- Parameters:
idArray
- character array containing textstart
- start offset of SegToken in original sentenceend
- end offset of SegToken in original sentencewordType
-WordType
of the textweight
- word frequency
-
-
Method Details