Package org.apache.lucene.analysis.icu.tokenattributes
package org.apache.lucene.analysis.icu.tokenattributes
Additional ICU-specific Attributes for text analysis.
-
ClassDescriptionExtension of
CharTermAttributeImpl
that encodes the term text as a binary Unicode collation key instead of as UTF-8 bytes.This attribute stores the UTR #24 script value for a token of text.Implementation ofScriptAttribute
that stores the script as an integer.