public final class ICUTransformFilter extends org.apache.lucene.analysis.TokenFilter
TokenFilterthat transforms text with ICU.
ICU provides text-transformation functionality via its Transliteration API. Although script conversion is its most common use, a Transliterator can actually perform a more general class of tasks. In fact, Transliterator defines a very general API which specifies only that a segment of the input text is replaced by new text. The particulars of this conversion are determined entirely by subclasses of Transliterator.
Some useful transformations for search are built-in:
stream = new ICUTransformFilter(stream, Transliterator.getInstance("Traditional-Simplified"));For more details, see the ICU User Guide.
|Constructor and Description|
Create a new ICUTransformFilter that transforms text on the given stream.
|Modifier and Type||Method and Description|
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
public ICUTransformFilter(org.apache.lucene.analysis.TokenStream input, com.ibm.icu.text.Transliterator transform)
transform- Transliterator to transform the text.