public final class ICUFoldingFilter extends ICUNormalizer2Filter
This filter applies the following foldings from the report to unicode text:
Additionally, Default Ignorables are removed, and text is normalized to NFKC. All foldings, case folding, and normalization mappings are applied recursively to ensure a fully folded and normalized result.
A normalizer with additional settings such as a filter that lists characters not to be normalized can be passed in the constructor.
AttributeSource.State
Modifier and Type | Field and Description |
---|---|
static com.ibm.icu.text.Normalizer2 |
NORMALIZER
A normalizer for search term folding to Unicode text,
applying foldings from UTR#30 Character Foldings.
|
input
DEFAULT_TOKEN_ATTRIBUTE_FACTORY
Constructor and Description |
---|
ICUFoldingFilter(TokenStream input)
Create a new ICUFoldingFilter on the specified input
|
ICUFoldingFilter(TokenStream input,
com.ibm.icu.text.Normalizer2 normalizer)
Create a new ICUFoldingFilter on the specified input with the specified
normalizer
|
incrementToken
close, end, reset
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, endAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, removeAllAttributes, restoreState, toString
public static final com.ibm.icu.text.Normalizer2 NORMALIZER
public ICUFoldingFilter(TokenStream input)
public ICUFoldingFilter(TokenStream input, com.ibm.icu.text.Normalizer2 normalizer)
Copyright © 2000-2021 Apache Software Foundation. All Rights Reserved.