public final class CJKWidthFilter
extends org.apache.lucene.analysis.TokenFilter
TokenFilter
that normalizes CJK width differences:
NOTE: this filter can be viewed as a (practical) subset of NFKC/NFKD Unicode normalization. See the normalization support in the ICU package for full normalization.
Constructor and Description |
---|
CJKWidthFilter(org.apache.lucene.analysis.TokenStream input) |
Modifier and Type | Method and Description |
---|---|
boolean |
incrementToken() |
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
public CJKWidthFilter(org.apache.lucene.analysis.TokenStream input)
public boolean incrementToken() throws IOException
incrementToken
in class org.apache.lucene.analysis.TokenStream
IOException