public final class HindiNormalizationFilter extends TokenFilter
TokenFilter that applies HindiNormalizer to normalize the
orthography.
In some cases the normalization may cause unrelated terms to conflate, so
to prevent terms from being normalized use an instance of
SetKeywordMarkerFilter or a custom TokenFilter that sets
the KeywordAttribute before this TokenStream.
HindiNormalizerAttributeSource.StateinputDEFAULT_TOKEN_ATTRIBUTE_FACTORY| Constructor and Description |
|---|
HindiNormalizationFilter(TokenStream input) |
| Modifier and Type | Method and Description |
|---|---|
boolean |
incrementToken() |
close, end, resetaddAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, removeAllAttributes, restoreState, toStringpublic HindiNormalizationFilter(TokenStream input)
public boolean incrementToken()
throws IOException
incrementToken in class TokenStreamIOExceptionCopyright © 2000-2016 Apache Software Foundation. All Rights Reserved.