public final class TeluguNormalizationFilter extends TokenFilter
TokenFilter that applies TeluguNormalizer to normalize the orthography.
In some cases the normalization may conflate unrelated terms. To prevent specific terms
from being normalized, place an instance of SetKeywordMarkerFilter, or a custom TokenFilter that sets the KeywordAttribute, before this TokenStream in the analysis chain.
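The ordering requirement can be illustrated without Lucene on the classpath. The following is a minimal sketch under stated assumptions, not Lucene code: `Token`, `markKeywords`, `normalizeUnlessKeyword`, and `terms` are hypothetical stand-ins for a token carrying a KeywordAttribute, a keyword-marking filter, and a normalization filter that skips keyword-flagged tokens. The lowercasing normalizer is only a placeholder and has nothing to do with the rules TeluguNormalizer applies.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;
import java.util.function.UnaryOperator;

public class KeywordGuardSketch {

    /** Hypothetical stand-in for a token: term text plus a keyword flag. */
    static final class Token {
        final String term;
        final boolean keyword;

        Token(String term, boolean keyword) {
            this.term = term;
            this.keyword = keyword;
        }
    }

    /** Models the keyword-marking step: flags every term found in protectedTerms. */
    static List<Token> markKeywords(List<Token> in, Set<String> protectedTerms) {
        List<Token> out = new ArrayList<>();
        for (Token t : in) {
            out.add(new Token(t.term, t.keyword || protectedTerms.contains(t.term)));
        }
        return out;
    }

    /** Models the normalization step: keyword-flagged tokens pass through unchanged. */
    static List<Token> normalizeUnlessKeyword(List<Token> in, UnaryOperator<String> normalizer) {
        List<Token> out = new ArrayList<>();
        for (Token t : in) {
            out.add(t.keyword ? t : new Token(normalizer.apply(t.term), false));
        }
        return out;
    }

    /** Convenience helper: collects the term text of each token. */
    static List<String> terms(List<Token> tokens) {
        List<String> out = new ArrayList<>();
        for (Token t : tokens) {
            out.add(t.term);
        }
        return out;
    }

    public static void main(String[] args) {
        // Placeholder normalizer (lowercasing); it is NOT the TeluguNormalizer logic.
        UnaryOperator<String> normalizer = s -> s.toLowerCase(Locale.ROOT);
        Set<String> protectedTerms = new HashSet<>(Arrays.asList("IBM"));
        List<Token> input = Arrays.asList(new Token("Apple", false), new Token("IBM", false));

        // Marker first, normalizer second: the protected term survives intact.
        System.out.println(terms(normalizeUnlessKeyword(markKeywords(input, protectedTerms), normalizer)));

        // Reversed order: the term is normalized before the marker can flag it.
        System.out.println(terms(markKeywords(normalizeUnlessKeyword(input, normalizer), protectedTerms)));
    }
}
```

With the marker first, the protected term is left unchanged. With the order reversed, the term has already been rewritten by the time the marker runs, so it no longer matches the protected set and the protection has no effect.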
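See Also: TeluguNormalizer

Nested classes/interfaces inherited from class AttributeSource: AttributeSource.State

Fields inherited from class TokenFilter: input

Fields inherited from class TokenStream: DEFAULT_TOKEN_ATTRIBUTE_FACTORY

| Constructor and Description |
|---|
| TeluguNormalizationFilter(TokenStream input) |
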
| Modifier and Type | Method and Description |
|---|---|
| boolean | incrementToken() |

Methods inherited from class TokenFilter: close, end, reset

Methods inherited from class AttributeSource: addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, endAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, removeAllAttributes, restoreState, toString

public TeluguNormalizationFilter(TokenStream input)

public boolean incrementToken()
throws IOException
Specified by: incrementToken in class TokenStream
Throws: IOException

Copyright © 2000-2024 Apache Software Foundation. All Rights Reserved.