public final class HindiNormalizationFilter
extends org.apache.lucene.analysis.TokenFilter
TokenFilter
that applies HindiNormalizer
to normalize the
orthography.
In some cases the normalization may cause unrelated terms to conflate, so
to prevent terms from being normalized use an instance of
KeywordMarkerFilter
or a custom TokenFilter
that sets
the KeywordAttribute
before this TokenStream
.
HindiNormalizer
Constructor and Description |
---|
HindiNormalizationFilter(org.apache.lucene.analysis.TokenStream input) |
Modifier and Type | Method and Description |
---|---|
boolean |
incrementToken() |
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
public HindiNormalizationFilter(org.apache.lucene.analysis.TokenStream input)
public boolean incrementToken() throws IOException
incrementToken
in class org.apache.lucene.analysis.TokenStream
IOException