org.apache.lucene.analysis.miscellaneous
Class ScandinavianNormalizationFilter
java.lang.Object
org.apache.lucene.util.AttributeSource
org.apache.lucene.analysis.TokenStream
org.apache.lucene.analysis.TokenFilter
org.apache.lucene.analysis.miscellaneous.ScandinavianNormalizationFilter
- All Implemented Interfaces:
- Closeable
public final class ScandinavianNormalizationFilter
- extends TokenFilter
This filter normalizes the use of the interchangeable Scandinavian characters æÆäÄöÖøØ
and folded variants (aa, ao, ae, oe and oo) by transforming them to åÅæÆøØ.
It is a semantically less destructive solution than ScandinavianFoldingFilter,
most useful when a person with a Norwegian or Danish keyboard queries a Swedish index
and vice versa. This filter does not perform the common Swedish folds of å and ä to a, nor of ö to o.
blåbærsyltetøj == blåbärsyltetöj == blaabaersyltetoej but not blabarsyltetoj
räksmörgås == ræksmørgås == ræksmörgaos == raeksmoergaas but not raksmorgas
- See Also:
ScandinavianFoldingFilter
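The mapping described above can be illustrated with a small stand-alone sketch. It is not the filter's actual source and does not use any Lucene classes. The class name ScandinavianNormalizationSketch and the method normalize are hypothetical, and the pairing of folded variants with target characters (aa and ao to å, ae to æ, oe and oo to ø) is an assumption inferred from the description and the räksmörgås example. Non-ASCII characters are written as Unicode escapes so the snippet compiles regardless of source encoding.

```java
// Hypothetical, simplified illustration of the documented normalization.
// It works on plain strings and is NOT the Lucene implementation.
final class ScandinavianNormalizationSketch {

    private static final char A_UMLAUT = '\u00E4';    // a with diaeresis
    private static final char O_UMLAUT = '\u00F6';    // o with diaeresis
    private static final char AE_LOWER = '\u00E6';    // ae ligature
    private static final char AE_UPPER = '\u00C6';
    private static final char OE_LOWER = '\u00F8';    // o with stroke
    private static final char OE_UPPER = '\u00D8';
    private static final char ARING_LOWER = '\u00E5'; // a with ring
    private static final char ARING_UPPER = '\u00C5';

    private ScandinavianNormalizationSketch() {}

    /** Returns the normalized form of a single token (assumed mapping). */
    static String normalize(String token) {
        StringBuilder out = new StringBuilder(token.length());
        for (int i = 0; i < token.length(); i++) {
            char c = token.charAt(i);
            // Lower-cased lookahead; 0 means there is no next character.
            char next = i + 1 < token.length()
                    ? Character.toLowerCase(token.charAt(i + 1)) : 0;
            boolean upper = Character.isUpperCase(c);
            char lower = Character.toLowerCase(c);

            if (lower == A_UMLAUT) {
                // Swedish a-umlaut becomes the ae ligature.
                out.append(upper ? AE_UPPER : AE_LOWER);
            } else if (lower == O_UMLAUT) {
                // Swedish o-umlaut becomes o with stroke.
                out.append(upper ? OE_UPPER : OE_LOWER);
            } else if (lower == 'a' && (next == 'a' || next == 'o')) {
                // Folded "aa" / "ao" collapse to a with ring.
                out.append(upper ? ARING_UPPER : ARING_LOWER);
                i++; // consume the second letter of the pair
            } else if (lower == 'a' && next == 'e') {
                // Folded "ae" collapses to the ae ligature.
                out.append(upper ? AE_UPPER : AE_LOWER);
                i++;
            } else if (lower == 'o' && (next == 'e' || next == 'o')) {
                // Folded "oe" / "oo" collapse to o with stroke.
                out.append(upper ? OE_UPPER : OE_LOWER);
                i++;
            } else {
                // Everything else, including plain a and o, is kept as-is.
                out.append(c);
            }
        }
        return out.toString();
    }
}
```

Under these assumptions, normalize maps räksmörgås, ræksmörgaos and raeksmoergaas to the same string, while raksmorgas is left unchanged, which mirrors the equivalences listed in the description.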
Methods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
ScandinavianNormalizationFilter
public ScandinavianNormalizationFilter(TokenStream input)
incrementToken
public boolean incrementToken()
throws IOException
- Specified by:
incrementToken
in class TokenStream
- Throws:
IOException
Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.