Class BengaliNormalizer

java.lang.Object
org.apache.lucene.analysis.bn.BengaliNormalizer

public class BengaliNormalizer extends Object
Normalizer for Bengali.

Implements the Bengali-language specific algorithm specified in: A Double Metaphone encoding for Bangla and its application in spelling checker Naushad UzZaman and Mumit Khan. http://www.panl10n.net/english/final%20reports/pdf%20files/Bangladesh/BAN16.pdf

  • Constructor Details

    • BengaliNormalizer

      public BengaliNormalizer()
  • Method Details

    • normalize

      public int normalize(char[] s, int len)
      Normalize an input buffer of Bengali text
      Parameters:
      s - input buffer
      len - length of input buffer
      Returns:
      length of input buffer after normalization