Class PersianNormalizer

java.lang.Object
org.apache.lucene.analysis.fa.PersianNormalizer

public class PersianNormalizer extends Object
Normalizer for Persian.

Normalization is done in place for efficiency, operating directly on a term buffer.

Normalization is defined as:

  • Normalization of various heh + hamza forms and heh goal to heh.
  • Normalization of Farsi yeh and yeh barree to Arabic yeh.
  • Normalization of Persian keheh to Arabic kaf.
  • Field Details

  • Constructor Details

    • PersianNormalizer

      public PersianNormalizer()
  • Method Details

    • normalize

      public int normalize(char[] s, int len)
      Normalize an input buffer of Persian text.
      Parameters:
      s - input buffer, modified in place
      len - length of input buffer
      Returns:
      length of input buffer after normalization; only the first that many characters of s should be read afterwards
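
      The in-place contract described above (rewrite the buffer, then return its new length) can be illustrated with a small standalone sketch. This is not the Lucene source: the class name PersianNormalizeSketch and its constants are hypothetical, and the sketch assumes that a combining hamza above (U+0654) is dropped, which is why the returned length can be smaller than len.

```java
// Hypothetical standalone sketch of the normalization rules listed above.
// It does not depend on Lucene and is not the library's implementation.
final class PersianNormalizeSketch {
    // Unicode characters named in the rules (names follow the Unicode charts).
    static final char YEH = '\u064A';         // ARABIC LETTER YEH
    static final char FARSI_YEH = '\u06CC';   // ARABIC LETTER FARSI YEH
    static final char YEH_BARREE = '\u06D2';  // ARABIC LETTER YEH BARREE
    static final char KEHEH = '\u06A9';       // ARABIC LETTER KEHEH
    static final char KAF = '\u0643';         // ARABIC LETTER KAF
    static final char HAMZA_ABOVE = '\u0654'; // ARABIC HAMZA ABOVE (combining)
    static final char HEH_YEH = '\u06C0';     // ARABIC LETTER HEH WITH YEH ABOVE
    static final char HEH_GOAL = '\u06C1';    // ARABIC LETTER HEH GOAL
    static final char HEH = '\u0647';         // ARABIC LETTER HEH

    /** Rewrites the first len chars of s in place and returns the new length. */
    static int normalize(char[] s, int len) {
        int out = 0; // write position; never runs ahead of the read position
        for (int i = 0; i < len; i++) {
            char c = s[i];
            switch (c) {
                case FARSI_YEH:
                case YEH_BARREE:
                    s[out++] = YEH;
                    break;
                case KEHEH:
                    s[out++] = KAF;
                    break;
                case HEH_YEH:
                case HEH_GOAL:
                    s[out++] = HEH;
                    break;
                case HAMZA_ABOVE:
                    // Assumption: the combining mark is dropped, shrinking the text.
                    break;
                default:
                    s[out++] = c;
            }
        }
        return out;
    }

    public static void main(String[] args) {
        char[] buf = {KEHEH, HEH, HAMZA_ABOVE, FARSI_YEH};
        int n = normalize(buf, buf.length);
        System.out.println("new length: " + n);
        // Read only the first n characters; anything after them is stale.
        for (int i = 0; i < n; i++) {
            System.out.println(String.format("U+%04X", (int) buf[i]));
        }
    }
}
```

      A caller would typically keep the returned value as the new term length and ignore any characters left over beyond it, for example with new String(buf, 0, n).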