Package org.apache.lucene.analysis.fa
Class PersianNormalizer
java.lang.Object
org.apache.lucene.analysis.fa.PersianNormalizer
Normalizer for Persian.
Normalization is done in-place for efficiency, operating on a termbuffer.
Normalization is defined as:
- Normalization of various heh + hamza forms and heh goal to heh.
- Normalization of farsi yeh and yeh barree to arabic yeh
- Normalization of persian keheh to arabic kaf
-
Field Summary
Modifier and TypeFieldDescriptionstatic final char
static final char
static final char
static final char
static final char
static final char
static final char
static final char
static final char
-
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionint
normalize
(char[] s, int len) Normalize an input buffer of Persian text
-
Field Details
-
YEH
public static final char YEH- See Also:
-
FARSI_YEH
public static final char FARSI_YEH- See Also:
-
YEH_BARREE
public static final char YEH_BARREE- See Also:
-
KEHEH
public static final char KEHEH- See Also:
-
KAF
public static final char KAF- See Also:
-
HAMZA_ABOVE
public static final char HAMZA_ABOVE- See Also:
-
HEH_YEH
public static final char HEH_YEH- See Also:
-
HEH_GOAL
public static final char HEH_GOAL- See Also:
-
HEH
public static final char HEH- See Also:
-
-
Constructor Details
-
PersianNormalizer
public PersianNormalizer()
-
-
Method Details
-
normalize
public int normalize(char[] s, int len) Normalize an input buffer of Persian text- Parameters:
s
- input bufferlen
- length of input buffer- Returns:
- length of input buffer after normalization
-