Class PersianAnalyzer

  • All Implemented Interfaces:
    Closeable, AutoCloseable

    public final class PersianAnalyzer
    extends StopwordAnalyzerBase
    Analyzer for Persian.

    This Analyzer uses PersianCharFilter, which replaces the zero-width non-joiner (U+200C) with an ordinary space, so text is tokenized around zero-width non-joiners in addition to whitespace. Some Persian-specific variant forms (such as Farsi yeh and keheh) are standardized. "Stemming" is accomplished via stopwords.
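
    The pipeline described above can be sketched in plain Java, without Lucene on the classpath. This is only an illustration of the idea, not the analyzer's actual implementation: the class name PersianAnalysisSketch, its methods, and the three-word stopword set are hypothetical, and only the two variant forms named above (Farsi yeh and keheh) are normalized, on the assumption that they map to Arabic yeh and Arabic kaf.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in for illustration; this is NOT Lucene's PersianAnalyzer.
final class PersianAnalysisSketch {

    // Tiny illustrative stopword set (assumption): "and", "in", "to".
    // The real analyzer loads a much larger default list.
    static final Set<String> STOPWORDS =
            Set.of("\u0648", "\u062F\u0631", "\u0628\u0647");

    // Standardize two Persian-specific variant forms (assumed mappings):
    // Farsi yeh (U+06CC) -> Arabic yeh (U+064A), keheh (U+06A9) -> Arabic kaf (U+0643).
    static String normalize(String token) {
        return token.replace('\u06CC', '\u064A')
                    .replace('\u06A9', '\u0643');
    }

    static List<String> analyze(String text) {
        // Char-filter step: turn zero-width non-joiner (U+200C) into a space,
        // so the whitespace split below also breaks tokens at ZWNJ.
        String spaced = text.replace('\u200C', ' ');

        List<String> tokens = new ArrayList<>();
        for (String raw : spaced.trim().split("\\s+")) {
            if (raw.isEmpty()) {
                continue; // empty input yields one empty string from split
            }
            String token = normalize(raw);
            // Stopword removal runs on the normalized form.
            if (!STOPWORDS.contains(token)) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // One word containing a ZWNJ, the stopword "and", then a word with keheh and Farsi yeh.
        System.out.println(analyze("\u0645\u06CC\u200C\u0631\u0648\u0645 \u0648 \u06A9\u06CC\u06A9"));
    }
}
```

    In this sketch the ZWNJ-joined word comes out as two tokens, the stopword is dropped, and the remaining tokens carry the standardized letter forms, which is why a query and a document that differ only in these variants would produce the same tokens.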

    Since:
    3.1