Class ICUNormalizer2CharFilterFactory


public class ICUNormalizer2CharFilterFactory extends CharFilterFactory
Factory for ICUNormalizer2CharFilter

Supports the following attributes:

  • form: A Unicode Normalization Form, one of 'nfc','nfkc', 'nfkc_cf'. Default is nfkc_cf.
  • mode: Either 'compose' or 'decompose'. Default is compose. Use "decompose" with nfc or nfkc, to get nfd or nfkd, respectively.
  • filter: A UnicodeSet pattern. Codepoints outside the set are always left unchanged. Default is [] (the null set, no filtering).
Since:
4.10.0
See Also:
SPI Name (case-insensitive: if the name is 'htmlStrip', 'htmlstrip' can be used when looking up the service).
"icuNormalizer2"
  • Field Details

  • Constructor Details

    • ICUNormalizer2CharFilterFactory

      public ICUNormalizer2CharFilterFactory(Map<String,String> args)
      Creates a new ICUNormalizer2CharFilterFactory
    • ICUNormalizer2CharFilterFactory

      public ICUNormalizer2CharFilterFactory()
      Default ctor for compatibility with SPI
  • Method Details