Class NLPLemmatizerOp

java.lang.Object
org.apache.lucene.analysis.opennlp.tools.NLPLemmatizerOp

public class NLPLemmatizerOp extends Object
Supply OpenNLP Lemmatizer tools.

Both a dictionary-based lemmatizer and a MaxEnt lemmatizer are supported. If both are configured, the dictionary-based lemmatizer is tried first, and then the MaxEnt lemmatizer is consulted for out-of-vocabulary tokens.

The MaxEnt implementation requires binary models from OpenNLP project on SourceForge.

  • Constructor Details

    • NLPLemmatizerOp

      public NLPLemmatizerOp(opennlp.tools.lemmatizer.DictionaryLemmatizer dictionaryLemmatizer, opennlp.tools.lemmatizer.LemmatizerModel lemmatizerModel) throws IOException
      Throws:
      IOException
  • Method Details