Class NLPLemmatizerOp


  • public class NLPLemmatizerOp
    extends Object
    Supply OpenNLP Lemmatizer tools.

    Both a dictionary-based lemmatizer and a MaxEnt lemmatizer are supported. If both are configured, the dictionary-based lemmatizer is tried first, and then the MaxEnt lemmatizer is consulted for out-of-vocabulary tokens.

    The MaxEnt implementation requires binary models from OpenNLP project on SourceForge.

    • Constructor Detail

      • NLPLemmatizerOp

        public NLPLemmatizerOp​(InputStream dictionary,
                               opennlp.tools.lemmatizer.LemmatizerModel lemmatizerModel)
                        throws IOException
        Throws:
        IOException