Class NLPLemmatizerOp
java.lang.Object
org.apache.lucene.analysis.opennlp.tools.NLPLemmatizerOp
Supply OpenNLP Lemmatizer tools.
Both a dictionary-based lemmatizer and a MaxEnt lemmatizer are supported. If both are configured, the dictionary-based lemmatizer is tried first, and then the MaxEnt lemmatizer is consulted for out-of-vocabulary tokens.
The MaxEnt implementation requires binary models from OpenNLP project on SourceForge.
-
Constructor Summary
ConstructorDescriptionNLPLemmatizerOp
(opennlp.tools.lemmatizer.DictionaryLemmatizer dictionaryLemmatizer, opennlp.tools.lemmatizer.LemmatizerModel lemmatizerModel) -
Method Summary
-
Constructor Details
-
NLPLemmatizerOp
public NLPLemmatizerOp(opennlp.tools.lemmatizer.DictionaryLemmatizer dictionaryLemmatizer, opennlp.tools.lemmatizer.LemmatizerModel lemmatizerModel) throws IOException - Throws:
IOException
-
-
Method Details
-
lemmatize
-