Package org.apache.lucene.classification.document
Uses already seen data (the indexed documents) to classify new documents.
Currently contains a (simplistic) Naive Bayes classifier and a k-Nearest Neighbor classifier.
-
Interface Summary Interface Description DocumentClassifier<T> A classifier, seehttp://en.wikipedia.org/wiki/Classifier_(mathematics)
, which assign classes of typeT
to aDocument
s -
Class Summary Class Description KNearestNeighborDocumentClassifier A k-Nearest Neighbor Document classifier (seehttp://en.wikipedia.org/wiki/K-nearest_neighbors
) based onMoreLikeThis
.SimpleNaiveBayesDocumentClassifier A simplistic Lucene based NaiveBayes classifier, seehttp://en.wikipedia.org/wiki/Naive_Bayes_classifier