Package org.apache.lucene.classification
Uses already seen data (the indexed documents) to classify an input ( can be simple text or a structured document).
Currently contains a (simplistic) Naive Bayes classifier, a k-Nearest
Neighbor classifier and a Perceptron based classifier.
-
Interface Summary Interface Description Classifier<T> A classifier, seehttp://en.wikipedia.org/wiki/Classifier_(mathematics)
, which assign classes of typeT
-
Class Summary Class Description BM25NBClassifier A classifier approximating naive bayes classifier by using pure queries on BM25.BooleanPerceptronClassifier CachingNaiveBayesClassifier A simplistic Lucene based NaiveBayes classifier, with caching feature, seehttp://en.wikipedia.org/wiki/Naive_Bayes_classifier
ClassificationResult<T> The result of a call toClassifier.assignClass(String)
holding an assigned class of typeT
and a score.KNearestFuzzyClassifier A k-Nearest Neighbor classifier based onNearestFuzzyQuery
.KNearestNeighborClassifier A k-Nearest Neighbor classifier (seehttp://en.wikipedia.org/wiki/K-nearest_neighbors
) based onMoreLikeThis
SimpleNaiveBayesClassifier A simplistic Lucene based NaiveBayes classifier, seehttp://en.wikipedia.org/wiki/Naive_Bayes_classifier