Class DocToDoubleVectorUtils

java.lang.Object
org.apache.lucene.classification.utils.DocToDoubleVectorUtils

public class DocToDoubleVectorUtils extends Object
utility class for converting Lucene Documents to Double vectors.
  • Method Details

    • toSparseLocalFreqDoubleArray

      public static Double[] toSparseLocalFreqDoubleArray(Terms docTerms, Terms fieldTerms) throws IOException
      create a sparse Double vector given doc and field term vectors using local frequency of the terms in the doc
      Parameters:
      docTerms - term vectors for a given document
      fieldTerms - field term vectors
      Returns:
      a sparse vector of Doubles as an array
      Throws:
      IOException - in case accessing the underlying index fails
    • toDenseLocalFreqDoubleArray

      public static Double[] toDenseLocalFreqDoubleArray(Terms docTerms) throws IOException
      create a dense Double vector given doc and field term vectors using local frequency of the terms in the doc
      Parameters:
      docTerms - term vectors for a given document
      Returns:
      a dense vector of Doubles as an array
      Throws:
      IOException - in case accessing the underlying index fails