Class DocToDoubleVectorUtils


  • public class DocToDoubleVectorUtils
    extends Object
    Utility class for converting Lucene Documents to Double vectors.
    • Method Detail

      • toSparseLocalFreqDoubleArray

        public static Double[] toSparseLocalFreqDoubleArray(Terms docTerms,
                                                            Terms fieldTerms)
                                                     throws IOException
        Creates a sparse Double vector from the document and field term vectors, using the local frequency of each term in the document.
        Parameters:
        docTerms - term vectors for a given document
        fieldTerms - field term vectors
        Returns:
        a sparse vector of Doubles as an array
        Throws:
        IOException - if accessing the underlying index fails
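
The sparse conversion can be illustrated without a Lucene index. The following is a minimal sketch, not the library's implementation: it assumes the result has one slot per term of the field, in sorted term order, holding the term's frequency in the document or 0.0 when the term is absent. The class `SparseLocalFreqSketch`, the method `toSparse`, and the map and set inputs are hypothetical stand-ins for the `Terms` arguments.

```java
import java.util.Arrays;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeMap;
import java.util.TreeSet;

/** Hypothetical stand-in that mimics a sparse local-frequency vector without Lucene. */
final class SparseLocalFreqSketch {

    /**
     * Assumption: one slot per term of the field vocabulary, in sorted order.
     * Each slot holds the term's frequency in the document, or 0.0 if the
     * document does not contain the term.
     */
    static Double[] toSparse(Map<String, Long> docTermFreqs, SortedSet<String> fieldVocabulary) {
        Double[] vector = new Double[fieldVocabulary.size()];
        int i = 0;
        for (String term : fieldVocabulary) {
            vector[i++] = docTermFreqs.getOrDefault(term, 0L).doubleValue();
        }
        return vector;
    }

    public static void main(String[] args) {
        // Illustrative data only: four terms in the field, two of them in the document.
        SortedSet<String> vocabulary =
                new TreeSet<>(Arrays.asList("apple", "banana", "cherry", "date"));
        Map<String, Long> doc = new TreeMap<>();
        doc.put("banana", 2L);
        doc.put("date", 1L);
        System.out.println(Arrays.toString(toSparse(doc, vocabulary)));
        // prints [0.0, 2.0, 0.0, 1.0]
    }
}
```

Under this assumption, vectors built for different documents of the same field share one dimensionality and one term ordering, so they can be compared position by position.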
      • toDenseLocalFreqDoubleArray

        public static Double[] toDenseLocalFreqDoubleArray(Terms docTerms)
                                                    throws IOException
        Creates a dense Double vector from the document term vectors, using the local frequency of each term in the document.
        Parameters:
        docTerms - term vectors for a given document
        Returns:
        a dense vector of Doubles as an array
        Throws:
        IOException - if accessing the underlying index fails
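
The dense conversion can be illustrated in the same spirit. This is a minimal sketch under stated assumptions, not the library's implementation: it assumes the result has one slot per distinct term that actually occurs in the document, in sorted term order, holding that term's frequency in the document. The class `DenseLocalFreqSketch`, the method `toDense`, and the sorted-map input are hypothetical stand-ins for the `Terms` argument.

```java
import java.util.Arrays;
import java.util.SortedMap;
import java.util.TreeMap;

/** Hypothetical stand-in that mimics a dense local-frequency vector without Lucene. */
final class DenseLocalFreqSketch {

    /**
     * Assumption: one slot per distinct term present in the document, in
     * sorted term order, holding that term's frequency in the document.
     * Terms that do not occur in the document get no slot at all.
     */
    static Double[] toDense(SortedMap<String, Long> docTermFreqs) {
        Double[] vector = new Double[docTermFreqs.size()];
        int i = 0;
        for (long freq : docTermFreqs.values()) {
            vector[i++] = (double) freq;
        }
        return vector;
    }

    public static void main(String[] args) {
        // Illustrative data only: a document containing two distinct terms.
        SortedMap<String, Long> doc = new TreeMap<>();
        doc.put("banana", 2L);
        doc.put("date", 1L);
        System.out.println(Arrays.toString(toDense(doc)));
        // prints [2.0, 1.0]
    }
}
```

Under this assumption, the length of a dense vector depends on the document, so dense vectors from different documents are not directly comparable position by position.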