Class NormalizationH1

  • public class NormalizationH1
    extends Normalization
    Normalization model that assumes a uniform distribution of the term frequency.

    While this model is parameterless in the original article, information-based models (see IBSimilarity) introduced a multiplying factor. The default value for the c parameter is 1.

    WARNING: This API is experimental and might change in incompatible ways in the next release.
    • Constructor Detail

      • NormalizationH1

        public NormalizationH1​(float c)
        Creates NormalizationH1 with the supplied parameter c.
        c - hyper-parameter that controls the term frequency normalization with respect to the document length.
    • Method Detail

      • tfn

        public final float tfn​(BasicStats stats,
                               float tf,
                               float len)
        Description copied from class: Normalization
        Returns the normalized term frequency.
        Specified by:
        tfn in class Normalization
        len - the field length.
      • toString

        public String toString()
        Description copied from class: Normalization
        Subclasses must override this method to return the code of the normalization formula. Refer to the original paper for the list.
        Specified by:
        toString in class Normalization