Package org.apache.lucene.codecs.blockterms
package org.apache.lucene.codecs.blockterms
Pluggable term index / block terms dictionary implementations.
-
ClassDescriptionHandles a terms dict, but decouples all details of doc/freqs/positions reading to an instance of
PostingsReaderBase
.Writes terms dict, block-encoding (column stride) each term's metadata for each set of terms between two index terms.TermsIndexReader for simple every Nth terms indexes.Selects every Nth term as and index term, and hold term bytes (mostly) fully expanded in memory.BlockTermsReader
interacts with an instance of this class to manage its terms index.Similar to TermsEnum, except, the only "metadata" it reports for a given indexed term is the long fileOffset into the main terms dictionary file.Base class for terms index implementations to plug intoBlockTermsWriter
.Selects index terms according to provided pluggableVariableGapTermsIndexWriter.IndexTermSelector
, and stores them in a prefix trie that's loaded entirely in RAM stored as an FST.Sets an index term when docFreq >= docFreqThresh, or every interval terms.Same policy asFixedGapTermsIndexWriter
Hook for selecting which terms should be placed in the terms index.