org.apache.lucene.codecs.lucene90.blocktree.Lucene90BlockTreeTermsReader

All Implemented Interfaces:: Closeable, AutoCloseable, Iterable<String>

public final class Lucene90BlockTreeTermsReader extends FieldsProducer

A block-based terms index and dictionary that assigns terms to variable length blocks according to how they share prefixes. The terms index is a prefix trie whose leaves are term blocks. The advantage of this approach is that seekExact is often able to determine a term cannot exist without doing any IO, and intersection with Automata is very fast. Note that this terms dictionary has its own fixed terms index (ie, it does not support a pluggable terms index implementation).

NOTE: this terms dictionary supports min/maxItemsPerBlock during indexing to control how much memory the terms index uses.

The data structure used by this implementation is very similar to a burst trie (http://citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.18.3499), but with added logic to break up too-large blocks of all terms sharing a given prefix into smaller ones.

Use CheckIndex with the -verbose option to see summary statistics on the blocks in the dictionary.

See Lucene90BlockTreeTermsWriter.

WARNING: This API is experimental and might change in incompatible ways in the next release.

Field Summary

Fields

Modifier and Type

Field

Description

static final int

VERSION_CURRENT

Current terms format.

static final int

VERSION_START

Initial terms format.

Fields inherited from class org.apache.lucene.index.Fields
EMPTY_ARRAY
Constructor Summary

Constructors

Constructor

Description

Lucene90BlockTreeTermsReader(PostingsReaderBase postingsReader, SegmentReadState state)

Sole constructor.
Method Summary

Modifier and Type

Method

Description

void

checkIntegrity()

Checks consistency of this reader.

void

close()

Iterator<String>

iterator()

Returns an iterator that will step through all fields names.

int

size()

Returns the number of fields or -1 if the number of distinct field names is unknown.

Terms

terms(String field)

Get the Terms for this field.

String

toString()

Methods inherited from class org.apache.lucene.codecs.FieldsProducer
getMergeInstance

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait

Methods inherited from interface java.lang.Iterable
forEach, spliterator

Field Details
- VERSION_START
  
  public static final int VERSION_START
  
  Initial terms format.
  See Also:
  
  Constant Field Values
- VERSION_CURRENT
  
  public static final int VERSION_CURRENT
  
  Current terms format.
  See Also:
  
  Constant Field Values
Constructor Details
- Lucene90BlockTreeTermsReader
  
  public Lucene90BlockTreeTermsReader(PostingsReaderBase postingsReader, SegmentReadState state) throws IOException
  
  Sole constructor.
  
  Throws:
  
  IOException
Method Details
- close
  
  public void close() throws IOException
  
  Specified by:
  
  close in interface AutoCloseable
  
  Specified by:
  
  close in interface Closeable
  
  Specified by:
  
  close in class FieldsProducer
  
  Throws:
  
  IOException
- iterator
  
  public Iterator<String> iterator()
  
  Description copied from class: Fields
  
  Returns an iterator that will step through all fields names. This will not return null.
  
  Specified by:
  
  iterator in interface Iterable<String>
  
  Specified by:
  
  iterator in class Fields
- terms
  
  public Terms terms(String field) throws IOException
  
  Description copied from class: Fields
  
  Get the Terms for this field. This will return null if the field does not exist.
  
  Specified by:
  
  terms in class Fields
  
  Throws:
  
  IOException
- size
  
  public int size()
  
  Description copied from class: Fields
  
  Returns the number of fields or -1 if the number of distinct field names is unknown. If >= 0, Fields.iterator() will return as many field names.
  
  Specified by:
  
  size in class Fields
- checkIntegrity
  
  public void checkIntegrity() throws IOException
  
  Description copied from class: FieldsProducer
  
  Checks consistency of this reader.
  Note that this may be costly in terms of I/O, e.g. may involve computing a checksum value against large data files.
  
  Specified by:
  
  checkIntegrity in class FieldsProducer
  
  Throws:
  
  IOException
- toString
  
  public String toString()
  
  Overrides:
  
  toString in class Object

Class Lucene90BlockTreeTermsReader

Field Summary

Fields inherited from class org.apache.lucene.index.Fields

Constructor Summary

Method Summary

Methods inherited from class org.apache.lucene.codecs.FieldsProducer

Methods inherited from class java.lang.Object

Methods inherited from interface java.lang.Iterable

Field Details

VERSION_START

VERSION_CURRENT

Constructor Details

Lucene90BlockTreeTermsReader

Method Details

close

iterator

terms

size

checkIntegrity

toString