|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.apache.lucene.index.IndexReader
org.apache.lucene.index.ParallelReader
public class ParallelReader
An IndexReader which reads multiple, parallel indexes. Each index added must have the same number of documents, but typically each contains different fields. Each document contains the union of the fields of all documents with the same document number. When searching, matches for a query term are from the first index added that has the field.
This is useful, e.g., with collections that have large fields which change rarely and small fields that change more frequently. The smaller fields may be re-indexed in a new index and both indexes may be searched together.
Warning: It is up to you to make sure all indexes are created and modified the same way. For example, if you add documents to one index, you need to add the same documents in the same order to the other indexes. Failure to do so will result in undefined behavior.
| Nested Class Summary |
|---|
| Nested classes/interfaces inherited from class org.apache.lucene.index.IndexReader |
|---|
IndexReader.FieldOption |
| Field Summary |
|---|
| Fields inherited from class org.apache.lucene.index.IndexReader |
|---|
hasChanges |
| Constructor Summary | |
|---|---|
ParallelReader()
Construct a ParallelReader. |
|
ParallelReader(boolean closeSubReaders)
Construct a ParallelReader. |
|
| Method Summary | |
|---|---|
void |
add(IndexReader reader)
Add an IndexReader. |
void |
add(IndexReader reader,
boolean ignoreStoredFields)
Add an IndexReader whose stored fields will not be returned. |
Object |
clone()
Efficiently clones the IndexReader (sharing most internal state). |
int |
docFreq(Term term)
Returns the number of documents containing the term t. |
protected void |
doClose()
Implements close. |
protected void |
doCommit(Map<String,String> commitUserData)
Implements commit. |
Document |
document(int n,
FieldSelector fieldSelector)
Get the Document at the n
th position. |
protected void |
doDelete(int n)
Implements deletion of the document numbered docNum. |
protected IndexReader |
doReopen(boolean doClone)
|
protected void |
doSetNorm(int n,
String field,
byte value)
Implements setNorm in subclass. |
protected void |
doUndeleteAll()
Implements actual undeleteAll() in subclass. |
Collection<String> |
getFieldNames(IndexReader.FieldOption fieldNames)
Get a list of unique field names that exist in this index and have the specified field option information. |
TermFreqVector |
getTermFreqVector(int n,
String field)
Return a term frequency vector for the specified document and field. |
void |
getTermFreqVector(int docNumber,
String field,
TermVectorMapper mapper)
Load the Term Vector into a user-defined data structure instead of relying on the parallel arrays of the TermFreqVector. |
void |
getTermFreqVector(int docNumber,
TermVectorMapper mapper)
Map all the term vectors for all fields in a Document |
TermFreqVector[] |
getTermFreqVectors(int n)
Return an array of term frequency vectors for the specified document. |
long |
getVersion()
Not implemented. |
boolean |
hasDeletions()
Returns true if any documents have been deleted |
boolean |
hasNorms(String field)
Returns true if there are norms stored for this field. |
boolean |
isCurrent()
Checks recursively if all subreaders are up to date. |
boolean |
isDeleted(int n)
Returns true if document n has been deleted |
boolean |
isOptimized()
Checks recursively if all subindexes are optimized |
int |
maxDoc()
Returns one greater than the largest possible document number. |
byte[] |
norms(String field)
Returns the byte-encoded normalization factor for the named field of every document. |
void |
norms(String field,
byte[] result,
int offset)
Reads the byte-encoded normalization factor for the named field of every document. |
int |
numDocs()
Returns the number of documents in this index. |
IndexReader |
reopen()
Tries to reopen the subreaders. |
TermDocs |
termDocs()
Returns an unpositioned TermDocs enumerator. |
TermDocs |
termDocs(Term term)
Returns an enumeration of all the documents which contain term. |
TermPositions |
termPositions()
Returns an unpositioned TermPositions enumerator. |
TermPositions |
termPositions(Term term)
Returns an enumeration of all the documents which contain term. |
TermEnum |
terms()
Returns an enumeration of all the terms in the index. |
TermEnum |
terms(Term term)
Returns an enumeration of all terms starting at a given term. |
| Methods inherited from class org.apache.lucene.index.IndexReader |
|---|
acquireWriteLock, clone, close, commit, commit, decRef, deleteDocument, deleteDocuments, directory, document, ensureOpen, flush, flush, getCommitUserData, getCommitUserData, getCurrentVersion, getDeletesCacheKey, getFieldCacheKey, getIndexCommit, getRefCount, getSequentialSubReaders, getTermInfosIndexDivisor, getUniqueTermCount, incRef, indexExists, lastModified, listCommits, main, numDeletedDocs, open, open, open, open, open, open, open, reopen, reopen, setNorm, setNorm, undeleteAll |
| Methods inherited from class java.lang.Object |
|---|
equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public ParallelReader()
throws IOException
Note that all subreaders are closed if this ParallelReader is closed.
IOException
public ParallelReader(boolean closeSubReaders)
throws IOException
closeSubReaders - indicates whether the subreaders should be closed
when this ParallelReader is closed
IOException| Method Detail |
|---|
public void add(IndexReader reader)
throws IOException
IOException - if there is a low-level IO error
public void add(IndexReader reader,
boolean ignoreStoredFields)
throws IOException
IllegalArgumentException - if not all indexes contain the same number
of documents
IllegalArgumentException - if not all indexes have the same value
of IndexReader.maxDoc()
IOException - if there is a low-level IO errorpublic Object clone()
IndexReaderOn cloning a reader with pending changes (deletions, norms), the original reader transfers its write lock to the cloned reader. This means only the cloned reader may make further changes to the index, and commit the changes to the index on close, but the old reader still reflects all changes made up until it was cloned.
Like IndexReader.reopen(), it's safe to make changes to
either the original or the cloned reader: all shared
mutable state obeys "copy on write" semantics to ensure
the changes are not seen by other readers.
clone in class IndexReader
public IndexReader reopen()
throws CorruptIndexException,
IOException
A re-opened instance might share one or more subreaders with the old
instance. Index modification operations result in undefined behavior
when performed before the old instance is closed.
(see IndexReader.reopen()).
If subreaders are shared, then the reference count of those readers is increased to ensure that the subreaders remain open until the last referring reader is closed.
reopen in class IndexReaderCorruptIndexException - if the index is corrupt
IOException - if there is a low-level IO error
protected IndexReader doReopen(boolean doClone)
throws CorruptIndexException,
IOException
CorruptIndexException
IOExceptionpublic int numDocs()
IndexReader
numDocs in class IndexReaderpublic int maxDoc()
IndexReader
maxDoc in class IndexReaderpublic boolean hasDeletions()
IndexReader
hasDeletions in class IndexReaderpublic boolean isDeleted(int n)
IndexReader
isDeleted in class IndexReader
protected void doDelete(int n)
throws CorruptIndexException,
IOException
IndexReaderdocNum.
Applications should call IndexReader.deleteDocument(int) or IndexReader.deleteDocuments(Term).
doDelete in class IndexReaderCorruptIndexException
IOException
protected void doUndeleteAll()
throws CorruptIndexException,
IOException
IndexReader
doUndeleteAll in class IndexReaderCorruptIndexException
IOException
public Document document(int n,
FieldSelector fieldSelector)
throws CorruptIndexException,
IOException
IndexReaderDocument at the n
th position. The FieldSelector may be used to determine
what Fields to load and how they should
be loaded. NOTE: If this Reader (more specifically, the underlying
FieldsReader) is closed before the lazy
Field is loaded an exception may be
thrown. If you want the value of a lazy
Field to be available after closing you
must explicitly load it or fetch the Document again with a new loader.
NOTE: for performance reasons, this method does not check if the
requested document is deleted, and therefore asking for a deleted document
may yield unspecified results. Usually this is not required, however you
can call IndexReader.isDeleted(int) with the requested document ID to verify
the document is not deleted.
document in class IndexReadern - Get the document at the nth positionfieldSelector - The FieldSelector to use to determine what
Fields should be loaded on the Document. May be null, in which case
all Fields will be loaded.
Document at the nth position
CorruptIndexException - if the index is corrupt
IOException - if there is a low-level IO errorFieldable,
FieldSelector,
SetBasedFieldSelector,
LoadFirstFieldSelector
public TermFreqVector[] getTermFreqVectors(int n)
throws IOException
IndexReaderTermFreqVector
or of type TermPositionVector if
positions or offsets have been stored.
getTermFreqVectors in class IndexReadern - document for which term frequency vectors are returned
IOException - if index cannot be accessedField.TermVector
public TermFreqVector getTermFreqVector(int n,
String field)
throws IOException
IndexReaderTermPositionVector is returned.
getTermFreqVector in class IndexReadern - document for which the term frequency vector is returnedfield - field for which the term frequency vector is returned.
IOException - if index cannot be accessedField.TermVector
public void getTermFreqVector(int docNumber,
String field,
TermVectorMapper mapper)
throws IOException
IndexReaderTermFreqVector.
getTermFreqVector in class IndexReaderdocNumber - The number of the document to load the vector forfield - The name of the field to loadmapper - The TermVectorMapper to process the vector. Must not be null
IOException - if term vectors cannot be accessed or if they do not exist on the field and doc. specified.
public void getTermFreqVector(int docNumber,
TermVectorMapper mapper)
throws IOException
IndexReader
getTermFreqVector in class IndexReaderdocNumber - The number of the document to load the vector formapper - The TermVectorMapper to process the vector. Must not be null
IOException - if term vectors cannot be accessed or if they do not exist on the field and doc. specified.
public boolean hasNorms(String field)
throws IOException
IndexReader
hasNorms in class IndexReaderIOException
public byte[] norms(String field)
throws IOException
IndexReader
norms in class IndexReaderIOExceptionAbstractField.setBoost(float)
public void norms(String field,
byte[] result,
int offset)
throws IOException
IndexReader
norms in class IndexReaderIOExceptionAbstractField.setBoost(float)
protected void doSetNorm(int n,
String field,
byte value)
throws CorruptIndexException,
IOException
IndexReader
doSetNorm in class IndexReaderCorruptIndexException
IOException
public TermEnum terms()
throws IOException
IndexReaderTermEnum.next() must be called
on the resulting enumeration before calling other methods such as
TermEnum.term().
terms in class IndexReaderIOException - if there is a low-level IO error
public TermEnum terms(Term term)
throws IOException
IndexReader
terms in class IndexReaderIOException - if there is a low-level IO error
public int docFreq(Term term)
throws IOException
IndexReadert.
docFreq in class IndexReaderIOException - if there is a low-level IO error
public TermDocs termDocs(Term term)
throws IOException
IndexReaderterm. For each document, the document number, the frequency of
the term in that document is also provided, for use in
search scoring. If term is null, then all non-deleted
docs are returned with freq=1.
Thus, this method implements the mapping:
The enumeration is ordered by document number. Each document number is greater than all that precede it in the enumeration.
termDocs in class IndexReaderIOException - if there is a low-level IO error
public TermDocs termDocs()
throws IOException
IndexReaderTermDocs enumerator.
termDocs in class IndexReaderIOException - if there is a low-level IO error
public TermPositions termPositions(Term term)
throws IOException
IndexReaderterm. For each document, in addition to the document number
and frequency of the term in that document, a list of all of the ordinal
positions of the term in the document is available. Thus, this method
implements the mapping:
This positional information facilitates phrase and proximity searching.
The enumeration is ordered by document number. Each document number is greater than all that precede it in the enumeration.
termPositions in class IndexReaderIOException - if there is a low-level IO error
public TermPositions termPositions()
throws IOException
IndexReaderTermPositions enumerator.
termPositions in class IndexReaderIOException - if there is a low-level IO error
public boolean isCurrent()
throws CorruptIndexException,
IOException
isCurrent in class IndexReaderCorruptIndexException - if the index is corrupt
IOException - if there is a low-level IO errorpublic boolean isOptimized()
isOptimized in class IndexReadertrue if the index is optimized; false otherwisepublic long getVersion()
getVersion in class IndexReaderUnsupportedOperationException
protected void doCommit(Map<String,String> commitUserData)
throws IOException
IndexReader
doCommit in class IndexReaderIOException
protected void doClose()
throws IOException
IndexReader
doClose in class IndexReaderIOExceptionpublic Collection<String> getFieldNames(IndexReader.FieldOption fieldNames)
IndexReader
getFieldNames in class IndexReaderfieldNames - specifies which field option should be available for the returned fields
IndexReader.FieldOption
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||