|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.apache.lucene.index.IndexReader org.apache.lucene.index.ParallelReader
public class ParallelReader
An IndexReader which reads multiple, parallel indexes. Each index added must have the same number of documents, but typically each contains different fields. Each document contains the union of the fields of all documents with the same document number. When searching, matches for a query term are from the first index added that has the field.
This is useful, e.g., with collections that have large fields which change rarely and small fields that change more frequently. The smaller fields may be re-indexed in a new index and both indexes may be searched together.
Warning: It is up to you to make sure all indexes are created and modified the same way. For example, if you add documents to one index, you need to add the same documents in the same order to the other indexes. Failure to do so will result in undefined behavior.
Nested Class Summary |
---|
Nested classes/interfaces inherited from class org.apache.lucene.index.IndexReader |
---|
IndexReader.FieldOption, IndexReader.ReaderFinishedListener |
Field Summary |
---|
Fields inherited from class org.apache.lucene.index.IndexReader |
---|
hasChanges, readerFinishedListeners |
Constructor Summary | |
---|---|
ParallelReader()
Construct a ParallelReader. |
|
ParallelReader(boolean closeSubReaders)
Construct a ParallelReader. |
Method Summary | |
---|---|
void |
add(IndexReader reader)
Add an IndexReader. |
void |
add(IndexReader reader,
boolean ignoreStoredFields)
Add an IndexReader whose stored fields will not be returned. |
void |
addReaderFinishedListener(IndexReader.ReaderFinishedListener listener)
Expert: adds a IndexReader.ReaderFinishedListener . |
Object |
clone()
Efficiently clones the IndexReader (sharing most internal state). |
int |
docFreq(Term term)
Returns the number of documents containing the term t . |
protected void |
doClose()
Implements close. |
protected void |
doCommit(Map<String,String> commitUserData)
Implements commit. |
Document |
document(int n,
FieldSelector fieldSelector)
Get the Document at the n
th position. |
protected void |
doDelete(int n)
Implements deletion of the document numbered docNum . |
protected IndexReader |
doReopen(boolean doClone)
|
protected void |
doSetNorm(int n,
String field,
byte value)
Implements setNorm in subclass. |
protected void |
doUndeleteAll()
Implements actual undeleteAll() in subclass. |
Collection<String> |
getFieldNames(IndexReader.FieldOption fieldNames)
Get a list of unique field names that exist in this index and have the specified field option information. |
TermFreqVector |
getTermFreqVector(int n,
String field)
Return a term frequency vector for the specified document and field. |
void |
getTermFreqVector(int docNumber,
String field,
TermVectorMapper mapper)
Load the Term Vector into a user-defined data structure instead of relying on the parallel arrays of the TermFreqVector . |
void |
getTermFreqVector(int docNumber,
TermVectorMapper mapper)
Map all the term vectors for all fields in a Document |
TermFreqVector[] |
getTermFreqVectors(int n)
Return an array of term frequency vectors for the specified document. |
long |
getVersion()
Not implemented. |
boolean |
hasDeletions()
Returns true if any documents have been deleted |
boolean |
hasNorms(String field)
Returns true if there are norms stored for this field. |
boolean |
isCurrent()
Checks recursively if all subreaders are up to date. |
boolean |
isDeleted(int n)
Returns true if document n has been deleted |
boolean |
isOptimized()
Checks recursively if all subindexes are optimized |
int |
maxDoc()
Returns one greater than the largest possible document number. |
byte[] |
norms(String field)
Returns the byte-encoded normalization factor for the named field of every document. |
void |
norms(String field,
byte[] result,
int offset)
Reads the byte-encoded normalization factor for the named field of every document. |
int |
numDocs()
Returns the number of documents in this index. |
void |
removeReaderFinishedListener(IndexReader.ReaderFinishedListener listener)
Expert: remove a previously added IndexReader.ReaderFinishedListener . |
IndexReader |
reopen()
Tries to reopen the subreaders. |
TermDocs |
termDocs()
Returns an unpositioned TermDocs enumerator. |
TermDocs |
termDocs(Term term)
Returns an enumeration of all the documents which contain term . |
TermPositions |
termPositions()
Returns an unpositioned TermPositions enumerator. |
TermPositions |
termPositions(Term term)
Returns an enumeration of all the documents which contain term . |
TermEnum |
terms()
Returns an enumeration of all the terms in the index. |
TermEnum |
terms(Term term)
Returns an enumeration of all terms starting at a given term. |
String |
toString()
|
Methods inherited from class org.apache.lucene.index.IndexReader |
---|
acquireWriteLock, clone, close, commit, commit, decRef, deleteDocument, deleteDocuments, directory, document, ensureOpen, flush, flush, getCommitUserData, getCommitUserData, getCoreCacheKey, getCurrentVersion, getDeletesCacheKey, getIndexCommit, getRefCount, getSequentialSubReaders, getTermInfosIndexDivisor, getUniqueTermCount, incRef, indexExists, lastModified, listCommits, main, notifyReaderFinishedListeners, numDeletedDocs, open, open, open, open, open, open, open, open, readerFinished, reopen, reopen, reopen, setNorm, setNorm, undeleteAll |
Methods inherited from class java.lang.Object |
---|
equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Constructor Detail |
---|
public ParallelReader() throws IOException
Note that all subreaders are closed if this ParallelReader is closed.
IOException
public ParallelReader(boolean closeSubReaders) throws IOException
closeSubReaders
- indicates whether the subreaders should be closed
when this ParallelReader is closed
IOException
Method Detail |
---|
public String toString()
toString
in class IndexReader
public void add(IndexReader reader) throws IOException
IOException
- if there is a low-level IO errorpublic void add(IndexReader reader, boolean ignoreStoredFields) throws IOException
IllegalArgumentException
- if not all indexes contain the same number
of documents
IllegalArgumentException
- if not all indexes have the same value
of IndexReader.maxDoc()
IOException
- if there is a low-level IO errorpublic Object clone()
IndexReader
On cloning a reader with pending changes (deletions, norms), the original reader transfers its write lock to the cloned reader. This means only the cloned reader may make further changes to the index, and commit the changes to the index on close, but the old reader still reflects all changes made up until it was cloned.
Like IndexReader.reopen()
, it's safe to make changes to
either the original or the cloned reader: all shared
mutable state obeys "copy on write" semantics to ensure
the changes are not seen by other readers.
clone
in class IndexReader
public IndexReader reopen() throws CorruptIndexException, IOException
A re-opened instance might share one or more subreaders with the old
instance. Index modification operations result in undefined behavior
when performed before the old instance is closed.
(see IndexReader.reopen()
).
If subreaders are shared, then the reference count of those readers is increased to ensure that the subreaders remain open until the last referring reader is closed.
reopen
in class IndexReader
CorruptIndexException
- if the index is corrupt
IOException
- if there is a low-level IO errorprotected IndexReader doReopen(boolean doClone) throws CorruptIndexException, IOException
CorruptIndexException
IOException
public int numDocs()
IndexReader
numDocs
in class IndexReader
public int maxDoc()
IndexReader
maxDoc
in class IndexReader
public boolean hasDeletions()
IndexReader
hasDeletions
in class IndexReader
public boolean isDeleted(int n)
IndexReader
isDeleted
in class IndexReader
protected void doDelete(int n) throws CorruptIndexException, IOException
IndexReader
docNum
.
Applications should call IndexReader.deleteDocument(int)
or IndexReader.deleteDocuments(Term)
.
doDelete
in class IndexReader
CorruptIndexException
IOException
protected void doUndeleteAll() throws CorruptIndexException, IOException
IndexReader
doUndeleteAll
in class IndexReader
CorruptIndexException
IOException
public Document document(int n, FieldSelector fieldSelector) throws CorruptIndexException, IOException
IndexReader
Document
at the n
th position. The FieldSelector
may be used to determine
what Field
s to load and how they should
be loaded. NOTE: If this Reader (more specifically, the underlying
FieldsReader
) is closed before the lazy
Field
is loaded an exception may be
thrown. If you want the value of a lazy
Field
to be available after closing you
must explicitly load it or fetch the Document again with a new loader.
NOTE: for performance reasons, this method does not check if the
requested document is deleted, and therefore asking for a deleted document
may yield unspecified results. Usually this is not required, however you
can call IndexReader.isDeleted(int)
with the requested document ID to verify
the document is not deleted.
document
in class IndexReader
n
- Get the document at the n
th positionfieldSelector
- The FieldSelector
to use to determine what
Fields should be loaded on the Document. May be null, in which case
all Fields will be loaded.
Document
at the nth position
CorruptIndexException
- if the index is corrupt
IOException
- if there is a low-level IO errorFieldable
,
FieldSelector
,
SetBasedFieldSelector
,
LoadFirstFieldSelector
public TermFreqVector[] getTermFreqVectors(int n) throws IOException
IndexReader
TermFreqVector
or of type TermPositionVector
if
positions or offsets have been stored.
getTermFreqVectors
in class IndexReader
n
- document for which term frequency vectors are returned
IOException
- if index cannot be accessedField.TermVector
public TermFreqVector getTermFreqVector(int n, String field) throws IOException
IndexReader
TermPositionVector
is returned.
getTermFreqVector
in class IndexReader
n
- document for which the term frequency vector is returnedfield
- field for which the term frequency vector is returned.
IOException
- if index cannot be accessedField.TermVector
public void getTermFreqVector(int docNumber, String field, TermVectorMapper mapper) throws IOException
IndexReader
TermFreqVector
.
getTermFreqVector
in class IndexReader
docNumber
- The number of the document to load the vector forfield
- The name of the field to loadmapper
- The TermVectorMapper
to process the vector. Must not be null
IOException
- if term vectors cannot be accessed or if they do not exist on the field and doc. specified.public void getTermFreqVector(int docNumber, TermVectorMapper mapper) throws IOException
IndexReader
getTermFreqVector
in class IndexReader
docNumber
- The number of the document to load the vector formapper
- The TermVectorMapper
to process the vector. Must not be null
IOException
- if term vectors cannot be accessed or if they do not exist on the field and doc. specified.public boolean hasNorms(String field) throws IOException
IndexReader
hasNorms
in class IndexReader
IOException
public byte[] norms(String field) throws IOException
IndexReader
norms
in class IndexReader
IOException
AbstractField.setBoost(float)
public void norms(String field, byte[] result, int offset) throws IOException
IndexReader
norms
in class IndexReader
IOException
AbstractField.setBoost(float)
protected void doSetNorm(int n, String field, byte value) throws CorruptIndexException, IOException
IndexReader
doSetNorm
in class IndexReader
CorruptIndexException
IOException
public TermEnum terms() throws IOException
IndexReader
TermEnum.next()
must be called
on the resulting enumeration before calling other methods such as
TermEnum.term()
.
terms
in class IndexReader
IOException
- if there is a low-level IO errorpublic TermEnum terms(Term term) throws IOException
IndexReader
terms
in class IndexReader
IOException
- if there is a low-level IO errorpublic int docFreq(Term term) throws IOException
IndexReader
t
.
docFreq
in class IndexReader
IOException
- if there is a low-level IO errorpublic TermDocs termDocs(Term term) throws IOException
IndexReader
term
. For each document, the document number, the frequency of
the term in that document is also provided, for use in
search scoring. If term is null, then all non-deleted
docs are returned with freq=1.
Thus, this method implements the mapping:
The enumeration is ordered by document number. Each document number is greater than all that precede it in the enumeration.
termDocs
in class IndexReader
IOException
- if there is a low-level IO errorpublic TermDocs termDocs() throws IOException
IndexReader
TermDocs
enumerator.
Note: the TermDocs returned is unpositioned. Before using it, ensure
that you first position it with TermDocs.seek(Term)
or
TermDocs.seek(TermEnum)
.
termDocs
in class IndexReader
IOException
- if there is a low-level IO errorpublic TermPositions termPositions(Term term) throws IOException
IndexReader
term
. For each document, in addition to the document number
and frequency of the term in that document, a list of all of the ordinal
positions of the term in the document is available. Thus, this method
implements the mapping:
This positional information facilitates phrase and proximity searching.
The enumeration is ordered by document number. Each document number is greater than all that precede it in the enumeration.
termPositions
in class IndexReader
IOException
- if there is a low-level IO errorpublic TermPositions termPositions() throws IOException
IndexReader
TermPositions
enumerator.
termPositions
in class IndexReader
IOException
- if there is a low-level IO errorpublic boolean isCurrent() throws CorruptIndexException, IOException
isCurrent
in class IndexReader
CorruptIndexException
- if the index is corrupt
IOException
- if there is a low-level IO errorpublic boolean isOptimized()
isOptimized
in class IndexReader
true
if the index is optimized; false
otherwisepublic long getVersion()
getVersion
in class IndexReader
UnsupportedOperationException
protected void doCommit(Map<String,String> commitUserData) throws IOException
IndexReader
doCommit
in class IndexReader
IOException
protected void doClose() throws IOException
IndexReader
doClose
in class IndexReader
IOException
public Collection<String> getFieldNames(IndexReader.FieldOption fieldNames)
IndexReader
getFieldNames
in class IndexReader
fieldNames
- specifies which field option should be available for the returned fields
IndexReader.FieldOption
public void addReaderFinishedListener(IndexReader.ReaderFinishedListener listener)
IndexReader
IndexReader.ReaderFinishedListener
. The
provided listener is also added to any sub-readers, if
this is a composite reader. Also, any reader reopened
or cloned from this one will also copy the listeners at
the time of reopen.
addReaderFinishedListener
in class IndexReader
public void removeReaderFinishedListener(IndexReader.ReaderFinishedListener listener)
IndexReader
IndexReader.ReaderFinishedListener
.
removeReaderFinishedListener
in class IndexReader
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |