|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.apache.lucene.index.MergePolicy org.apache.lucene.index.TieredMergePolicy
public class TieredMergePolicy
Merges segments of approximately equal size, subject to
an allowed number of segments per tier. This is similar
to LogByteSizeMergePolicy
, except this merge
policy is able to merge non-adjacent segment, and
separates how many segments are merged at once (setMaxMergeAtOnce(int)
) from how many segments are allowed
per tier (setSegmentsPerTier(double)
). This merge
policy also does not over-merge (ie, cascade merges).
For normal merging, this policy first computes a "budget" of how many segments are allowed by be in the index. If the index is over-budget, then the policy sorts segments by decresing size (pro-rating by percent deletes), and then finds the least-cost merge. Merge cost is measured by a combination of the "skew" of the merge (size of largest seg divided by smallest seg), total merge size and pct deletes reclaimed, so that merges with lower skew, smaller size and those reclaiming more deletes, are favored.
If a merge will produce a segment that's larger than
setMaxMergedSegmentMB(double)
, then the policy will
merge fewer segments (down to 1 at once, if that one has
deletions) to keep the segment size under budget.
NOTE: this policy freely merges non-adjacent
segments; if this is a problem, use LogMergePolicy
.
NOTE: This policy always merges by byte size
of the segments, always pro-rates by percent deletes,
and does not apply any maximum segment size during
optimize (unlike LogByteSizeMergePolicy
.
Nested Class Summary | |
---|---|
protected static class |
TieredMergePolicy.MergeScore
|
Nested classes/interfaces inherited from class org.apache.lucene.index.MergePolicy |
---|
MergePolicy.MergeAbortedException, MergePolicy.MergeException, MergePolicy.MergeSpecification, MergePolicy.OneMerge |
Field Summary |
---|
Fields inherited from class org.apache.lucene.index.MergePolicy |
---|
writer |
Constructor Summary | |
---|---|
TieredMergePolicy()
|
Method Summary | |
---|---|
void |
close()
Release all resources for the policy. |
MergePolicy.MergeSpecification |
findMerges(SegmentInfos infos)
Determine what set of merge operations are now necessary on the index. |
MergePolicy.MergeSpecification |
findMergesForOptimize(SegmentInfos infos,
int maxSegmentCount,
Set<SegmentInfo> segmentsToOptimize)
Determine what set of merge operations is necessary in order to optimize the index. |
MergePolicy.MergeSpecification |
findMergesToExpungeDeletes(SegmentInfos infos)
Determine what set of merge operations is necessary in order to expunge all deletes from the index. |
double |
getExpungeDeletesPctAllowed()
|
double |
getFloorSegmentMB()
|
int |
getMaxMergeAtOnce()
|
int |
getMaxMergeAtOnceExplicit()
|
double |
getMaxMergedSegmentMB()
|
double |
getNoCFSRatio()
|
double |
getSegmentsPerTier()
|
boolean |
getUseCompoundFile()
|
protected TieredMergePolicy.MergeScore |
score(List<SegmentInfo> candidate,
boolean hitTooLarge,
long mergingBytes)
Expert: scores one merge; subclasses can override. |
TieredMergePolicy |
setExpungeDeletesPctAllowed(double v)
When expungeDeletes is called, we only merge away a segment if its delete percentage is over this threshold. |
TieredMergePolicy |
setFloorSegmentMB(double v)
Segments smaller than this are "rounded up" to this size, ie treated as equal (floor) size for merge selection. |
TieredMergePolicy |
setMaxMergeAtOnce(int v)
Maximum number of segments to be merged at a time during "normal" merging. |
TieredMergePolicy |
setMaxMergeAtOnceExplicit(int v)
Maximum number of segments to be merged at a time, during optimize or expungeDeletes. |
TieredMergePolicy |
setMaxMergedSegmentMB(double v)
Maximum sized segment to produce during normal merging. |
TieredMergePolicy |
setNoCFSRatio(double noCFSRatio)
If a merged segment will be more than this percentage of the total size of the index, leave the segment as non-compound file even if compound file is enabled. |
TieredMergePolicy |
setSegmentsPerTier(double v)
Sets the allowed number of segments per tier. |
TieredMergePolicy |
setUseCompoundFile(boolean useCompoundFile)
Sets whether compound file format should be used for newly flushed and newly merged segments. |
String |
toString()
|
boolean |
useCompoundFile(SegmentInfos infos,
SegmentInfo mergedInfo)
Returns true if a new segment (regardless of its origin) should use the compound file format. |
Methods inherited from class org.apache.lucene.index.MergePolicy |
---|
setIndexWriter |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Constructor Detail |
---|
public TieredMergePolicy()
Method Detail |
---|
public TieredMergePolicy setMaxMergeAtOnce(int v)
setMaxMergeAtOnceExplicit(int)
. Default is 10.
public int getMaxMergeAtOnce()
setMaxMergeAtOnce(int)
public TieredMergePolicy setMaxMergeAtOnceExplicit(int v)
public int getMaxMergeAtOnceExplicit()
setMaxMergeAtOnceExplicit(int)
public TieredMergePolicy setMaxMergedSegmentMB(double v)
public double getMaxMergedSegmentMB()
getMaxMergedSegmentMB()
public TieredMergePolicy setFloorSegmentMB(double v)
public double getFloorSegmentMB()
setFloorSegmentMB(double)
public TieredMergePolicy setExpungeDeletesPctAllowed(double v)
public double getExpungeDeletesPctAllowed()
setExpungeDeletesPctAllowed(double)
public TieredMergePolicy setSegmentsPerTier(double v)
public double getSegmentsPerTier()
setSegmentsPerTier(double)
public TieredMergePolicy setUseCompoundFile(boolean useCompoundFile)
public boolean getUseCompoundFile()
setUseCompoundFile(boolean)
public TieredMergePolicy setNoCFSRatio(double noCFSRatio)
public double getNoCFSRatio()
setNoCFSRatio(double)
public MergePolicy.MergeSpecification findMerges(SegmentInfos infos) throws IOException
MergePolicy
IndexWriter
calls this whenever there is a change to the segments.
This call is always synchronized on the IndexWriter
instance so
only one thread at a time will call this method.
findMerges
in class MergePolicy
infos
- the total set of segments in the index
IOException
protected TieredMergePolicy.MergeScore score(List<SegmentInfo> candidate, boolean hitTooLarge, long mergingBytes) throws IOException
IOException
public MergePolicy.MergeSpecification findMergesForOptimize(SegmentInfos infos, int maxSegmentCount, Set<SegmentInfo> segmentsToOptimize) throws IOException
MergePolicy
IndexWriter
calls this when its
IndexWriter.optimize()
method is called. This call is always
synchronized on the IndexWriter
instance so only one thread at a
time will call this method.
findMergesForOptimize
in class MergePolicy
infos
- the total set of segments in the indexmaxSegmentCount
- requested maximum number of segments in the index (currently this
is always 1)segmentsToOptimize
- contains the specific SegmentInfo instances that must be merged
away. This may be a subset of all SegmentInfos.
IOException
public MergePolicy.MergeSpecification findMergesToExpungeDeletes(SegmentInfos infos) throws CorruptIndexException, IOException
MergePolicy
findMergesToExpungeDeletes
in class MergePolicy
infos
- the total set of segments in the index
CorruptIndexException
IOException
public boolean useCompoundFile(SegmentInfos infos, SegmentInfo mergedInfo) throws IOException
MergePolicy
useCompoundFile
in class MergePolicy
IOException
public void close()
MergePolicy
close
in interface Closeable
close
in class MergePolicy
public String toString()
toString
in class Object
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |