|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.apache.lucene.benchmark.byTask.feeds.ContentSource org.apache.lucene.benchmark.byTask.feeds.TrecContentSource
public class TrecContentSource
Implements a ContentSource
over the TREC collection.
Supports the following configuration parameters (on top of
ContentSource
):
TrecDocParser
class to use for
parsing the TREC documents content (default=TrecGov2Parser).
HTMLParser
class to use for
parsing the HTML parts of the TREC documents content (default=DemoHTMLParser).
Field Summary | |
---|---|
static String |
DOC
|
static String |
DOCNO
|
static String |
NEW_LINE
separator between lines in the byffer |
static String |
TERMINATING_DOC
|
static String |
TERMINATING_DOCNO
|
Fields inherited from class org.apache.lucene.benchmark.byTask.feeds.ContentSource |
---|
encoding, forever, logStep, verbose |
Constructor Summary | |
---|---|
TrecContentSource()
|
Method Summary | |
---|---|
void |
close()
Called when reading from this content source is no longer required. |
DocData |
getNextDocData(DocData docData)
Returns the next DocData from the content source. |
Date |
parseDate(String dateStr)
|
void |
resetInputs()
Resets the input for this content source, so that the test would behave as if it was just started, input-wise. |
void |
setConfig(Config config)
Sets the Config for this content source. |
Methods inherited from class org.apache.lucene.benchmark.byTask.feeds.ContentSource |
---|
addBytes, addDoc, collectFiles, getBytesCount, getConfig, getDocsCount, getTotalBytesCount, getTotalDocsCount, shouldLog |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final String DOCNO
public static final String TERMINATING_DOCNO
public static final String DOC
public static final String TERMINATING_DOC
public static final String NEW_LINE
Constructor Detail |
---|
public TrecContentSource()
Method Detail |
---|
public Date parseDate(String dateStr)
public void close() throws IOException
ContentSource
close
in class ContentSource
IOException
public DocData getNextDocData(DocData docData) throws NoMoreDataException, IOException
ContentSource
DocData
from the content source.
getNextDocData
in class ContentSource
NoMoreDataException
IOException
public void resetInputs() throws IOException
ContentSource
NOTE: the default implementation resets the number of bytes and documents generated since the last reset, so it's important to call super.resetInputs in case you override this method.
resetInputs
in class ContentSource
IOException
public void setConfig(Config config)
ContentSource
Config
for this content source. If you override this
method, you must call super.setConfig.
setConfig
in class ContentSource
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |