Class ExtractWikipedia
- java.lang.Object
-
- org.apache.lucene.benchmark.utils.ExtractWikipedia
-
public class ExtractWikipedia extends Object
Extract the downloaded Wikipedia dump into separate files for indexing.
-
-
Constructor Summary
Constructors Constructor Description ExtractWikipedia(DocMaker docMaker, Path outputDir)
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description void
create(String id, String title, String time, String body)
Path
directory(int count, Path directory)
void
extract()
static void
main(String[] args)
-
-
-
Field Detail
-
count
public static int count
-
docMaker
protected DocMaker docMaker
-
-
Constructor Detail
-
ExtractWikipedia
public ExtractWikipedia(DocMaker docMaker, Path outputDir) throws IOException
- Throws:
IOException
-
-
Method Detail
-
create
public void create(String id, String title, String time, String body) throws IOException
- Throws:
IOException
-
-