Package org.apache.lucene.benchmark.utils
Benchmark Utility functions.
-
Class Summary Class Description ExtractReuters Split the Reuters SGML documents into Simple Text files containing: Title, Date, Dateline, BodyExtractWikipedia Extract the downloaded Wikipedia dump into separate files for indexing.