org.apache.lucene.benchmark.utils
Classes 
ExtractReuters
ExtractWikipedia