|
||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | |||||||||
| Class Summary | |
|---|---|
| ExtractReuters | Split the Reuters SGML documents into Simple Text files containing: Title, Date, Dateline, Body |
| ExtractWikipedia | Extract the downloaded Wikipedia dump into separate files for indexing. |
| NoDeletionPolicy | |
|
||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | |||||||||