public class TrecFBISParser extends TrecDocParser
TrecDocParser.ParsePathType
DEFAULT_PATH_TYPE
Constructor and Description |
---|
TrecFBISParser() |
Modifier and Type | Method and Description |
---|---|
DocData |
parse(DocData docData,
String name,
TrecContentSource trecSrc,
StringBuilder docBuf,
TrecDocParser.ParsePathType pathType)
parse the text prepared in docBuf into a result DocData,
no synchronization is required.
|
extract, pathType, stripTags, stripTags
public DocData parse(DocData docData, String name, TrecContentSource trecSrc, StringBuilder docBuf, TrecDocParser.ParsePathType pathType) throws IOException, InterruptedException
TrecDocParser
parse
in class TrecDocParser
docData
- reusable resultname
- name that should be set to the resulttrecSrc
- calling trec content sourcedocBuf
- text to parsepathType
- type of parsed file, or null if unknown - may be used by
parsers to alter their behavior according to the file path type.IOException
InterruptedException