Class TrecFBISParser


  • public class TrecFBISParser
    extends TrecDocParser
    Parser for the FBIS docs in trec disks 4+5 collection format
    • Constructor Detail

      • TrecFBISParser

        public TrecFBISParser()
    • Method Detail

      • parse

        public DocData parse​(DocData docData,
                             String name,
                             TrecContentSource trecSrc,
                             StringBuilder docBuf,
                             TrecDocParser.ParsePathType pathType)
                      throws IOException
        Description copied from class: TrecDocParser
        parse the text prepared in docBuf into a result DocData, no synchronization is required.
        Specified by:
        parse in class TrecDocParser
        Parameters:
        docData - reusable result
        name - name that should be set to the result
        trecSrc - calling trec content source
        docBuf - text to parse
        pathType - type of parsed file, or null if unknown - may be used by parsers to alter their behavior according to the file path type.
        Throws:
        IOException