Class TrecGov2Parser

  • public class TrecGov2Parser
    extends TrecDocParser
    Parser for the GOV2 collection format.
    • Constructor Detail

      • TrecGov2Parser

        public TrecGov2Parser()
    • Method Detail

      • parse

        public DocData parse(DocData docData,
                             String name,
                             TrecContentSource trecSrc,
                             StringBuilder docBuf,
                             TrecDocParser.ParsePathType pathType)
                      throws IOException
        Description copied from class: TrecDocParser
        Parses the text prepared in docBuf into a result DocData. No synchronization is required.
        Specified by:
        parse in class TrecDocParser
        Parameters:
        docData - reusable result
        name - name that should be set to the result
        trecSrc - calling trec content source
        docBuf - text to parse
        pathType - type of parsed file, or null if unknown - may be used by parsers to alter their behavior according to the file path type.
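
        The sketch below illustrates the general shape of such a parse step. It is not the TrecGov2Parser source and uses no benchmark classes. It assumes, for illustration only, that a GOV2 record begins with a <DOCHDR>...</DOCHDR> block that may contain a "Date: " line, followed by the HTML body. Gov2HeaderSketch and ParsedDoc (a stand-in for DocData) are hypothetical names, and the sample record in main is made up.

```java
import java.util.Objects;

/** Hypothetical stand-alone sketch; not the TrecGov2Parser implementation. */
class Gov2HeaderSketch {

  /** Hypothetical result holder standing in for DocData. */
  static final class ParsedDoc {
    final String name;
    final String date; // raw header value, or null when no Date line was found
    final String body; // text remaining after the header block

    ParsedDoc(String name, String date, String body) {
      this.name = name;
      this.date = date;
      this.body = body;
    }
  }

  // Assumed markers; the real collection layout may differ.
  private static final String DOCHDR = "<DOCHDR>";
  private static final String DOCHDR_END = "</DOCHDR>";
  private static final String DATE = "Date: ";

  /**
   * Splits docBuf into an optional header date and the remaining body.
   * Only local variables are used, so concurrent calls need no
   * synchronization as long as each caller passes its own docBuf.
   */
  static ParsedDoc parse(String name, StringBuilder docBuf) {
    Objects.requireNonNull(docBuf, "docBuf");
    String date = null;
    int bodyStart = 0;
    int hdrStart = docBuf.indexOf(DOCHDR);
    int hdrEnd = hdrStart < 0 ? -1 : docBuf.indexOf(DOCHDR_END, hdrStart);
    if (hdrEnd >= 0) {
      // Look for a Date line, but only inside the header block.
      int dateStart = docBuf.indexOf(DATE, hdrStart);
      if (dateStart >= 0 && dateStart < hdrEnd) {
        int valueStart = dateStart + DATE.length();
        int lineEnd = docBuf.indexOf("\n", valueStart);
        if (lineEnd < 0 || lineEnd > hdrEnd) {
          lineEnd = hdrEnd;
        }
        date = docBuf.substring(valueStart, lineEnd).trim();
      }
      // Everything after the closing header tag is treated as the body.
      bodyStart = hdrEnd + DOCHDR_END.length();
    }
    return new ParsedDoc(name, date, docBuf.substring(bodyStart).trim());
  }

  public static void main(String[] args) {
    // Made-up sample record, for illustration only.
    StringBuilder docBuf = new StringBuilder(
        "<DOCHDR>\nhttp://example.gov/\nDate: Tue, 09 Dec 2003 21:21:33 GMT\n</DOCHDR>\n"
            + "<html><body>Hello</body></html>\n");
    ParsedDoc doc = parse("GX000-00-0000000", docBuf);
    System.out.println(doc.name + " | " + doc.date + " | " + doc.body);
  }
}
```

        A buffer without a header block is returned unchanged as the body with a null date, so the same call can handle input that lacks GOV2 headers.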