Class TrecParserByPath

  • public class TrecParserByPath
    extends TrecDocParser
    Parser for TREC documents that selects the parser to apply according to the source file's path, defaulting to TrecGov2Parser.
    • Constructor Detail

      • TrecParserByPath

        public TrecParserByPath()
    • Method Detail

      • parse

        public DocData parse(DocData docData,
                             String name,
                             TrecContentSource trecSrc,
                             StringBuilder docBuf,
                             TrecDocParser.ParsePathType pathType)
                      throws IOException
        Description copied from class: TrecDocParser
        Parses the text prepared in docBuf into a result DocData. No synchronization is required.
        Specified by:
        parse in class TrecDocParser
        Parameters:
        docData - reusable result
        name - name to set on the result
        trecSrc - calling TREC content source
        docBuf - text to parse
        pathType - type of the parsed file, or null if unknown; parsers may use it to alter their behavior according to the file path type.
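
The class description says the parser is chosen from the source file's path, with TrecGov2Parser as the default. The following is a minimal, self-contained sketch of that dispatch idea, not the Lucene implementation. The names PathDispatchSketch, PathType, PARSERS, and DEFAULT_TYPE are hypothetical, the string-tagging "parsers" stand in for real TrecDocParser subclasses, and treating a null path type as the default is an assumption made here for illustration.

```java
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.EnumMap;
import java.util.Locale;
import java.util.Map;
import java.util.function.UnaryOperator;

// Illustrative sketch only: none of these names are Lucene's API.
class PathDispatchSketch {

  // Hypothetical stand-in for TrecDocParser.ParsePathType.
  enum PathType { GOV2, FBIS, LATIMES }

  // Assumed fallback, echoing the documented default of TrecGov2Parser.
  static final PathType DEFAULT_TYPE = PathType.GOV2;

  // Hypothetical parsers: each prefixes the text so the dispatch is visible.
  static final Map<PathType, UnaryOperator<String>> PARSERS =
      new EnumMap<>(PathType.class);

  static {
    for (PathType type : PathType.values()) {
      String tag = type.name().toLowerCase(Locale.ROOT);
      PARSERS.put(type, text -> tag + ":" + text);
    }
  }

  // Scans path components from the file name upward and returns the first
  // one that names a known type, or null if none does.
  static PathType pathType(Path file) {
    for (int i = file.getNameCount() - 1; i >= 0; i--) {
      String part = file.getName(i).toString();
      for (PathType type : PathType.values()) {
        if (type.name().equalsIgnoreCase(part)) {
          return type;
        }
      }
    }
    return null;
  }

  // Delegates to the parser registered for the path type; null means default.
  static String parse(String text, PathType pathType) {
    PathType effective = (pathType == null) ? DEFAULT_TYPE : pathType;
    return PARSERS.get(effective).apply(text);
  }

  public static void main(String[] args) {
    Path known = Paths.get("trec", "latimes", "la010189.gz");
    Path unknown = Paths.get("data", "misc", "doc.txt");
    System.out.println(parse("text", pathType(known)));   // latimes parser
    System.out.println(parse("text", pathType(unknown))); // default parser
  }
}
```

Keeping the path-to-type lookup separate from the type-to-parser map means a caller can resolve the type once per file and reuse it for every document in that file.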