public interface HTMLParser
DocData parse(DocData docData, String name, Date date, Reader reader, TrecContentSource trecSrc) throws IOException
docData - result reused
name - name of the result doc data.
date - date of the result doc data. If null, attempt to set by parsed data.
reader - reader of html text to parse.
trecSrc - the TrecContentSource used to parse dates.
IOException - If there is a low-level I/O error.
Copyright © 2000-2012 Apache Software Foundation. All Rights Reserved.
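The contract of `parse` described above (reuse the passed-in result object, fall back to a parsed date when `date` is null, read the HTML from `reader`) can be illustrated with a minimal sketch. To stay self-contained it does not depend on the real library: `DocDataStub`, `DateSourceStub`, `HtmlParserStub`, and `TagStrippingParser` are hypothetical stand-ins invented for this example, and the regex-based tag stripping and `<meta name="date">` lookup are assumptions, not the behavior of any shipped implementation.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.Date;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class HtmlParserSketch {

    /** Hypothetical stand-in for DocData; holds only what this sketch sets. */
    static class DocDataStub {
        String name;
        Date date;
        String body;
    }

    /** Hypothetical stand-in for TrecContentSource, reduced to date parsing. */
    interface DateSourceStub {
        Date parseDate(String text);
    }

    /** Mirrors the parameter order of HTMLParser.parse using the stand-ins. */
    interface HtmlParserStub {
        DocDataStub parse(DocDataStub docData, String name, Date date,
                          Reader reader, DateSourceStub trecSrc) throws IOException;
    }

    /** Naive parser: strips tags with a regex (an assumption, not real HTML parsing). */
    static class TagStrippingParser implements HtmlParserStub {
        private static final Pattern META_DATE = Pattern.compile(
                "<meta\\s+name=\"date\"\\s+content=\"([^\"]*)\"",
                Pattern.CASE_INSENSITIVE);

        @Override
        public DocDataStub parse(DocDataStub docData, String name, Date date,
                                 Reader reader, DateSourceStub trecSrc) throws IOException {
            // Read the whole HTML text; read() may throw a low-level IOException.
            StringBuilder html = new StringBuilder();
            char[] buf = new char[1024];
            int n;
            while ((n = reader.read(buf)) != -1) {
                html.append(buf, 0, n);
            }
            // If no date was supplied, attempt to set it from the parsed data.
            if (date == null) {
                Matcher m = META_DATE.matcher(html);
                if (m.find()) {
                    date = trecSrc.parseDate(m.group(1));
                }
            }
            // Fill in and return the same object that was passed in (result reused).
            docData.name = name;
            docData.date = date;
            docData.body = html.toString()
                    .replaceAll("<[^>]*>", " ")
                    .replaceAll("\\s+", " ")
                    .trim();
            return docData;
        }
    }

    public static void main(String[] args) throws IOException {
        DocDataStub reused = new DocDataStub();
        DocDataStub out = new TagStrippingParser().parse(
                reused, "doc1", null,
                new StringReader("<html><body><p>Hello <b>world</b></p></body></html>"),
                text -> new Date(0L));
        System.out.println(out.name + ": " + out.body); // prints "doc1: Hello world"
    }
}
```

Returning the same instance that was passed in lets a caller allocate one result object and reuse it across many documents, which is the point of the "result reused" parameter.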