public interface HTMLParser
DocData parse(DocData docData, String name, Date date, String title, Reader reader, DateFormat dateFormat) throws IOException, InterruptedException
Parameters:
docData - result reused
name - name of the result doc data.
date - date of the result doc data. If null, attempt to set by parsed data.
title - title of the result doc data. If null, attempt to set by parsed data.
reader - reader of html text to parse.
dateFormat - date formatter to use for extracting the date.
Throws:
IOException
InterruptedException
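
The contract above can be illustrated with a minimal sketch. Everything here other than the parse signature is an assumption made for illustration: the DocData class is a simplified stand-in with plain fields, NaiveHtmlParser is a hypothetical regex-based implementation (not how any real parser works), and reading the date from a <meta name="date"> tag is an assumed convention. The sketch only shows the documented behaviour: the passed-in docData is reused, and date and title are filled from the parsed data when they are null.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.text.DateFormat;
import java.text.ParseException;
import java.util.Date;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class HtmlParserSketch {

    // Hypothetical, simplified stand-in for the real DocData class.
    static class DocData {
        String name;
        Date date;
        String title;
        String body;
    }

    // Local copy of the interface as documented above.
    interface HTMLParser {
        DocData parse(DocData docData, String name, Date date, String title,
                      Reader reader, DateFormat dateFormat)
                throws IOException, InterruptedException;
    }

    // Naive regex-based implementation, for illustration only.
    static class NaiveHtmlParser implements HTMLParser {
        private static final Pattern TITLE = Pattern.compile(
                "<title>(.*?)</title>", Pattern.CASE_INSENSITIVE | Pattern.DOTALL);
        // Assumed convention: the date lives in <meta name="date" content="...">.
        private static final Pattern META_DATE = Pattern.compile(
                "<meta\\s+name=\"date\"\\s+content=\"([^\"]*)\"", Pattern.CASE_INSENSITIVE);
        private static final Pattern TAG = Pattern.compile("<[^>]*>");

        @Override
        public DocData parse(DocData docData, String name, Date date, String title,
                             Reader reader, DateFormat dateFormat)
                throws IOException, InterruptedException {
            // Read the whole HTML text from the reader.
            StringBuilder sb = new StringBuilder();
            char[] buf = new char[1024];
            int n;
            while ((n = reader.read(buf)) != -1) {
                sb.append(buf, 0, n);
            }
            String html = sb.toString();

            // If no title was given, attempt to set it from the parsed data.
            if (title == null) {
                Matcher m = TITLE.matcher(html);
                if (m.find()) {
                    title = m.group(1).trim();
                }
            }

            // If no date was given, attempt to extract one with dateFormat.
            if (date == null && dateFormat != null) {
                Matcher m = META_DATE.matcher(html);
                if (m.find()) {
                    try {
                        date = dateFormat.parse(m.group(1).trim());
                    } catch (ParseException e) {
                        // Unparseable date: leave it null.
                    }
                }
            }

            // Drop the title element, strip remaining tags, collapse whitespace.
            String text = TITLE.matcher(html).replaceAll("");
            text = TAG.matcher(text).replaceAll(" ");
            text = text.replaceAll("\\s+", " ").trim();

            // Fill in and return the same DocData instance that was passed in.
            docData.name = name;
            docData.date = date;
            docData.title = title;
            docData.body = text;
            return docData;
        }
    }

    public static void main(String[] args) throws Exception {
        String html = "<html><head><title>Hello</title></head>"
                + "<body><p>Some text</p></body></html>";
        DocData result = new NaiveHtmlParser().parse(
                new DocData(), "doc1", null, null, new StringReader(html), null);
        System.out.println(result.name + " | " + result.title + " | " + result.body);
    }
}
```

Returning the same docData instance that was passed in is what "result reused" suggests: a caller can keep one DocData per thread and avoid allocating a new object for every parsed document.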