public class DemoHTMLParser extends Object implements HTMLParser
Modifier and Type | Class and Description |
---|---|
static class |
DemoHTMLParser.Parser
The actual parser to read HTML documents
|
Constructor and Description |
---|
DemoHTMLParser() |
Modifier and Type | Method and Description |
---|---|
DocData |
parse(DocData docData,
String name,
Date date,
InputSource source,
TrecContentSource trecSrc) |
DocData |
parse(DocData docData,
String name,
Date date,
Reader reader,
TrecContentSource trecSrc)
Parse the input Reader and return DocData.
|
public DocData parse(DocData docData, String name, Date date, Reader reader, TrecContentSource trecSrc) throws IOException
HTMLParser
parse
in interface HTMLParser
docData
- result reusedname
- name of the result doc data.date
- date of the result doc data. If null, attempt to set by parsed data.reader
- reader of html text to parse.trecSrc
- the TrecContentSource
used to parse dates.IOException
- If there is a low-level I/O error.public DocData parse(DocData docData, String name, Date date, InputSource source, TrecContentSource trecSrc) throws IOException, SAXException
IOException
SAXException
Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.