| java.lang.Object java.io.InputStream org.archive.io.ArchiveRecord org.archive.io.warc.WARCRecord
Constructor Summary | |
public | WARCRecord(InputStream in, String identifier, long offset) Constructor. | public | WARCRecord(InputStream in, ArchiveRecordHeader headers) Constructor. | public | WARCRecord(InputStream in, String identifier, long offset, boolean digest, boolean strict) Constructor.
Parameters: in - Stream cue'd up to be at the start of the record this instanceis to represent or, if headers is not null, just past theHeader Line and Named Fields. Parameters: identifier - Identifier for this the hosting Reader. Parameters: offset - Current offset into in (Used to keepposition properly aligned). |
WARCRecord | public WARCRecord(InputStream in, String identifier, long offset) throws IOException(Code) | | Constructor.
Parameters: in - Stream cue'd up to be at the start of the record this instanceis to represent. throws: IOException - |
WARCRecord | public WARCRecord(InputStream in, String identifier, long offset, boolean digest, boolean strict) throws IOException(Code) | | Constructor.
Parameters: in - Stream cue'd up to be at the start of the record this instanceis to represent or, if headers is not null, just past theHeader Line and Named Fields. Parameters: identifier - Identifier for this the hosting Reader. Parameters: offset - Current offset into in (Used to keepposition properly aligned). Usually 0. Parameters: digest - True if we're to calculate digest for this record. Notdigesting saves about ~15% of cpu during parse. Parameters: strict - Be strict parsing (Parsing stops if file inproperlyformatted). throws: IOException - |
parseHeaders | protected ArchiveRecordHeader parseHeaders(InputStream in, String identifier, long offset, boolean strict) throws IOException(Code) | | Parse WARC Header Line and Named Fields.
Parameters: in - Stream to read. Parameters: identifier - Identifier for the hosting Reader. Parameters: offset - Absolute offset into Reader. Parameters: strict - Whether to be loose parsing or not. An ArchiveRecordHeader. throws: IOException - |
|
|