| java.lang.Object org.archive.crawler.io.CrawlerJournal org.archive.crawler.frontier.RecoveryJournal
RecoveryJournal | public class RecoveryJournal extends CrawlerJournal implements FrontierJournal(Code) | | Helper class for managing a simple Frontier change-events journal which is
useful for recovering from crawl problems.
By replaying the journal into a new Frontier, its state (at least with
respect to URIs alreadyIncluded and in pending queues) will match that of the
original Frontier, allowing a pseudo-resume of a previous crawl, at least as
far as URI visitation/coverage is concerned.
author: gojomo |
RecoveryJournal | public RecoveryJournal(String path, String filename) throws IOException(Code) | | Create a new recovery journal at the given location
Parameters: path - Directory to make the recovery journal in. Parameters: filename - Name to use for recovery journal file. throws: IOException - |
finishedFailure | public void finishedFailure(UURI uuri)(Code) | | |
finishedSuccess | public void finishedSuccess(UURI uuri)(Code) | | |
finishedSuccess | protected void finishedSuccess(String uuri)(Code) | | |
|
|