org.archive.crawler.processor.recrawl |
|
Java Source File Name | Type | Comment |
FetchHistoryProcessor.java | Class | Maintain a history of fetch information inside the CrawlURI's attributes. |
PersistLoadProcessor.java | Class | Store CrawlURI attributes from latest fetch to persistent storage for
consultation by a later recrawl. |
PersistLogProcessor.java | Class | Log CrawlURI attributes from latest fetch for consultation by a later
recrawl. |
PersistOnlineProcessor.java | Class | Common superclass for persisting Processors which directly store/load
to persistence (as opposed to logging for batch load later). |
PersistProcessor.java | Class | Superclass for Processors which utilize BDB-JE for URI state
(including most notably history) persistence. |
PersistStoreProcessor.java | Class | Store CrawlURI attributes from latest fetch to persistent storage for
consultation by a later recrawl. |