| java.lang.Object org.archive.crawler.url.Canonicalizer
Canonicalizer | public class Canonicalizer (Code) | | URL canonicalizer.
author: stack version: $Date: 2006-09-26 20:38:48 +0000 (Tue, 26 Sep 2006) $, $Revision: 4667 $ |
Method Summary | |
public static String | canonicalize(UURI uuri, CrawlOrder order) Convenience method that is passed a settings object instance pulling
from it what it needs to canonicalize.
Parameters: uuri - UURI to canonicalize. Parameters: order - A crawlorder instance. | public static String | canonicalize(UURI uuri, Iterator rules) Run the passed uuri through the list of rules.
Parameters: uuri - Url to canonicalize. Parameters: rules - Iterator of canonicalization rules to apply (Get oneof these on the url-canonicalizer-rules element in order files orcreate a list externally). |
canonicalize | public static String canonicalize(UURI uuri, CrawlOrder order)(Code) | | Convenience method that is passed a settings object instance pulling
from it what it needs to canonicalize.
Parameters: uuri - UURI to canonicalize. Parameters: order - A crawlorder instance. Canonicalized string of uuri else uuri if an error. |
canonicalize | public static String canonicalize(UURI uuri, Iterator rules)(Code) | | Run the passed uuri through the list of rules.
Parameters: uuri - Url to canonicalize. Parameters: rules - Iterator of canonicalization rules to apply (Get oneof these on the url-canonicalizer-rules element in order files orcreate a list externally). Rules must implement the Rule interface. Canonicalized URL. |
|
|