| Strip any 'www' found on the hostname of http/https URLs, IF they have
some path/query component (content after the third slash). (Top-level
'slash page' URIs are left unstripped, so that we prefer crawling redundant
top pages over missing an entire site that is available only from either
the www-prefixed or the www-less hostname, but not both.)
author: stack
version: $Date: 2006-09-25 20:27:35 +0000 (Mon, 25 Sep 2006) $, $Revision: 4655 $ |
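
The rule described above can be sketched as a single regular-expression rewrite. This is a minimal illustration, not the original implementation: the pattern, the function name `strip_www_if_path`, and the choice to match `www.` case-insensitively are all assumptions made for the example.

```python
import re

# Hypothetical pattern (an assumption, not taken from the source):
#   group 1: the http/https scheme and "://"
#   then a literal "www." prefix, which is dropped
#   group 2: the rest of the host, the third slash, and at least one
#            more character of path/query content
_WWW_WITH_PATH = re.compile(r"^(https?://)www\.([^/]*/.+)$", re.IGNORECASE)


def strip_www_if_path(url: str) -> str:
    """Drop a leading 'www.' only when there is content after the third slash.

    URLs that do not match the pattern (top 'slash pages', other schemes,
    hosts without 'www.') are returned unchanged.
    """
    return _WWW_WITH_PATH.sub(r"\1\2", url, count=1)


if __name__ == "__main__":
    for candidate in (
        "http://www.example.com/a/b?x=1",  # has a path: 'www.' is stripped
        "http://www.example.com/",         # top slash page: left alone
        "http://www.example.com",          # no third slash: left alone
    ):
        print(candidate, "->", strip_www_if_path(candidate))
```

Requiring at least one character after the third slash is what keeps top pages untouched, so both `http://www.example.com/` and `http://example.com/` remain distinct in this sketch.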