| java.lang.Object com.knowgate.lucene.Crawler
Crawler | public class Crawler (Code) | | Simple HTML crawler for Lucene
author: Sergio Montoro Ten version: 1.0 See Also: http://jakarta.apache.org/lucene/docs/index.html |
Constructor Summary | |
public | Crawler() |
crawl | public void crawl(String sBasePath, String sFileFilter, String sIndexDirectory, boolean bRebuild) throws IOException, MalformedPatternException(Code) | | Add contents to a Lucene Index
Parameters: sBasePath - Base Path for crawling Parameters: sFileFilter - Perl5 Regular Expression filter for file names Parameters: sIndexDirectory - Lucene index target directory Parameters: bRebuild - true if index must be deleted and fully rebuild. throws: IOException - throws: FileNotFoundException - If sBasePath direcory does not exist throws: MalformedPatternException - If sFileFilter is not a valid Perl5 regular expression pattern |
|
|