Download parameters. These parameters are limits on
how a Page can download a Link. A Crawler has a
default set of download parameters, but the defaults
can be overridden on individual links by calling
Link.setDownloadParameters().
DownloadParameters is an immutable class (like String).
"Changing" a parameter actually returns a new instance
of the class with only the specified parameter changed.
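The copy-on-change pattern described above can be sketched in a few lines. This is a minimal, self-contained illustration of the idiom, not the WebSPHINX source: the class and field names here (ImmutableParams, crawlTimeout, maxPageSize) are hypothetical, chosen to mirror the parameters documented below.

```java
// Minimal sketch of the immutable "change" pattern: every field is
// final, and each change* method returns a fresh copy with only the
// requested field altered, leaving the original object untouched.
public class ImmutableParams {
    private final int crawlTimeout; // seconds; negative turns the timeout off
    private final int maxPageSize;  // kilobytes; 0 or negative means no limit

    // Defaults matching the conventions described in this document.
    public ImmutableParams() {
        this(-1, 100);
    }

    private ImmutableParams(int crawlTimeout, int maxPageSize) {
        this.crawlTimeout = crawlTimeout;
        this.maxPageSize = maxPageSize;
    }

    // Returns a new instance with only the crawl timeout changed.
    public ImmutableParams changeCrawlTimeout(int timeout) {
        return new ImmutableParams(timeout, maxPageSize);
    }

    // Returns a new instance with only the page-size limit changed.
    public ImmutableParams changeMaxPageSize(int kilobytes) {
        return new ImmutableParams(crawlTimeout, kilobytes);
    }

    public int getCrawlTimeout() { return crawlTimeout; }
    public int getMaxPageSize()  { return maxPageSize; }
}
```

Because each change returns a new object, calls chain naturally, e.g. `params.changeCrawlTimeout(300).changeMaxPageSize(50)`, and the original `params` is unaffected, just as "changing" a String produces a new String.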
changeCrawlTimeout(int timeout) Change timeout value.
Parameters: timeout - maximum length of time (in seconds) that the crawler will run. Use a negative value to turn off the timeout.
changeDownloadTimeout(int timeout) Change download timeout value.
Parameters: timeout - length of time (in seconds) to wait for a page to download. Use a negative value to turn off the timeout.
changeObeyRobotExclusion(boolean f) Change obey-robot-exclusion flag.
Parameters: f - if true, the crawler checks robots.txt on the remote Web site before downloading a page.
getUserAgent() Get User-agent header used in HTTP requests.
Returns: user-agent field used in HTTP requests, or null if the Java library's default user-agent is used.
Change accepted MIME types.
Parameters: types - list of MIME types that can be handled by the crawler. Use null if the crawler can handle anything.
Returns: new DownloadParameters object with the specified parameter changed.
Change timeout value.
Parameters: timeout - maximum length of time (in seconds) that the crawler will run. Use a negative value to turn off the timeout.
Returns: new DownloadParameters object with the specified parameter changed.
Change download timeout value.
Parameters: timeout - length of time (in seconds) to wait for a page to download. Use a negative value to turn off the timeout.
Returns: new DownloadParameters object with the specified parameter changed.
Change interactive flag.
Parameters: f - true if a user is available to respond to dialog boxes.
Returns: new DownloadParameters object with the specified parameter changed.
Change maximum page size. Pages larger than this limit are treated as
leaves in the crawl graph -- neither downloaded nor parsed.
Parameters: maxPageSize - maximum page size in kilobytes.
Returns: new DownloadParameters object with the specified parameter changed.
Set maximum threads.
Parameters: maxthreads - maximum number of background threads used by the crawler.
Returns: new DownloadParameters object with the specified parameter changed.
Change obey-robot-exclusion flag.
Parameters: f - if true, the crawler checks robots.txt on the remote Web site before downloading a page.
Returns: new DownloadParameters object with the specified parameter changed.
Change use-caches flag.
Parameters: f - true if cached pages should be used whenever possible.
Returns: new DownloadParameters object with the specified parameter changed.
Change User-agent field used in HTTP requests.
Parameters: userAgent - user-agent field used in HTTP requests. Pass null to use the Java library's default user-agent field.
Returns: new DownloadParameters object with the specified parameter changed.
Get accepted MIME types.
Returns: list of MIME types that can be handled by the crawler (which are passed as the Accept header in the HTTP request). Default is null.
Get download timeout value.
Returns: length of time (in seconds) that the crawler will wait for a page to download before aborting it. Default is 60 seconds.
Get maximum page size. Pages larger than this limit are neither
downloaded nor parsed.
Default value is 100 (KB). 0 or negative values mean no limit.
Returns: maximum page size in kilobytes.
Get User-agent header used in HTTP requests.
Returns: user-agent field used in HTTP requests, or null if the Java library's default user-agent is used. Default value is null (but for a Crawler, the default DownloadParameters has the Crawler's name as its default user-agent).