Java Doc for DownloadParameters.java in  » Web-Crawler » WebSPHINX » websphinx » Java Source Code / Java DocumentationJava Source Code and Java Documentation

Java Source Code / Java Documentation
1. 6.0 JDK Core
2. 6.0 JDK Modules
3. 6.0 JDK Modules com.sun
4. 6.0 JDK Modules com.sun.java
5. 6.0 JDK Modules sun
6. 6.0 JDK Platform
7. Ajax
8. Apache Harmony Java SE
9. Aspect oriented
10. Authentication Authorization
11. Blogger System
12. Build
13. Byte Code
14. Cache
15. Chart
16. Chat
17. Code Analyzer
18. Collaboration
19. Content Management System
20. Database Client
21. Database DBMS
22. Database JDBC Connection Pool
23. Database ORM
24. Development
25. EJB Server geronimo
26. EJB Server GlassFish
27. EJB Server JBoss 4.2.1
28. EJB Server resin 3.1.5
29. ERP CRM Financial
30. ESB
31. Forum
32. GIS
33. Graphic Library
34. Groupware
35. HTML Parser
36. IDE
37. IDE Eclipse
38. IDE Netbeans
39. Installer
40. Internationalization Localization
41. Inversion of Control
42. Issue Tracking
43. J2EE
44. JBoss
45. JMS
46. JMX
47. Library
48. Mail Clients
49. Net
50. Parser
51. PDF
52. Portal
53. Profiler
54. Project Management
55. Report
56. RSS RDF
57. Rule Engine
58. Science
59. Scripting
60. Search Engine
61. Security
62. Sevlet Container
63. Source Control
64. Swing Library
65. Template Engine
66. Test Coverage
67. Testing
68. UML
69. Web Crawler
70. Web Framework
71. Web Mail
72. Web Server
73. Web Services
74. Web Services apache cxf 2.0.1
75. Web Services AXIS2
76. Wiki Engine
77. Workflow Engines
78. XML
79. XML UI
Java
Java Tutorial
Java Open Source
Jar File Download
Java Articles
Java Products
Java by API
Photoshop Tutorials
Maya Tutorials
Flash Tutorials
3ds-Max Tutorials
Illustrator Tutorials
GIMP Tutorials
C# / C Sharp
C# / CSharp Tutorial
C# / CSharp Open Source
ASP.Net
ASP.NET Tutorial
JavaScript DHTML
JavaScript Tutorial
JavaScript Reference
HTML / CSS
HTML CSS Reference
C / ANSI-C
C Tutorial
C++
C++ Tutorial
Ruby
PHP
Python
Python Tutorial
Python Open Source
SQL Server / T-SQL
SQL Server / T-SQL Tutorial
Oracle PL / SQL
Oracle PL/SQL Tutorial
PostgreSQL
SQL / MySQL
MySQL Tutorial
VB.Net
VB.Net Tutorial
Flash / Flex / ActionScript
VBA / Excel / Access / Word
XML
XML Tutorial
Microsoft Office PowerPoint 2007 Tutorial
Microsoft Office Excel 2007 Tutorial
Microsoft Office Word 2007 Tutorial
Java Source Code / Java Documentation » Web Crawler » WebSPHINX » websphinx 
Source Cross Reference  Class Diagram Java Document (Java Doc) 


java.lang.Object
   websphinx.DownloadParameters

DownloadParameters
public class DownloadParameters implements Cloneable,java.io.Serializable(Code)
Download parameters. These parameters are limits on how Page can download a Link. A Crawler has a default set of download parameters, but the defaults can be overridden on individual links by calling Link.setDownloadParameters().

DownloadParameters is an immutable class (like String). "Changing" a parameter actually returns a new instance of the class with only the specified parameter changed.



Field Summary
final public static  DownloadParametersDEFAULT
    
final public static  DownloadParametersNO_LIMITS
    

Constructor Summary
public  DownloadParameters()
     Make a DownloadParameters object with default settigns.

Method Summary
public  DownloadParameterschangeAcceptedMIMETypes(String types)
     Change accepted MIME types.
Parameters:
  types - list of MIME types that can be handledby the crawler.
public  DownloadParameterschangeCrawlTimeout(int timeout)
     Change timeout value.
Parameters:
  timeout - maximum length of time (in seconds) that crawler will run.Use a negative value to turn off timeout.
public  DownloadParameterschangeDownloadTimeout(int timeout)
     Change download timeout value.
Parameters:
  timeout - length of time (in seconds) to wait for a page to downloadUse a negative value to turn off timeout.
public  DownloadParameterschangeInteractive(boolean f)
     Change interactive flag.
public  DownloadParameterschangeMaxPageSize(int maxPageSize)
     Change maximum page size.
public  DownloadParameterschangeMaxThreads(int maxthreads)
     Set maximum threads.
public  DownloadParameterschangeObeyRobotExclusion(boolean f)
     Change obey-robot-exclusion flag.
Parameters:
  f - If true, then thecrawler checks robots.txt on the remote Web sitebefore downloading a page.
public  DownloadParameterschangeUseCaches(boolean f)
     Change use-caches flag.
public  DownloadParameterschangeUserAgent(String userAgent)
     Change User-agent field used in HTTP requests.
Parameters:
  userAgent - user-agent field used in HTTPrequests.
public  Objectclone()
     Clone a DownloadParameters object.
public  StringgetAcceptedMIMETypes()
     Get accepted MIME types.
public  intgetCrawlTimeout()
     Get timeout on entire crawl. maximum length of time (in seconds) that crawler will runbefore aborting.
public  intgetDownloadTimeout()
     Get download timeout value. length of time (in seconds) that crawler will wait for a page to downloadbefore aborting it.timeout.
public  booleangetInteractive()
     Get interactive flag. true if a user is available to respond todialog boxes (for instance, to enter passwords forauthentication).
public  intgetMaxPageSize()
     Get maximum page size.
public  intgetMaxThreads()
     Get maximum threads.
public  booleangetObeyRobotExclusion()
     Get obey-robot-exclusion flag.
public  booleangetUseCaches()
     Get use-caches flag.
public  StringgetUserAgent()
     Get User-agent header used in HTTP requests. user-agent field used in HTTP requests,or null if the Java library's default user-agentis used.

Field Detail
DEFAULT
final public static DownloadParameters DEFAULT(Code)



NO_LIMITS
final public static DownloadParameters NO_LIMITS(Code)




Constructor Detail
DownloadParameters
public DownloadParameters()(Code)
Make a DownloadParameters object with default settigns.




Method Detail
changeAcceptedMIMETypes
public DownloadParameters changeAcceptedMIMETypes(String types)(Code)
Change accepted MIME types.
Parameters:
  types - list of MIME types that can be handledby the crawler. Use null if the crawler can handle anything. new DownloadParameters object with the specified parameter changed.



changeCrawlTimeout
public DownloadParameters changeCrawlTimeout(int timeout)(Code)
Change timeout value.
Parameters:
  timeout - maximum length of time (in seconds) that crawler will run.Use a negative value to turn off timeout. new DownloadParameters object with the specified parameter changed.



changeDownloadTimeout
public DownloadParameters changeDownloadTimeout(int timeout)(Code)
Change download timeout value.
Parameters:
  timeout - length of time (in seconds) to wait for a page to downloadUse a negative value to turn off timeout. new DownloadParameters object with the specified parameter changed.



changeInteractive
public DownloadParameters changeInteractive(boolean f)(Code)
Change interactive flag.
Parameters:
  f - true if a user is available to respondto dialog boxes new DownloadParameters object with the specified parameter changed.



changeMaxPageSize
public DownloadParameters changeMaxPageSize(int maxPageSize)(Code)
Change maximum page size. Pages larger than this limit are treated as leaves in the crawl graph -- neither downloaded nor parsed.
Parameters:
  maxPageSize - maximum page size in kilobytes new DownloadParameters object with the specified parameter changed.



changeMaxThreads
public DownloadParameters changeMaxThreads(int maxthreads)(Code)
Set maximum threads.
Parameters:
  maxthreads - maximum number of background threads used by crawler new DownloadParameters object with the specified parameter changed.



changeObeyRobotExclusion
public DownloadParameters changeObeyRobotExclusion(boolean f)(Code)
Change obey-robot-exclusion flag.
Parameters:
  f - If true, then thecrawler checks robots.txt on the remote Web sitebefore downloading a page. new DownloadParameters object with the specified parameter changed.



changeUseCaches
public DownloadParameters changeUseCaches(boolean f)(Code)
Change use-caches flag.
Parameters:
  f - true if cached pages should be used whenever possible new DownloadParameters object with the specified parameter changed.



changeUserAgent
public DownloadParameters changeUserAgent(String userAgent)(Code)
Change User-agent field used in HTTP requests.
Parameters:
  userAgent - user-agent field used in HTTPrequests. Pass null to use the Java library's defaultuser-agent field. new DownloadParameters object with the specified parameter changed.



clone
public Object clone()(Code)
Clone a DownloadParameters object.



getAcceptedMIMETypes
public String getAcceptedMIMETypes()(Code)
Get accepted MIME types. list of MIME types that can be handled by the crawler (which are passed as the Accept headerin the HTTP request).Default is null.



getCrawlTimeout
public int getCrawlTimeout()(Code)
Get timeout on entire crawl. maximum length of time (in seconds) that crawler will runbefore aborting. Default is -1 (no limit).



getDownloadTimeout
public int getDownloadTimeout()(Code)
Get download timeout value. length of time (in seconds) that crawler will wait for a page to downloadbefore aborting it.timeout. Default is 60 seconds.



getInteractive
public boolean getInteractive()(Code)
Get interactive flag. true if a user is available to respond todialog boxes (for instance, to enter passwords forauthentication). Default is true.



getMaxPageSize
public int getMaxPageSize()(Code)
Get maximum page size. Pages larger than this limit are neither downloaded nor parsed. Default value is 100 (KB). 0 or negative values mean no limit. maximum page size in kilobytes



getMaxThreads
public int getMaxThreads()(Code)
Get maximum threads. maximum number of background threads used by crawler.Default is 4.



getObeyRobotExclusion
public boolean getObeyRobotExclusion()(Code)
Get obey-robot-exclusion flag. true iff thecrawler checks robots.txt on the remote Web sitebefore downloading a page. Default is false.



getUseCaches
public boolean getUseCaches()(Code)
Get use-caches flag. true if cached pages should be used wheneverpossible



getUserAgent
public String getUserAgent()(Code)
Get User-agent header used in HTTP requests. user-agent field used in HTTP requests,or null if the Java library's default user-agentis used. Default value is null (but for a Crawler,the default DownloadParameters has the Crawler'sname as its default user-agent).



Methods inherited from java.lang.Object
native protected Object clone() throws CloneNotSupportedException(Code)(Java Doc)
public boolean equals(Object obj)(Code)(Java Doc)
protected void finalize() throws Throwable(Code)(Java Doc)
final native public Class getClass()(Code)(Java Doc)
native public int hashCode()(Code)(Java Doc)
final native public void notify()(Code)(Java Doc)
final native public void notifyAll()(Code)(Java Doc)
public String toString()(Code)(Java Doc)
final native public void wait(long timeout) throws InterruptedException(Code)(Java Doc)
final public void wait(long timeout, int nanos) throws InterruptedException(Code)(Java Doc)
final public void wait() throws InterruptedException(Code)(Java Doc)

www.java2java.com | Contact Us
Copyright 2009 - 12 Demo Source and Support. All rights reserved.
All other trademarks are property of their respective owners.