Java Doc for CoreAttributeConstants.java in heritrix (org.archive.crawler.datamodel)



org.archive.crawler.datamodel.CoreAttributeConstants

All known Subclasses: org.archive.crawler.extractor.ExtractorCSS, org.archive.crawler.fetcher.FetchHTTP, org.archive.crawler.frontier.WorkQueueFrontier, org.archive.crawler.framework.WriterPoolProcessor, org.archive.crawler.extractor.ExtractorJS, org.archive.crawler.admin.SeedRecord, org.archive.crawler.io.RuntimeErrorFormatter, org.archive.crawler.extractor.ExtractorImpliedURI, org.archive.crawler.datamodel.CandidateURI, org.archive.crawler.processor.recrawl.FetchHistoryProcessor, org.archive.crawler.extractor.ExtractorHTMLTest, org.archive.crawler.util.CrawledBytesHistotable, org.archive.crawler.extractor.ExtractorHTTP, org.archive.crawler.extractor.ExtractorPDF, org.archive.crawler.extractor.JerichoExtractorHTML, org.archive.crawler.writer.ARCWriterProcessor, org.archive.crawler.postprocessor.CrawlStateUpdater, org.archive.crawler.io.LocalErrorFormatter, org.archive.crawler.frontier.AbstractFrontier, org.archive.crawler.extractor.ExtractorDOC, org.archive.crawler.deciderules.recrawl.IdenticalDigestDecideRule, org.archive.crawler.extractor.ExtractorHTML, org.archive.crawler.writer.ExperimentalV10WARCWriterProcessor, org.archive.crawler.extractor.JerichoExtractorHTMLTest, org.archive.crawler.fetcher.FetchDNS, org.archive.crawler.prefetch.PreconditionEnforcer, org.archive.crawler.extractor.ExtractorURI, org.archive.crawler.extractor.ExtractorSWF, org.archive.crawler.frontier.AdaptiveRevisitFrontier, org.archive.crawler.writer.MirrorWriterProcessor, org.archive.crawler.writer.ExperimentalWARCWriterProcessor, org.archive.crawler.fetcher.FetchFTP, org.archive.crawler.extractor.ExtractorXML, org.archive.crawler.io.UriProcessingFormatter, org.archive.crawler.io.UriErrorFormatter, org.archive.crawler.writer.Kw3WriterProcessor, org.archive.crawler.extractor.ExtractorUniversal, org.archive.crawler.framework.ToeThread, org.archive.crawler.deciderules.NotExceedsDocumentLengthTresholdDecideRule, org.archive.crawler.extractor.TrapSuppressExtractor
CoreAttributeConstants
public interface CoreAttributeConstants
CrawlURI attribute keys used by the core crawler classes.
author:
   gojomo
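
The interface holds only string keys, and the classes listed above pick them up by implementing it. A minimal sketch of that pattern follows. The interface name `SketchAttributeConstants`, the class `SketchFetcher`, the key values, and the plain `Map` standing in for a CrawlURI's attribute list are all assumptions made for illustration, not the real Heritrix definitions.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for the real interface; the key values are invented.
interface SketchAttributeConstants {
    String A_CONTENT_TYPE = "content-type";          // assumed value
    String A_FETCH_BEGAN_TIME = "fetch-began-time";  // assumed value
}

// Implementing the interface lets the class use the keys without qualification.
final class SketchFetcher implements SketchAttributeConstants {

    // Stores two attributes in a plain map that stands in for a CrawlURI's attribute list.
    static Map<String, Object> recordFetch(String mimeType, long beganMillis) {
        Map<String, Object> attrs = new HashMap<>();
        attrs.put(A_CONTENT_TYPE, mimeType);
        attrs.put(A_FETCH_BEGAN_TIME, beganMillis);
        return attrs;
    }

    public static void main(String[] args) {
        Map<String, Object> attrs = recordFetch("text/html", System.currentTimeMillis());
        System.out.println(attrs.get(A_CONTENT_TYPE));
    }
}
```

Any other class that implements the same interface can read the value back with the same constant, so producer and consumer never repeat the literal string.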


Field Summary
public static String A_ANNOTATIONS
final public static String A_CONTENT_DIGEST
public static String A_CONTENT_TYPE
final public static String A_CREDENTIAL_AVATARS_KEY
     Key to get credential avatars from A_LIST.
public static String A_DELAY_FACTOR
public static String A_DISTANCE_FROM_SEED
public static String A_DNS_FETCH_TIME
public static String A_DNS_SERVER_IP_LABEL
final public static String A_ETAG_HEADER
public static String A_FETCH_BEGAN_TIME
public static String A_FETCH_COMPLETED_TIME
final public static String A_FETCH_HISTORY
final public static String A_FORCE_RETIRE
final public static String A_HERITABLE_KEYS
     Key to (optional) attribute specifying a list of keys that are passed to CandidateURIs that 'descend' (are discovered via) this URI.
public static String A_HTML_BASE
final public static String A_HTTP_PROXY_HOST
final public static String A_HTTP_PROXY_PORT
public static String A_HTTP_TRANSACTION
final public static String A_LAST_MODIFIED_HEADER
public static String A_LOCALIZED_ERRORS
public static String A_META_ROBOTS
public static String A_MINIMUM_DELAY
     Minimum delay before fetching another item of the same class (e.g. host).
public static String A_MIRROR_PATH
     Defined for org.archive.crawler.writer.MirrorWriterProcessor.
public static String A_PREREQUISITE_URI
final public static String A_REFERENCE_LENGTH
public static String A_RETRY_DELAY
public static String A_RRECORD_SET_LABEL
public static String A_RUNTIME_EXCEPTION
public static String A_SOURCE_TAG
     A 'source' (usually a URI) that is inherited by discovered URIs.
final public static String A_STATUS
final public static String HEADER_TRUNC
final public static String LENGTH_TRUNC
final public static String TIMER_TRUNC
final public static String TRUNC_SUFFIX
     Fetch truncation codes present in CrawlURI annotations.



Field Detail
A_ANNOTATIONS
public static String A_ANNOTATIONS
Shorthand string tokens indicating notable occurrences, separated by commas.
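
Because the annotations are one comma-separated string, adding and reading tokens comes down to simple string handling. The sketch below assumes a hypothetical helper class `AnnotationTokens`; the tokens used in `main` are made up, and the real crawler classes may manage this string differently.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Hypothetical helper, not part of Heritrix.
final class AnnotationTokens {

    // Appends one token to a comma-separated annotation string.
    static String addAnnotation(String existing, String token) {
        if (existing == null || existing.isEmpty()) {
            return token;
        }
        return existing + "," + token;
    }

    // Splits the annotation string back into its individual tokens.
    static List<String> tokens(String annotations) {
        if (annotations == null || annotations.isEmpty()) {
            return Collections.emptyList();
        }
        return Arrays.asList(annotations.split(","));
    }

    public static void main(String[] args) {
        String annotations = addAnnotation(addAnnotation("", "firstToken"), "secondToken");
        System.out.println(tokens(annotations));
    }
}
```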



A_CONTENT_DIGEST
final public static String A_CONTENT_DIGEST
content digest



A_CONTENT_TYPE
public static String A_CONTENT_TYPE
Extracted MIME type of fetched content; should be set immediately by fetching module if possible (rather than waiting for a later analyzer)



A_CREDENTIAL_AVATARS_KEY
final public static String A_CREDENTIAL_AVATARS_KEY
Key to get credential avatars from A_LIST.



A_DELAY_FACTOR
public static String A_DELAY_FACTOR
Multiplier of the last fetch duration to wait before fetching another item of the same class (e.g. host).



A_DISTANCE_FROM_SEED
public static String A_DISTANCE_FROM_SEED



A_DNS_FETCH_TIME
public static String A_DNS_FETCH_TIME



A_DNS_SERVER_IP_LABEL
public static String A_DNS_SERVER_IP_LABEL



A_ETAG_HEADER
final public static String A_ETAG_HEADER
header name (and AList key) for ETag



A_FETCH_BEGAN_TIME
public static String A_FETCH_BEGAN_TIME



A_FETCH_COMPLETED_TIME
public static String A_FETCH_COMPLETED_TIME



A_FETCH_HISTORY
final public static String A_FETCH_HISTORY
fetch history array



A_FORCE_RETIRE
final public static String A_FORCE_RETIRE
flag indicating the containing queue should be retired



A_HERITABLE_KEYS
final public static String A_HERITABLE_KEYS
Key to (optional) attribute specifying a list of keys that are passed to CandidateURIs that 'descend' (are discovered via) this URI.
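
One way to read this description: when a URI is discovered via a parent, only the attributes named in the parent's heritable-key list are copied to it. The sketch below illustrates that reading with plain maps. The class `HeritableKeysSketch`, the key value `"heritable"`, and the method `inherit` are assumptions, not the actual CandidateURI logic.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration of key inheritance; not the Heritrix implementation.
final class HeritableKeysSketch {
    static final String A_HERITABLE_KEYS = "heritable"; // assumed value

    // Builds the attribute map of a discovered URI from its parent's attributes.
    static Map<String, Object> inherit(Map<String, Object> parent) {
        Map<String, Object> child = new HashMap<>();
        Object keys = parent.get(A_HERITABLE_KEYS);
        if (!(keys instanceof List)) {
            return child; // the attribute is optional: nothing to inherit
        }
        for (Object key : (List<?>) keys) {
            String name = String.valueOf(key);
            if (parent.containsKey(name)) {
                child.put(name, parent.get(name));
            }
        }
        return child;
    }

    public static void main(String[] args) {
        Map<String, Object> parent = new HashMap<>();
        parent.put("source", "http://example.com/");
        parent.put("scratch", "not inherited");
        parent.put(A_HERITABLE_KEYS, Arrays.asList("source"));
        System.out.println(inherit(parent));
    }
}
```

In this sketch the key list is itself passed on only if it names its own key, which is how inheritance would continue past the first generation.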



A_HTML_BASE
public static String A_HTML_BASE



A_HTTP_PROXY_HOST
final public static String A_HTTP_PROXY_HOST
local override of proxy host



A_HTTP_PROXY_PORT
final public static String A_HTTP_PROXY_PORT
local override of proxy port
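
A "local override" suggests that a per-URI value, when present, takes precedence over the crawl-wide proxy setting. The sketch below shows that fallback with a plain map. The class `ProxyOverrideSketch`, the key values, the value types (a `String` host and an `Integer` port), and the default proxy are all assumptions.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration of a per-URI proxy override; not the FetchHTTP logic.
final class ProxyOverrideSketch {
    static final String A_HTTP_PROXY_HOST = "http-proxy-host"; // assumed value
    static final String A_HTTP_PROXY_PORT = "http-proxy-port"; // assumed value

    // Returns "host:port", preferring per-URI attributes over the given defaults.
    static String resolveProxy(Map<String, Object> attrs, String defaultHost, int defaultPort) {
        Object host = attrs.get(A_HTTP_PROXY_HOST);
        Object port = attrs.get(A_HTTP_PROXY_PORT);
        String resolvedHost = host instanceof String ? (String) host : defaultHost;
        int resolvedPort = port instanceof Integer ? (Integer) port : defaultPort;
        return resolvedHost + ":" + resolvedPort;
    }

    public static void main(String[] args) {
        Map<String, Object> attrs = new HashMap<>();
        attrs.put(A_HTTP_PROXY_HOST, "local-proxy.example");
        System.out.println(resolveProxy(attrs, "proxy.example", 8080));
    }
}
```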



A_HTTP_TRANSACTION
public static String A_HTTP_TRANSACTION



A_LAST_MODIFIED_HEADER
final public static String A_LAST_MODIFIED_HEADER
header name (and AList key) for last-modified timestamp
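
A_ETAG_HEADER and A_LAST_MODIFIED_HEADER each serve as both an HTTP header name and an attribute key, so a header value can be stored under the very name it arrived with. A minimal sketch of that dual use follows. The class `ValidatorHeadersSketch`, the lower-case key values, and the header map with lower-cased names are assumptions.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration; the constant values are assumed, not taken from Heritrix.
final class ValidatorHeadersSketch {
    static final String A_ETAG_HEADER = "etag";                   // assumed value
    static final String A_LAST_MODIFIED_HEADER = "last-modified"; // assumed value

    // Copies the two headers, when present, into an attribute map under the same names.
    static Map<String, Object> recordValidators(Map<String, String> responseHeaders) {
        Map<String, Object> attrs = new HashMap<>();
        for (String key : new String[] {A_ETAG_HEADER, A_LAST_MODIFIED_HEADER}) {
            String value = responseHeaders.get(key);
            if (value != null) {
                attrs.put(key, value);
            }
        }
        return attrs;
    }

    public static void main(String[] args) {
        Map<String, String> headers = new HashMap<>();
        headers.put("etag", "\"abc123\"");
        headers.put("content-type", "text/html");
        System.out.println(recordValidators(headers));
    }
}
```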



A_LOCALIZED_ERRORS
public static String A_LOCALIZED_ERRORS



A_META_ROBOTS
public static String A_META_ROBOTS



A_MINIMUM_DELAY
public static String A_MINIMUM_DELAY
Minimum delay before fetching another item of the same class (e.g. host). Even if lastFetchTime*delayFactor is less than this, this period will be waited.
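
Taken together, the A_DELAY_FACTOR and A_MINIMUM_DELAY descriptions amount to a simple rule: wait the last fetch duration times the delay factor, but never less than the minimum delay. The sketch below works through that arithmetic. The class `PolitenessDelaySketch`, the method name, and the millisecond units are assumptions; the actual frontier code may apply further limits.

```java
// Hypothetical illustration of the delay rule described above.
final class PolitenessDelaySketch {

    // Returns the wait in milliseconds: the scaled fetch duration, floored at the minimum delay.
    static long delayMillis(long lastFetchDurationMillis, double delayFactor, long minimumDelayMillis) {
        long scaled = Math.round(lastFetchDurationMillis * delayFactor);
        return Math.max(scaled, minimumDelayMillis);
    }

    public static void main(String[] args) {
        // 200 ms * 5.0 = 1000 ms, which is below the 2000 ms minimum.
        System.out.println(delayMillis(200, 5.0, 2000));  // prints 2000
        // 1000 ms * 5.0 = 5000 ms, which exceeds the minimum.
        System.out.println(delayMillis(1000, 5.0, 2000)); // prints 5000
    }
}
```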



A_MIRROR_PATH
public static String A_MIRROR_PATH
Defined for org.archive.crawler.writer.MirrorWriterProcessor.



A_PREREQUISITE_URI
public static String A_PREREQUISITE_URI



A_REFERENCE_LENGTH
final public static String A_REFERENCE_LENGTH
reference length (content length or virtual length)



A_RETRY_DELAY
public static String A_RETRY_DELAY



A_RRECORD_SET_LABEL
public static String A_RRECORD_SET_LABEL



A_RUNTIME_EXCEPTION
public static String A_RUNTIME_EXCEPTION



A_SOURCE_TAG
public static String A_SOURCE_TAG
A 'source' (usually a URI) that is inherited by discovered URIs.



A_STATUS
final public static String A_STATUS
key for status (when in history)



HEADER_TRUNC
final public static String HEADER_TRUNC



LENGTH_TRUNC
final public static String LENGTH_TRUNC



TIMER_TRUNC
final public static String TIMER_TRUNC



TRUNC_SUFFIX
final public static String TRUNC_SUFFIX
Fetch truncation codes present in CrawlURI annotations. All truncation annotations have a TRUNC_SUFFIX suffix (TODO: make these unique for sure, or redo truncation so that a definitive flag is marked against the CrawlURI).
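
Since every truncation annotation ends with TRUNC_SUFFIX, a consumer can detect truncation by scanning the comma-separated annotation tokens for that suffix. The sketch below shows such a check. The class `TruncationSketch` and the suffix value `"Trunc"` are assumptions, and, as the TODO above notes, a suffix match is not guaranteed to be unique.

```java
// Hypothetical illustration; the suffix value is assumed, not taken from Heritrix.
final class TruncationSketch {
    static final String TRUNC_SUFFIX = "Trunc"; // assumed value

    // Returns true if any comma-separated annotation token ends with the truncation suffix.
    static boolean wasTruncated(String annotations) {
        if (annotations == null || annotations.isEmpty()) {
            return false;
        }
        for (String token : annotations.split(",")) {
            if (token.trim().endsWith(TRUNC_SUFFIX)) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        System.out.println(wasTruncated("someToken,length" + TRUNC_SUFFIX));
        System.out.println(wasTruncated("someToken"));
    }
}
```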




