| java.lang.Object org.archive.extractor.CharSequenceLinkExtractor org.archive.extractor.RegexpJSLinkExtractor
RegexpJSLinkExtractor | public class RegexpJSLinkExtractor extends CharSequenceLinkExtractor (Code) | | Uses regular expressions to find likely URIs inside Javascript.
ROUGH DRAFT IN PROGRESS / incomplete... untested...
author: gojomo |
JAVASCRIPT_STRING_EXTRACTOR | final static Pattern JAVASCRIPT_STRING_EXTRACTOR(Code) | | |
STRING_URI_DETECTOR | final static Pattern STRING_URI_DETECTOR(Code) | | |
findNextLink | protected boolean findNextLink()(Code) | | |
reset | public void reset()(Code) | | |
Methods inherited from org.archive.extractor.CharSequenceLinkExtractor | protected CharSequence charSequenceFrom(InputStream content, Charset charset)(Code)(Java Doc) protected CharSequence createCharSequenceFrom(InputStream content, Charset charset)(Code)(Java Doc) public static void extract(CharSequence content, UURI source, UURI base, List<Link> collector, ExtractErrorListener extractErrorListener)(Code)(Java Doc) abstract protected boolean findNextLink()(Code)(Java Doc) public boolean hasNext()(Code)(Java Doc) protected static CharSequenceLinkExtractor newDefaultInstance()(Code)(Java Doc) public Object next()(Code)(Java Doc) public Link nextLink()(Code)(Java Doc) public void remove()(Code)(Java Doc) public void reset()(Code)(Java Doc) public void setup(UURI source, UURI base, InputStream content, Charset charset, ExtractErrorListener listener)(Code)(Java Doc) public void setup(UURI source, UURI base, CharSequence content, ExtractErrorListener listener)(Code)(Java Doc) public void setup(UURI sourceandbase, CharSequence content, ExtractErrorListener listener)(Code)(Java Doc) public void setup(UURI sourceandbase, InputStream content, Charset charset, ExtractErrorListener listener)(Code)(Java Doc)
|
|
|