Method Summary |
|
public ArrayList | extractURIs() Extract URIs from all objects found in a PDF document's catalog. |
protected void | extractURIs(PdfObject entity) |
protected void | getInFromFile(String doc) Read a file named 'doc' and store its bytes for later processing. |
public ArrayList | getURIs() Get the list of URIs retrieved from the PDF during the
extractURIs operation. |
protected boolean | haveSeen(int generation, int id) Indicates, based on a PdfObject's generation/id pair, whether
the parser has already encountered this object (or a reference to it),
so that it does not loop infinitely on cycles within the PDF. |
protected void | initialize() Opens the document for reading. |
public static void | main(String[] argv) |
protected void | markAsSeen(int generation, int id) Note that an object (id/generation pair) has been seen by this parser
so that it can be handled differently when it is encountered again. |
protected void | resetState() Reinitialize the object as though a new one were created. |
public void | resetState(byte[] doc) Reset the object and initialize it with a new byte array (the document). |
public void | resetState(String doc) |
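
The haveSeen/markAsSeen pair described above amounts to a visited set keyed on the (generation, id) pair. The following is a minimal sketch of that idea, not the class's actual implementation: the class name SeenTracker and the helper key() are hypothetical, and packing both ints into one long is just one possible way to build the key.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical helper illustrating cycle protection during a PDF object walk.
public class SeenTracker {
    // Each entry encodes one (generation, id) pair that was already visited.
    private final Set<Long> seen = new HashSet<>();

    // Pack the two ints into a single long: generation in the high 32 bits,
    // id in the low 32 bits (masked so a negative id cannot spill over).
    private static long key(int generation, int id) {
        return ((long) generation << 32) | (id & 0xFFFFFFFFL);
    }

    // True if this generation/id pair was marked earlier.
    public boolean haveSeen(int generation, int id) {
        return seen.contains(key(generation, id));
    }

    // Record the pair so a later encounter can be skipped.
    public void markAsSeen(int generation, int id) {
        seen.add(key(generation, id));
    }

    // Forget everything, as if a new tracker had been created.
    public void resetState() {
        seen.clear();
    }
}
```

A recursive walk would typically call haveSeen before descending into an indirect reference, return early when it is true, and call markAsSeen otherwise. Clearing the set in resetState keeps objects from a previous document from suppressing objects with the same numbers in the next one.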