| |
|
| java.lang.Object org.archive.util.ms.Doc
Doc | public class Doc (Code) | | Reads .doc files.
author: pjack |
getText | public static SeekReader getText(String docFilename) throws IOException(Code) | | Returns the text of the .doc file with the given file name.
Parameters: docFilename - the name of the file whose text to return the text of that file throws: IOException - if an IO error occurs |
getText | public static SeekReader getText(File doc) throws IOException(Code) | | Returns the text of the given .doc file.
Parameters: doc - the .doc file whose text to return the text of that file throws: IOException - if an IO error occurs |
getText | public static SeekReader getText(BlockFileSystem wordDoc, int cacheSize) throws IOException(Code) | | Returns the text for the given .doc file. The given cacheSize refers
to the number of the .doc file's piece table entries to cache. Most
.doc files only have 1 piece table entry; however, a "fast-saved"
.doc file might have several. A cacheSize of 20 should be ample for
most .doc files in the world. Since piece table entries are small --
only 12 bytes each -- caching them prevents many otherwise necessary
file pointer repositionings.
Parameters: wordDoc - the .doc file as a BlockFileSystem Parameters: cacheSize - the number of piece table entries to cache a reader that will return the text in the file throws: IOException - if an IO error occurs |
|
|
|