| java.lang.Object com.flexive.extractor.HtmlExtractor
HtmlExtractor | public class HtmlExtractor (Code) | | This class allows meta data and text extraction from a HTML stream (file).
author: Gregor Schober (gregor.schober@flexive.com), UCS - unique computing solutions gmbh (http://www.ucs.at) |
extract | public static ExtractedData extract(InputStream in)(Code) | | Extracts the text informations from the html stream.
Parameters: in - the input stream to read from the extraxted informations, or null if no text extraction was possible |
extract | public static ExtractedData extract(String html)(Code) | | Extracts the text informations from the html stream.
Parameters: html - the HTML data the extraxted informations, or null if no text extraction was possible |
|
|