| This class breaks up HTML into tokens.
This class differs slightly from LexML as follows: after certain tags,
like the <script> tag, the body that follows is
uninterpreted data and ends only at the next, in this case,
</script> tag, not at the just the next
"<" or ">" character. This is one way that HTML is not fully
compliant with XML.
The default set of tags that have this special processing is
<script> , <style> , and
<xmp> . The user can change this by retrieving
the Vector of special tags via
getClosingTags , and modifying it as needed.
author: Colin Stevens (colin.stevens@sun.com) version: 1.9, 00/05/31 |