Cleans HTML input down to a limited subset of HTML, focusing mostly
on structural/semantic elements (which elements and attributes are
kept is configurable).
The input events should be in no namespace and contain
the html and body tags. All elements and attributes should already
be lowercased.
All elements and attributes that are not explicitly allowed
in the configuration will be dropped (but
their character content will remain).
Span and div elements are treated specially. They will only be
kept if their class attribute has one of the allowed values specified
in the configuration of this component. Span elements that contain
a style attribute specifying bold and/or italic styling will
be converted to the equivalent strong/em tags.
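
The rules above can be illustrated with a minimal Python sketch. It is not
this component's implementation: the ALLOWED and ALLOWED_CLASSES tables, the
Cleaner class, the clean function and the CSS patterns used to detect bold and
italic are all hypothetical stand-ins for the real configuration. It assumes
lowercased, well-formed input in which every element is explicitly closed.

```python
from html import escape
from html.parser import HTMLParser
import re

# Hypothetical configuration: allowed elements mapped to their allowed attributes.
ALLOWED = {
    "html": set(), "body": set(), "p": set(),
    "strong": set(), "em": set(), "a": {"href"},
}
# Hypothetical allowed class values for span and div.
ALLOWED_CLASSES = {"span": {"note"}, "div": {"warning"}}

# Assumed CSS spellings for bold and italic styling.
BOLD = re.compile(r"font-weight\s*:\s*bold")
ITALIC = re.compile(r"font-style\s*:\s*italic")


class Cleaner(HTMLParser):
    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []
        self.stack = []  # per open input element: the output tags it produced

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        emitted = []
        if tag in ("span", "div"):
            cls = attrs.get("class")
            style = attrs.get("style") or ""
            if cls in ALLOWED_CLASSES[tag]:
                # Kept only because its class value is allowed.
                self.out.append('<%s class="%s">' % (tag, escape(cls)))
                emitted.append(tag)
            elif tag == "span":
                # Styled span: convert to strong and/or em.
                if BOLD.search(style):
                    emitted.append("strong")
                if ITALIC.search(style):
                    emitted.append("em")
                self.out.extend("<%s>" % t for t in emitted)
        elif tag in ALLOWED:
            # Keep the element, but only its allowed attributes.
            kept = "".join(
                ' %s="%s"' % (name, escape(value or ""))
                for name, value in attrs.items()
                if name in ALLOWED[tag]
            )
            self.out.append("<%s%s>" % (tag, kept))
            emitted.append(tag)
        # Dropped elements push an empty list, so nothing is closed later.
        self.stack.append(emitted)

    def handle_endtag(self, tag):
        if self.stack:
            for t in reversed(self.stack.pop()):
                self.out.append("</%s>" % t)

    def handle_data(self, data):
        # Character content always survives, even inside dropped elements.
        self.out.append(escape(data, quote=False))


def clean(html):
    cleaner = Cleaner()
    cleaner.feed(html)
    cleaner.close()
    return "".join(cleaner.out)
```

With this sketch, a font element is dropped while its text is kept, and a
span styled as bold and italic becomes nested strong and em elements. A span
whose class is allowed is kept as-is here; whether its style should also be
converted is left open.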