Package org.htmlcleaner

Interface Summary
BaseToken Base token interface.
ITagInfoProvider Provides set of TagInfo instances.
TagNode.ITagNodeCondition Used as base for different node checkers.
 

Class Summary
BrowserCompactXmlSerializer Broswer compact XML serializer - creates resulting XML by stripping whitespaces wherever possible, but preserving single whitespace where at least one exists.
CleanerProperties Properties defining cleaner's behaviour Created by: Vladimir Nikic
Date: March, 2008.
CleanerTransformations Contains transformation collection.
CommandLine Command line usage class.
CommentToken HTML comment token.
CompactXmlSerializer Compact XML serializer - creates resulting XML by stripping whitespaces.
ConfigFileTagProvider Default tag provider - reads XML file in specified format and creates tag infos Created by: Vladimir Nikic
Date: April, 2008.
ContentToken HTML text token.
DefaultTagProvider This class is automatically created from ConfigFileTagProvider which reads default XML configuration file with tag descriptions.
DoctypeToken HTML doctype token.
DomSerializer DOM serializer - creates xml DOM.
EndTagToken HTML tag end token.
HtmlCleaner Main HtmlCleaner class.
HtmlCleanerForAnt Support for ANT.
HtmlTokenizer Main HTML tokenizer.
JDomSerializer DOM serializer - creates xml DOM.
PrettyXmlSerializer Pretty XML serializer - creates resulting XML with indenting lines.
SimpleXmlSerializer Simple XML serializer - creates resulting XML without indenting lines.
SpecialEntities This class contains map with special entities used in HTML and their unicodes.
TagInfo Class contains information about single HTML tag.
It also contains rules for tag balancing.
TagNode XML node tag - basic node of the cleaned HTML tree.
TagToken HTML tag token - descendants are start (TagNode) and end token (EndTagToken).
TagTransformation Describes how specified tag is transformed to another one, or is ignored during parsing
Utils Common utilities.
XmlSerializer Abstract XML serializer - contains common logic for descendants.
XPather Utility for searching cleaned document tree with XPath expressions.
 

Exception Summary
HtmlCleanerException General HtmlCleaner runtime exception.
XPatherException Exception that could occure during XPather evaluation.