java.lang.Object
  org.htmlcleaner.HtmlCleaner
Main HtmlCleaner class.
It represents the public interface to the user. Its task is to call the tokenizer with the specified source HTML, traverse the list of produced tokens, and create the internal object model. It also offers a set of methods to write the resulting XML to a string, a file, or any output stream.
Typical usage is the following:
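A minimal sketch of such usage, built only from the constructor and methods listed on this page. The class name `TypicalUsage`, the helper `cleanToInnerHtml`, and the sample markup are illustrative assumptions, not part of the library.

```java
import java.io.IOException;

import org.htmlcleaner.HtmlCleaner;
import org.htmlcleaner.TagNode;

public class TypicalUsage {

    // Hypothetical helper: cleans an HTML string and returns the
    // serialized content of the resulting root node.
    public static String cleanToInnerHtml(String html) throws IOException {
        // Default tag info provider and default properties.
        HtmlCleaner cleaner = new HtmlCleaner();

        // Tokenize the source and build the internal object model.
        TagNode root = cleaner.clean(html);

        // Serialize everything below the root node back to a string.
        return cleaner.getInnerHtml(root);
    }

    public static void main(String[] args) throws IOException {
        // Deliberately sloppy input: the <p> and <b> tags are never closed.
        System.out.println(cleanToInnerHtml("<p>Hello <b>world"));
    }
}
```

The helper declares `throws IOException` because `clean(java.lang.String)` is documented here as throwing it.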
| Field Summary | |
| static java.lang.String | DEFAULT_CHARSET |
| Constructor Summary | |
| HtmlCleaner() | Constructor - creates cleaner instance with default tag info provider and default properties. |
| HtmlCleaner(CleanerProperties properties) | Constructor - creates the instance with default tag info provider and specified properties |
| HtmlCleaner(ITagInfoProvider tagInfoProvider) | Constructor - creates the instance with specified tag info provider and default properties |
| HtmlCleaner(ITagInfoProvider tagInfoProvider, CleanerProperties properties) | Constructor - creates the instance with specified tag info provider and specified properties |
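The constructor that takes `CleanerProperties` might be used as sketched below. The class `ConfiguredCleaner` and the helper `cleanWithoutComments` are hypothetical, and `setOmitComments` is assumed from the `CleanerProperties` class, which is documented on a separate page.

```java
import java.io.IOException;

import org.htmlcleaner.CleanerProperties;
import org.htmlcleaner.HtmlCleaner;
import org.htmlcleaner.TagNode;

public class ConfiguredCleaner {

    // Hypothetical helper: cleans the input with a cleaner configured
    // to drop HTML comments, then returns the serialized content.
    public static String cleanWithoutComments(String html) throws IOException {
        CleanerProperties properties = new CleanerProperties();
        // Assumption: setOmitComments(boolean) is provided by CleanerProperties.
        properties.setOmitComments(true);

        // Default tag info provider, custom properties.
        HtmlCleaner cleaner = new HtmlCleaner(properties);
        TagNode root = cleaner.clean(html);
        return cleaner.getInnerHtml(root);
    }

    public static void main(String[] args) throws IOException {
        System.out.println(cleanWithoutComments("<div><!-- note -->text</div>"));
    }
}
```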
| Method Summary | | |
| TagNode | clean(java.io.File file) | |
| TagNode | clean(java.io.File file, java.lang.String charset) | |
| TagNode | clean(java.io.InputStream in) | |
| TagNode | clean(java.io.InputStream in, java.lang.String charset) | |
| TagNode | clean(java.io.Reader reader) | Basic version of the cleaning call. |
| TagNode | clean(java.lang.String htmlContent) | |
| TagNode | clean(java.net.URL url) | |
| TagNode | clean(java.net.URL url, java.lang.String charset) | |
| java.util.Set | getAllTags() | |
| java.lang.String | getInnerHtml(TagNode node) | For the specified node, returns its content as a string. |
| CleanerProperties | getProperties() | |
| java.util.Set | getPruneTagSet() | |
| ITagInfoProvider | getTagInfoProvider() | |
| CleanerTransformations | getTransformations() | |
| void | setInnerHtml(TagNode node, java.lang.String content) | For the specified tag node, defines its HTML content. |
| void | setTransformations(CleanerTransformations transformations) | Sets transformations for this cleaner instance. |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
public static final java.lang.String DEFAULT_CHARSET
| Constructor Detail |
public HtmlCleaner()
public HtmlCleaner(ITagInfoProvider tagInfoProvider)
Parameters:
tagInfoProvider - Provider for tag filtering and balancing
public HtmlCleaner(CleanerProperties properties)
Parameters:
properties - Properties used during parsing and serializing
public HtmlCleaner(ITagInfoProvider tagInfoProvider,
                   CleanerProperties properties)
Parameters:
tagInfoProvider - Provider for tag filtering and balancing
properties - Properties used during parsing and serializing
| Method Detail |
public TagNode clean(java.lang.String htmlContent)
              throws java.io.IOException
Throws:
java.io.IOException
public TagNode clean(java.io.File file,
                     java.lang.String charset)
              throws java.io.IOException
Throws:
java.io.IOException
public TagNode clean(java.io.File file)
              throws java.io.IOException
Throws:
java.io.IOException
public TagNode clean(java.net.URL url,
                     java.lang.String charset)
              throws java.io.IOException
Throws:
java.io.IOException
public TagNode clean(java.net.URL url)
              throws java.io.IOException
Throws:
java.io.IOException
public TagNode clean(java.io.InputStream in,
                     java.lang.String charset)
              throws java.io.IOException
Throws:
java.io.IOException
public TagNode clean(java.io.InputStream in)
              throws java.io.IOException
Throws:
java.io.IOException
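The stream overloads with an explicit charset could be exercised as in the following sketch. The class `StreamCleaning`, the helper `cleanUtf8Bytes`, and the in-memory stream are illustrative assumptions; a real caller would more likely pass a file or network stream.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

import org.htmlcleaner.HtmlCleaner;
import org.htmlcleaner.TagNode;

public class StreamCleaning {

    // Hypothetical helper: decodes the bytes as UTF-8 while cleaning
    // and returns the serialized content of the root node.
    public static String cleanUtf8Bytes(byte[] htmlBytes) throws IOException {
        HtmlCleaner cleaner = new HtmlCleaner();
        // The caller owns the stream, so it is closed here explicitly.
        try (InputStream in = new ByteArrayInputStream(htmlBytes)) {
            // Name the charset instead of relying on DEFAULT_CHARSET.
            TagNode root = cleaner.clean(in, "UTF-8");
            return cleaner.getInnerHtml(root);
        }
    }

    public static void main(String[] args) throws IOException {
        byte[] bytes = "<p>plain text</p>".getBytes(StandardCharsets.UTF_8);
        System.out.println(cleanUtf8Bytes(bytes));
    }
}
```

Passing the charset explicitly keeps the result independent of whatever `DEFAULT_CHARSET` resolves to on a given platform.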
public TagNode clean(java.io.Reader reader)
              throws java.io.IOException
Parameters:
reader -
Throws:
java.io.IOException
public CleanerProperties getProperties()
public java.util.Set getPruneTagSet()
public java.util.Set getAllTags()
public ITagInfoProvider getTagInfoProvider()
public CleanerTransformations getTransformations()
public void setTransformations(CleanerTransformations transformations)
Parameters:
transformations -
public java.lang.String getInnerHtml(TagNode node)
Parameters:
node -
public void setInnerHtml(TagNode node,
                         java.lang.String content)
Parameters:
node -
content -
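As a rough illustration of how `setInnerHtml` and `getInnerHtml` fit together, the sketch below replaces the content of the first `div` in a document and reads it back. The class `InnerHtmlRoundTrip` and the helper `replaceDivContent` are hypothetical, and `findElementByName` is assumed from the `TagNode` class, which is documented on a separate page.

```java
import java.io.IOException;

import org.htmlcleaner.HtmlCleaner;
import org.htmlcleaner.TagNode;

public class InnerHtmlRoundTrip {

    // Hypothetical helper: replaces the content of the first <div> found
    // in the cleaned document and returns that node's new content.
    public static String replaceDivContent(String html, String newContent)
            throws IOException {
        HtmlCleaner cleaner = new HtmlCleaner();
        TagNode root = cleaner.clean(html);

        // Assumption: TagNode offers findElementByName(name, recursive).
        TagNode div = root.findElementByName("div", true);
        if (div == null) {
            return null; // no <div> in the input
        }

        // Replace the node's children with the given content.
        cleaner.setInnerHtml(div, newContent);
        return cleaner.getInnerHtml(div);
    }

    public static void main(String[] args) throws IOException {
        System.out.println(replaceDivContent("<div>old</div>", "<p>new</p>"));
    }
}
```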