org.htmlcleaner
Class HtmlTokenizer
java.lang.Object
org.htmlcleaner.HtmlTokenizer
public class HtmlTokenizer
- extends Object
Main HTML tokenizer.
It's task is to parse HTML and produce list of valid tokens:
open tag tokens, end tag tokens, contents (text) and comments.
As soon as new item is added to token list, cleaner is invoked
to clean current list at the end.
Created by: Vladimir Nikic.
Date: November, 2006
Constructor Summary |
HtmlTokenizer(HtmlCleaner cleaner,
Reader reader,
org.htmlcleaner.CleanTimeValues cleanTimeValues)
Constructor - creates instance of the parser with specified content. |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
HtmlTokenizer
public HtmlTokenizer(HtmlCleaner cleaner,
Reader reader,
org.htmlcleaner.CleanTimeValues cleanTimeValues)
- Constructor - creates instance of the parser with specified content.
- Parameters:
cleaner
- reader
-
getDocType
public DoctypeToken getDocType()
Copyright © 2006-2014. All Rights Reserved.