A HTML tokenizer
This is a partial port of the functionality behind Perl's TokeParser Provided a page it progressively returns tokens from that page