This package processes tag-based markup languages, that is, languages that use angle brackets: HTML, XHTML, WML, XML and other SGML dialects.
Strengths:
It provides two APIs:
The {@link TagTokenizer} scans through a document and fires events as it encounters {@link Tag}s of interest. Anything that does not qualify as a {@link Tag} will be treated as a {@link Text} token.
This approach is similar to the SAX API for XML processing.
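The event-driven style can be illustrated with a minimal, self-contained sketch. It does not use the package's own classes: <code>TokenizerSketch</code>, <code>Handler</code>, <code>tokenize</code> and <code>events</code> are hypothetical names invented for this example, and the scanning logic is deliberately naive (anything between <code>&lt;</code> and the next <code>&gt;</code> counts as a tag). It only shows how a scanner can fire tag and text callbacks, and how input that never forms a complete tag falls through as text.

```java
import java.util.ArrayList;
import java.util.List;

public class TokenizerSketch {

    /** Hypothetical callback interface; NOT the package's actual handler type. */
    interface Handler {
        void tag(String rawTag);
        void text(String rawText);
    }

    /**
     * Scans the input left to right. Anything from '<' up to the next '>' is
     * reported as a tag; everything else is reported as text.
     */
    static void tokenize(String input, Handler handler) {
        int pos = 0;
        while (pos < input.length()) {
            int open = input.indexOf('<', pos);
            int close = open < 0 ? -1 : input.indexOf('>', open);
            if (open < 0 || close < 0) {
                // No complete tag remains, so the rest is treated as text.
                handler.text(input.substring(pos));
                return;
            }
            if (open > pos) {
                handler.text(input.substring(pos, open));
            }
            handler.tag(input.substring(open, close + 1));
            pos = close + 1;
        }
    }

    /** Collects the fired events as strings, which makes them easy to inspect. */
    static List<String> events(String input) {
        List<String> out = new ArrayList<>();
        tokenize(input, new Handler() {
            public void tag(String rawTag) { out.add("TAG " + rawTag); }
            public void text(String rawText) { out.add("TEXT " + rawText); }
        });
        return out;
    }

    public static void main(String[] args) {
        for (String event : events("<p>Hello <b>world</b></p>")) {
            System.out.println(event);
        }
    }
}
```

As with SAX, the caller never sees a document tree; it only reacts to each token as the scanner reaches it.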
The {@link TagProcessor} is built on top of the {@link TagTokenizer} and acts as a registry for {@link TagRule}s and {@link TextFilter}s. It also supports multiple {@link State}s, allowing different rules to be applied in different sections of a document.
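The idea of a rule registry with switchable states can be sketched as follows. Again, this is an illustration under stated assumptions rather than the package's API: <code>ProcessorSketch</code>, its nested <code>State</code> class, <code>nameOf</code>, <code>process</code> and <code>demo</code> are hypothetical names, rules are modelled as plain functions from a raw tag to its replacement, and text filters are left out for brevity.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Function;

public class ProcessorSketch {

    /** Hypothetical state: holds its own rules, keyed by lower-case tag name. */
    static class State {
        final Map<String, Function<String, String>> rules = new HashMap<>();
    }

    final State defaultState = new State();
    State current = defaultState;

    /** Extracts the tag name from a raw tag such as "<b class='x'>" or "</b>". */
    static String nameOf(String rawTag) {
        String inner = rawTag.substring(1, rawTag.length() - 1).trim();
        if (inner.startsWith("/")) {
            inner = inner.substring(1).trim();
        }
        int end = 0;
        while (end < inner.length()
                && !Character.isWhitespace(inner.charAt(end))
                && inner.charAt(end) != '/') {
            end++;
        }
        return inner.substring(0, end).toLowerCase();
    }

    /** Copies text through unchanged and applies the current state's rule to each tag. */
    String process(String input) {
        StringBuilder out = new StringBuilder();
        int pos = 0;
        while (pos < input.length()) {
            int open = input.indexOf('<', pos);
            int close = open < 0 ? -1 : input.indexOf('>', open);
            if (open < 0 || close < 0) {
                out.append(input.substring(pos)); // no complete tag left: plain text
                break;
            }
            out.append(input, pos, open);
            String raw = input.substring(open, close + 1);
            Function<String, String> rule = current.rules.get(nameOf(raw));
            out.append(rule == null ? raw : rule.apply(raw));
            pos = close + 1;
        }
        return out.toString();
    }

    /** Rewrites <b> to <strong> everywhere except inside <pre> sections. */
    static String demo(String input) {
        ProcessorSketch p = new ProcessorSketch();
        State insidePre = new State();
        p.defaultState.rules.put("b", raw -> raw.startsWith("</") ? "</strong>" : "<strong>");
        p.defaultState.rules.put("pre", raw -> {
            if (!raw.startsWith("</")) p.current = insidePre; // opening tag enters the state
            return raw;
        });
        // insidePre has no rule for "b", so those tags pass through untouched.
        insidePre.rules.put("pre", raw -> {
            if (raw.startsWith("</")) p.current = p.defaultState; // closing tag leaves it
            return raw;
        });
        return p.process(input);
    }

    public static void main(String[] args) {
        System.out.println(demo("<b>x</b><pre><b>y</b></pre><b>z</b>"));
    }
}
```

Because each state carries its own rule table, switching state is enough to change how the same tag is handled in a different section of the document.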