NekoHTML is a simple HTML scanner and tag balancer
that enables Java application programmers to parse
HTML documents and access the information using
standard XML interfaces. The parser can scan HTML
files and "fix up" many common mistakes that human
(and computer) authors make in writing HTML
documents. NekoHTML is written using the Xerces
Native Interface (XNI), which is the foundation of
the Xerces2 implementation. This enables
application programmers to use the NekoHTML parser
with existing XNI tools without modifying or
rewriting code.
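
As a rough illustration of what "standard XML interfaces"
means in practice, the sketch below walks an
org.w3c.dom.Document and lists the link targets it
contains. The class name LinkLister and its methods are
hypothetical and are not part of NekoHTML. So that the
example runs with the JDK alone, the document is built by
the JDK's own XML parser from well-formed markup; in a real
program the NekoHTML DOM parser
(org.cyberneko.html.parsers.DOMParser in the project's
distribution) is assumed to take its place and to accept
malformed HTML as well. The traversal code should not need
to change, since it depends only on the DOM interfaces.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;

// Hypothetical helper: not part of NekoHTML.
public class LinkLister {

    /** Collects the href of every anchor element, in document order. */
    static List<String> hrefs(Document doc) {
        List<String> out = new ArrayList<>();
        collect(doc.getDocumentElement(), out);
        return out;
    }

    private static void collect(Node node, List<String> out) {
        if (node == null) {
            return;
        }
        if (node.getNodeType() == Node.ELEMENT_NODE) {
            Element el = (Element) node;
            // Compare case-insensitively: an HTML parser may report
            // element names as "A" or "a" depending on its settings.
            if ("a".equalsIgnoreCase(el.getNodeName()) && el.hasAttribute("href")) {
                out.add(el.getAttribute("href"));
            }
        }
        for (Node c = node.getFirstChild(); c != null; c = c.getNextSibling()) {
            collect(c, out);
        }
    }

    /**
     * Stand-in for an HTML parser, using only the JDK. It accepts
     * well-formed markup only; an HTML parser that produces a DOM
     * Document is assumed to replace it in real use.
     */
    static Document parseWellFormed(String markup) {
        try {
            return DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(markup)));
        } catch (Exception e) {
            throw new IllegalStateException("markup was not well-formed", e);
        }
    }

    public static void main(String[] args) {
        Document doc = parseWellFormed(
                "<html><body><a href=\"https://example.org/\">home</a></body></html>");
        System.out.println(hrefs(doc));
    }
}
```

The element-name comparison ignores case on purpose, so the
same traversal works whether the parser reports names in
upper or lower case.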