Add HTML parsing support to sxd_document. This makes it possible to evaluate XPath expressions on HTML documents.
An HTML table parser based on sxd_html.
Web content extraction library inspired by trafilatura. Extracts main text, metadata, and comments from HTML.
XPath evaluation on HTML documents for kawat
Unified data extraction — Regex, XPath 1.0, CSS Selectors, and JMESPath behind one query interface
Represent an XML document as a read-only tree.
Crate for determining the file format of a given file or stream.