A HTML table parser based on sxd_html.
Web content extraction library inspired by trafilatura. Extracts main text, metadata, and comments from HTML.
Crate for determining the file format of a given file or stream.
Unified data extraction — Regex, XPath 1.0, CSS Selectors, and JMESPath behind one query interface