Parse, search and stream HTML tabular data using Node.js and isaacs/sax-js.
Validate XML, Parse XML, Build XML without C/C++ based libraries
HTML to React parser.
A very fast HTML parser, generating a simplified DOM, with basic element query support.
HTML to DOM parser.
Streaming HTML parser with scripting support.
Fast & forgiving HTML/XML parser
An evented streaming XML parser in JavaScript
Parse HTML/XML to PostHTMLTree
HTML parser and serializer.
lezer-based HTML grammar
Streaming SAX-style HTML parser.
A HTML parser extracted from Angular with some modifications
<p align="left"> <img src="https://github.com/yeonjuan/es-html-parser/actions/workflows/main.yml/badge.svg?branch=main" alt="CI Badge" /> <a href="https://codecov.io/gh/yeonjuan/es-html-parser" > <img src="https://codecov.io/gh/yeonjuan/es-html-parser/bra
An evented streaming XML parser in JavaScript
A very fast HTML parser, generating a simplified DOM, with basic element query support.
Parse, validate, traverse, transform, and optimize Oniguruma regular expressions
Parser for the content attribute of the meta viewport
Node.js body parsing middleware
An ESLint custom parser which leverages TypeScript ESTree
the mighty option parser used by yargs
Liquid HTML parser by Shopify
A specification compliant robots.txt parser with wildcard (*) matching support.
Parser for @html-eslint/eslint-plugin
There are three main function of this gem read html, search data, rebuild html.
Scrapetor is a Ruby HTML parsing + scraping toolkit. The parser is a native C arena DOM with structural indexes built at parse time and NEON SIMD scanners in the SAX hot loop. A streaming extraction engine compiles the schema DSL into a single forward pass — no DOM materialised, one Ruby boundary crossing per document. On builds where libcurl is available, Scrapetor::Fetcher adds an HTTP/2-capable fetch layer with per-thread connection cache, shared DNS + TLS session pool, in-process gzip / deflate / brotli / zstd decoding, iconv charset transcoding, retry + exponential backoff, ETag / Last-Modified disk cache with bulk revalidation, per-host throttle, cookie jar, basic + bearer auth, proxy, and three bulk concurrency models (parallel_fetch / multi_fetch / streaming multi_each). Scrapetor::Session ties the cookie / auth / throttle / retry policies together. Also ships robots.txt + sitemap.xml parsers, a bounded-memory streaming HTML parser, and structured-data extractors (JSON-LD, OpenGraph, Schema.org, Microdata, RDFa, Twitter Cards). The Net::HTTP-based Scrapetor.fetch is preserved as the no-libcurl fallback.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.