Fast dom parser based on regexps
HTML to DOM parser.
TypeScript definitions for dom-parser
fastest XML DOM Parser for node/browser/worker
XML DOM, Parser & Stringifier
dom parser using javascript regex.
HTML/XML to DOM parser for browsers and Node.js
MP DOM parser & configurations
AdaptJS DOM Parser
Cross-environment (nodejs/web) DOM parser for XML and HTML
A fast and minimalistic HTML/XML DOM parser with CSS selectors
A DOM parser for the react library, used for instantiating react components declaratively
Dependency-free and lean DOM parser that outputs Markdown
DOM parser for OpenReceipt
A lightweight DOM parser for server-side HTML parsing and manipulation with full DOM API support
A small parser that converts HTML to React using the DOMParser API.
Prototyped HTML DOM Parser and compiler
Fast & forgiving HTML/XML parser
Handler for htmlparser2 that turns pages into a dom
A simple HTML parser
### Usage
Scanner and parser for JSON with comments.
An evented streaming XML parser in JavaScript
Utilities for working with htmlparser2's dom
A CSS parser based on the DOM API
There are three main function of this gem read html, search data, rebuild html.
Green Button Data is a Ruby gem that can consume Green Button APIs and parse the Green Button data XML schema very quickly. It uses an event-driven SAX parser which parses XML data without building an entire DOM in memory.
Scrapetor is a Ruby HTML parsing + scraping toolkit. The parser is a native C arena DOM with structural indexes built at parse time and NEON SIMD scanners in the SAX hot loop. A streaming extraction engine compiles the schema DSL into a single forward pass — no DOM materialised, one Ruby boundary crossing per document. On builds where libcurl is available, Scrapetor::Fetcher adds an HTTP/2-capable fetch layer with per-thread connection cache, shared DNS + TLS session pool, in-process gzip / deflate / brotli / zstd decoding, iconv charset transcoding, retry + exponential backoff, ETag / Last-Modified disk cache with bulk revalidation, per-host throttle, cookie jar, basic + bearer auth, proxy, and three bulk concurrency models (parallel_fetch / multi_fetch / streaming multi_each). Scrapetor::Session ties the cookie / auth / throttle / retry policies together. Also ships robots.txt + sitemap.xml parsers, a bounded-memory streaming HTML parser, and structured-data extractors (JSON-LD, OpenGraph, Schema.org, Microdata, RDFa, Twitter Cards). The Net::HTTP-based Scrapetor.fetch is preserved as the no-libcurl fallback.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.