No description provided.
Microdata parser. Extract and parse microdata from any website
node.js html5 microdata parser
A fast and lightweight streaming Microdata to RDF parser
Schema.org microdata parser
Microdata to json and json-ld parser
A microdata rdf-parse-html actor
A utility package to parse strings
A JavaScript parser
the mighty option parser used by yargs
Node.js body parsing middleware
An ESLint custom parser which leverages TypeScript ESTree
> Selector parser with built in methods for working with selector strings.
Detect Browser, Engine, OS, CPU, and Device type/model from User-Agent & Client Hints data. Supports browser & node.js environment
Range header field string parser
Algorithms to help you parse CSS from an array of tokens.
Validate XML, Parse XML, Build XML without C/C++ based libraries
An inline style parser.
No description provided.
Streaming HTML parser with scripting support.
A parser to Amazon Resource Names
Streaming, source-agnostic EventSource/Server-Sent Events parser
A pure JS HTTP parser for node.
Parser for the client for the realtime Engine
A Microdata parser and extractor library which includes support for the schema.org vocabularies
Microdata schema.org vocabularies
Scrapetor is a Ruby HTML parsing + scraping toolkit. The parser is a native C arena DOM with structural indexes built at parse time and NEON SIMD scanners in the SAX hot loop. A streaming extraction engine compiles the schema DSL into a single forward pass — no DOM materialised, one Ruby boundary crossing per document. On builds where libcurl is available, Scrapetor::Fetcher adds an HTTP/2-capable fetch layer with per-thread connection cache, shared DNS + TLS session pool, in-process gzip / deflate / brotli / zstd decoding, iconv charset transcoding, retry + exponential backoff, ETag / Last-Modified disk cache with bulk revalidation, per-host throttle, cookie jar, basic + bearer auth, proxy, and three bulk concurrency models (parallel_fetch / multi_fetch / streaming multi_each). Scrapetor::Session ties the cookie / auth / throttle / retry policies together. Also ships robots.txt + sitemap.xml parsers, a bounded-memory streaming HTML parser, and structured-data extractors (JSON-LD, OpenGraph, Schema.org, Microdata, RDFa, Twitter Cards). The Net::HTTP-based Scrapetor.fetch is preserved as the no-libcurl fallback.