A fast, efficient JSON data stream parser for processing large datasets in Node.js. It provides a streaming API to read JSON objects line-by-line or in chunks.
A micro-library of stream components for building custom JSON and JSONC processing pipelines with a minimal memory footprint — parse, filter, and transform JSON far larger than available memory with a SAX-inspired token API, on Node.js or Web Streams.
minimal implementation of a PassThrough stream
Streaming HTML parser with scripting support.
Streaming, source-agnostic EventSource/Server-Sent Events parser
Streaming JSON parser in Javascript for Node.js, Deno and the browser
Pure Javascript JSON to CSV converter.
Parser adapter for parsing JSON documents into base namespace.
Option parsing for Node, supporting types, shorthands, etc. Used by npm.
Scanner and parser for JSON with comments.
Generic interruptible "parser" mixin for Transform & Writable streams
Parse, Resolve, and Dereference JSON Schema $ref pointers
CSV parsing implementing the Node.js `stream.Transform` API
Node.js Transform and Async interface to convert JSON into CSV.
tar-stream is a streaming tar parser and generator and nothing else. It operates purely using streams which means you can easily extract/parse tarballs without ever hitting the file system.
Lightweight, streamed JSON parser in 748 bytes
Parse partial JSON generated by LLM
git-log-parser
Node.js body parsing middleware
JSON.Stringify as a readable stream
An evented streaming XML parser in JavaScript
MermaidJS parser
Big-friendly JSON. Asynchronous streaming functions for large JSON data sets.
Parse and compile gettext po and mo files to/from json, nothing more, nothing less
A streaming JSON parser (generates SAX-like events) and "JSON Path" like implementation to parse small amount of data in a large JSON file.
Scrapetor is a Ruby HTML parsing + scraping toolkit. The parser is a native C arena DOM with structural indexes built at parse time and NEON SIMD scanners in the SAX hot loop. A streaming extraction engine compiles the schema DSL into a single forward pass — no DOM materialised, one Ruby boundary crossing per document. On builds where libcurl is available, Scrapetor::Fetcher adds an HTTP/2-capable fetch layer with per-thread connection cache, shared DNS + TLS session pool, in-process gzip / deflate / brotli / zstd decoding, iconv charset transcoding, retry + exponential backoff, ETag / Last-Modified disk cache with bulk revalidation, per-host throttle, cookie jar, basic + bearer auth, proxy, and three bulk concurrency models (parallel_fetch / multi_fetch / streaming multi_each). Scrapetor::Session ties the cookie / auth / throttle / retry policies together. Also ships robots.txt + sitemap.xml parsers, a bounded-memory streaming HTML parser, and structured-data extractors (JSON-LD, OpenGraph, Schema.org, Microdata, RDFa, Twitter Cards). The Net::HTTP-based Scrapetor.fetch is preserved as the no-libcurl fallback.