A library and command-line tool to extract clean content from web pages using Mozilla Readability and convert it to Markdown or JSON.
A lightweight Node.js module to fetch, parse web content, extract meta tags, and capture webpage screenshots efficiently.
MCP server for extracting web content using web-content-extract library
Fito Plugin — fetch web content, extract knowledge components (execution facts, domain concepts, usage scenarios) from API/tool documentation, and generate content for social networks.
A function to recursively extract files and their object paths within a value, replacing them with null in a deep clone without mutating the original value. FileList instances are treated as File instance arrays. Files are typically File and Blob instance
Extract meaning from JS Errors
extracts CSS into separate files
A CSS Modules transform to extract local aliases for inline imports
Array manipulation, ordering, searching, summarizing, etc.
unzip a zip file into a directory using 100% javascript
Zero-runtime Stylesheets-in-TypeScript
Zero-runtime Stylesheets-in-TypeScript
A light-weight module that brings Fetch API to node.js
Zero-runtime Stylesheets-in-TypeScript
AI SDK tools for Parallel Web
After Effects plugin for exporting animations to SVG + JavaScript or canvas + JavaScript
filesystem bindings for tar-stream
Zero-runtime Stylesheets-in-TypeScript
Zero-runtime Stylesheets-in-TypeScript
PDF extraction and rendering across all JavaScript runtimes
Extract meaning from JS Errors
Zero-runtime Stylesheets-in-TypeScript
Create multi-variant styles with a type-safe runtime API, heavily inspired by https://stitches.dev
Babel plugin to extract translatable messages from source code into Lingui catalogs
A pure ruby implementation of the boilerpipe web content extraction algorithm
A flexible web-scraper with built in content extraction
Extracts content like title, summary, and images from web pages like Dracula extracts blood: with care and finesse.
ReadabilityJs is a Ruby wrapper gem for the mozilla readability library to extract the main content from web pages. It uses the Nodo gem to run the JavaScript Readability library in a Node.js environment, allowing for efficient and accurate content extraction within Ruby applications.
This library can be used to call the Yahoo Term Extraction Web Service from Ruby. The Term Extraction Web Service provides a list of significant words or phrases extracted from a larger content.
Ruby port of Mozilla Readability.js - extracts the main content from web pages, like Firefox Reader View
Nous crawls same-host web pages, extracts readable content, and serializes clean Markdown as text or JSON.
The content API allows site owners, service developers and web-analytics analysts to extract information about products on the Yandex.Market. The API provides data from model cards (including prices, descriptions, photos and reviews), as well as complete information about stores and the availability of goods in them.
Official Ruby SDK for Capture (capture.page). Capture screenshots, generate PDFs, extract content and metadata from web pages.
FerrumMCP is a browser automation server that implements the Model Context Protocol (MCP), enabling AI assistants to interact with web pages through a standardized interface. Features include navigation, form interaction, content extraction, screenshot capture, JavaScript execution, cookie management, and advanced capabilities like smart cookie banner detection and AI-powered CAPTCHA solving.
The SpiderCloud gem implements a lightweight interface to the Spider Cloud API. Spider Cloud provides powerful web scraping and crawling capabilities with support for JavaScript rendering, proxy rotation, and anti-bot measures. This gem supports scrape, crawl, screenshot, and links endpoints with comprehensive options for content extraction, filtering, and automation.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.