Utility for extracting title and main contents from an HTML text.
get browser context for AI Agent
library for extracting content from HTML strings
Shared HTML→Markdown cleaning + AIO scoring engine. Powers @ontosdk/next, the Onto Read API, and the AIO scanner.
A crawler implemented using a headless browser (Chrome).
Simple but solid node-based web crawler, includes a content extractor
Automatically grab the main text out of a webpage
A library and command-line tool to extract clean content from web pages using Mozilla Readability and convert it to Markdown or JSON.
Browser automation toolkit for A2R agent task execution
A powerful web content extractor that converts articles to clean markdown
Site configuration loader for Graby-TS with dynamic imports
MCP Server for web search and content fetching
a lightweight JavaScript library for parsing, extracting, and converting Markdown content. It provides a set of utility functions and loaders to help you efficiently handle Markdown files in Node.js or browser environments.
A tool for extracting the file structure and file content from web development projects. Useful for copying the context for analysis by other tools, like ChatGPT or GPT4.
Takes a text file and splits it into 2 files based on a filter