a web promo scraper using node.js
AI-powered tools for web scraping and searching
A rust crate, that is used to get html from a website, and scrape the content in it
Web scraper integration for flows.network
Web scraping logic for docbox, document parsing, favicon resolution, ogp resolution
Scraps paper informations from sci-hub.
Constant-frequency recursive CLI web scraper with frequency, filtering, file directory, and many other options for scraping HTML, images and other files.
Core scraping engine for Halldyll - high-performance async web scraper for AI agents
MCP (Model Context Protocol) server for the CRW web scraper
HTTP and CDP browser rendering engine for the CRW web scraper
Firecrawl-compatible API server for the CRW web scraper
HTML extraction and markdown conversion engine for the CRW web scraper
Core types, config, and error handling for the CRW web scraper
Web scraper for Crabs - tokio version
Web Scraper is a library to build APIs by scraping static sites and use data as models.
It's an utility to scrape web pages
A decent web scraping gem.Scrapes website's title, description,social profiles such as linkedin, facebook, twitter, instgram, vimeo,pinterest, youtube channel and contact details such as emails, phone numbers.
pikuri-pdf plugs PDF → text extraction into pikuri-core's +Pikuri::Extractor+ registry. The bundled +Pikuri::Extractors::PDF+ extractor wraps the pure-Ruby pdf-reader gem and extracts lazily: paged reads (the +read+ tool's windows) parse only the pages the window needs, so the first page of a 500-page PDF never pays for the other 499. Shipped separately from pikuri-core so the core's dependency tree stays minimal and auditable: pdf-reader and its transitive deps (Ascii85, afm, hashery, ruby-rc4, ttfunk) ride along only for hosts that opt into PDF support. Registration is explicit — +Pikuri::Extractors::PDF.register+ — so requiring the gem changes nothing by itself; the host script picks which extractors it wires in. One registration extends the +read+ tool, +web_scrape+, and the pikuri-vectordb indexer simultaneously.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.