BETAmodules.com is in beta — open to partnerships & joint ventures.Build with us

Home Search Compare Equivalents

One search box and one honest, consistent read on every open-source library — across every ecosystem.

npmPyPIcrates.ioRubyGemsGoMavenNuGet

Discover

Tools

Compare Equivalents

Data

deps.dev OSV advisories npm registry PyPI

About

Methodology Partner with us

© 2026 Modules · A precision instrument for picking dependencies.Data refreshed continuously from public registries, deps.dev & OSV

cross-ecosystem search · live

Results for crawler-data-web

Found in 3 of 7 ecosystemsnpm 1–24 of 1,853,258 · 28 matches across other registries

npm1853258 RubyGems8 NuGet20

How we search: free-text on npm, crates.io, RubyGems, NuGet and Maven. PyPI and Go do exact-name lookup only. Tip: click an ecosystem chip below to filter; click Show all ecosystems to come back.

Sort

Auto-load on scroll

npm matches

Showing 24 of 1,853,258 · JavaScript

See all npm →

crawler-data-webv1.0.3

This is personal project for web crawling/scraping topics. It includes few ways to crawl the data mainly using [Node.js](https://nodejs.org/en/) such as:

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

The fastest directory crawler & globbing alternative to glob, fast-glob, & tiny-glob. Crawls 1m files in < 1s

MaintenanceAging

PopularityUnknown

Aging — last published 10 months ago — check before adopting.

@ckeditor/ckeditor5-dev-web-crawlerv56.1.0

Used to run a web crawler that checks for errors on specified pages.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

simplecrawlerv1.1.9

Very straightforward, event driven web crawler. Features a flexible queue interface and a basic cache mechanism with extensible backend.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 6 years ago.

node-network-devtoolsv1.0.30

Inspecting Node.js's Network with Chrome DevTools

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

linkedomv0.18.12

A triple-linked lists based DOM implementation

MaintenanceAging

PopularityUnknown

Aging — last published 9 months ago — check before adopting.

async-mutexv0.5.0

A mutex for guarding async workflows

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

crawler-user-agentsv1.50.0

This repository contains a list of of HTTP user-agents used by robots, crawlers, and spiders as in single JSON file.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

Crawler is a ready-to-use web spider that works with proxies, asynchrony, rate limit, configurable request pools, jQuery, and HTTP/2 support.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

es6-crawler-detectv4.0.2

This is an ES6 adaptation of the original PHP library CrawlerDetect, this library will help you detect bots/crawlers/spiders vie the useragent.

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@ptrumpis/snap-lens-web-crawlerv1.2.4

Crawl and download Snap Lenses from *lens.snapchat.com* with ease.

MaintenanceAging

PopularityUnknown

Aging — last published 11 months ago — check before adopting.

notion-md-crawlerv1.0.2

A library to recursively retrieve and serialize Notion pages with customization for machine learning applications.

MaintenanceAging

PopularityUnknown

Aging — last published 11 months ago — check before adopting.

prerender-nodev3.8.3

express middleware for serving prerendered javascript-rendered pages for SEO

MaintenanceAging

PopularityUnknown

Aging — last published 9 months ago — check before adopting.

npm-license-crawlerv0.2.1

Analyzes license information for multiple node.js modules (package.json files) as part of your software project.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 7 years ago.

@nuxtjs/devicev4.0.0

Device detection module for Nuxt

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

tavily-mcpv0.2.20

MCP server for advanced web search using Tavily

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

thredds-catalog-crawlerv0.0.7

A module for crawling thredds catalogs

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

js-crawlerv0.3.21

Web crawler for Node.js

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 7 years ago.

@web-master/node-web-crawlerv0.10.0

Crawl web as easy as possible

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 6 years ago.

x-ray-crawlerv2.0.5

x-ray's crawler

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 5 years ago.

is-web-crawlerv1.1.0

A light weight JS library to check if a user agent is a web crawler.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 5 years ago.

iqy-web-crawlerv6.3.4

web crawler

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 7 years ago.

web-streams-polyfillv4.3.0

Web Streams, based on the WHATWG spec reference implementation

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

Array manipulation, ordering, searching, summarizing, etc.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 3 years ago.

1 2 3 4 5…77220

RubyGems matches

Showing 7 of 8 · Ruby

See all RubyGems →

web_crawlerv0.5.4

Web crawler help you with parse and collect data from the web

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 14 years ago.

event-crawlerv0.1.0

Generic Web crawler with a DSL that parses event-related data from web pages

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 14 years ago.

Wgit was primarily designed to crawl static HTML websites to index and search their content - providing the basis of any search engine; but Wgit is suitable for many application domains including: URL parsing, data mining and statistical analysis.

MaintenanceAging

PopularityNiche

Aging — last published 10 months ago — check before adopting.

Generic Web crawler with a DSL that parses structured data from web pages

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

crawler_guruv0.1.0

Crawler Guru provides all basic functionalities to extract data from web pages

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 4 years ago.

simplecrawlerv0.1.8

The SimpleCrawler module is a library for crawling web sites. The crawler provides comprehensive data from the page crawled which can be used for page analysis, indexing, accessibility checks etc. Restrictions can be specified to limit crawling of binary files.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 14 years ago.

skyscraperv0.1.0

Easy to use DSL that helps scraping data from websites. Thanks to it, writing web crawlers would be very fast and intuitive. Traversing through html nodes and fetching all of the HTML attributes, would be possible. Just like in jQuery - you will find methods like parent, children, first, find, siblings etc. Furthermore, you are able to download images, web pages, and store all content in the database. Please visit my Github account for more details.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 14 years ago.

NuGet matches

Showing 12 of 20 · .NET

See all NuGet →

fiftyone.devicedetection.cloudv4.5.106

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

fiftyone.devicedetection.sharedv4.5.106

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 4 years ago.

fiftyone.devicedetection.hash.engine.onpremisev4.5.106

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

fiftyone.devicedetectionv4.5.106

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

crawlerlib.enginev2.3.5544.21265

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 11 years ago.

aspose.htmlv26.5.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

plurcrawlerv1.0.2

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 8 years ago.

dataextractingsdkv1.0.1

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 14 years ago.

webpx.webcrawlerv7.0.0

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 8 years ago.

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

webreaperv11.2.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.