One search box and one honest, consistent read on every open-source library — across every ecosystem.

npm · PyPI · crates.io · RubyGems · Go · Maven · NuGet

© 2026 Modules · A precision instrument for picking dependencies. Data refreshed continuously from public registries, deps.dev & OSV.

cross-ecosystem search · live

Results for extract-text-html

Found in 3 of 7 ecosystems · npm 1–24 of 397,657 · 266 matches across other registries

npm 397,657 · RubyGems 12 · NuGet 254

How we search: free-text search on npm, crates.io, RubyGems, NuGet and Maven. PyPI and Go support exact-name lookup only.
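The split between free-text and exact-name lookup can be sketched as below. This is only an illustration of the rule stated in the note, not the site's actual implementation; the names `FREE_TEXT`, `EXACT_NAME`, `search_mode`, and `plan_queries` are hypothetical.

```python
# Hypothetical sketch: which search mode applies to each ecosystem,
# following the "How we search" note. Not the site's real code.
FREE_TEXT = {"npm", "crates.io", "RubyGems", "NuGet", "Maven"}
EXACT_NAME = {"PyPI", "Go"}


def search_mode(ecosystem: str) -> str:
    """Return 'free-text' or 'exact-name' for a known ecosystem."""
    if ecosystem in FREE_TEXT:
        return "free-text"
    if ecosystem in EXACT_NAME:
        return "exact-name"
    raise ValueError(f"unknown ecosystem: {ecosystem}")


def plan_queries(query: str) -> dict[str, tuple[str, str]]:
    """Map every ecosystem to the (mode, query) pair that would be issued."""
    return {
        eco: (search_mode(eco), query)
        for eco in sorted(FREE_TEXT | EXACT_NAME)
    }


if __name__ == "__main__":
    for eco, (mode, q) in plan_queries("extract-text-html").items():
        print(f"{eco}: {mode} lookup for {q!r}")
```

Under this rule, a query such as `extract-text-html` can only match on PyPI or Go if a package with exactly that name exists, which is consistent with results appearing in only some of the seven ecosystems.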

npm matches

Showing 24 of 397,657 · JavaScript

extract-text-html v0.3.0

Extract text from HTML. Excludes content from metadata tags by default.

Maintenance: Healthy

Popularity: Unknown

Maintained. Actively maintained.

node-fetch v3.3.2

A light-weight module that brings Fetch API to node.js

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 2 years ago.

mini-css-extract-plugin v2.10.2

extracts CSS into separate files

Maintenance: Healthy

Popularity: Unknown

Maintained. Actively maintained.

optimize-css-assets-webpack-plugin v6.0.1

A Webpack plugin to optimize \ minimize CSS assets.

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 4 years ago.

@emotion/server v11.11.0

Extract and inline critical css with emotion for server side rendering.

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 3 years ago.

@supabase/node-fetch v2.6.13

A light-weight module that brings window.fetch to node.js

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 2 years ago.

extract-loader v5.1.0

webpack loader to extract HTML and CSS from the bundle

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 6 years ago.

html-to-text v10.0.0

Advanced html to plain text converter

Maintenance: Healthy

Popularity: Unknown

Maintained. Actively maintained.

extract-css v3.0.2

Extract the CSS from an HTML document.

Maintenance: Aging

Popularity: Unknown

Aging. Last published over a year ago; check before adopting.

PDF extraction and rendering across all JavaScript runtimes

Maintenance: Healthy

Popularity: Unknown

Maintained. Actively maintained.

natural-compare v1.4.0

Compare strings containing a mix of letters and numbers in the way a human being would in sort order.

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 9 years ago.

Remove reply quotations from emails

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 2 years ago.

@pnpm/node-fetch v1.0.0

A light-weight module that brings Fetch API to node.js

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 3 years ago.

unicode-emoji-utils v1.3.1

A collection of utilities for emojis

Maintenance: Aging

Popularity: Unknown

Aging. Last published over a year ago; check before adopting.

postcss-modules-extract-imports v3.1.0

A CSS Modules transform to extract local aliases for inline imports

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 2 years ago.

@capsizecss/vanilla-extract v2.0.4

Vanilla-extract integration for capsize

Maintenance: Aging

Popularity: Unknown

Aging. Last published 6 months ago; check before adopting.

extract-zip v2.0.1

unzip a zip file into a directory using 100% javascript

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 5 years ago.

html-minifier v4.0.0

Highly configurable, well-tested, JavaScript-based HTML minifier.

Maintenance: Abandoned

Popularity: Unknown

Security: 1 advisory

Has 1 high-severity advisory. Verify a patched version exists before using.

negotiator v1.0.0

HTTP content negotiation

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published over a year ago.

topojson-client v3.1.0

Manipulate TopoJSON and convert it to GeoJSON.

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 6 years ago.

@types/html-to-text v9.0.4

TypeScript definitions for html-to-text

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 2 years ago.

extract-urls v1.4.1

Extract urls from a string and returns an array

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 2 years ago.

d3-sankey v0.12.3

Visualize flow between nodes in a directed acyclic network.

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 6 years ago.

pdf-text-extract v1.5.0

Extract text from pdfs that contain searchable pdf text

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 9 years ago.

RubyGems matches

Exact match · Ruby

Deba takes a HTML document or fragment and extracts the textual content into a plaintext format that is easy for humans to read.

Maintenance: Abandoned

Popularity: Niche

Abandoned. Last published 9 years ago.

extractcontent v0.0.1

This module is to extract the text from web page(html).

Maintenance: Abandoned

Popularity: Niche

Abandoned. Last published 16 years ago.

chupa-text-decomposer-html v1.0.5

This is a ChupaText decomposer plugin for to extract text and meta-data from HTML. You can use `html` decomposer.

Maintenance: Abandoned

Popularity: Niche

Abandoned. Last published over a year ago.

url_extractor v0.0.1

A tool for extracting and replacing URLs from inside a block of text or HTML.

Maintenance: Abandoned

Popularity: Niche

Abandoned. Last published 12 years ago.

ContactDetective v1.0.0

A simple Ruby API for extracting contact data, such as emails, addresses, and phone numbers from text documents and hyperlinks. Also has the ability to save the extracted data as JSON objects and files. For more info, see https://github.com/jweinst1/ContactDetective

Maintenance: Abandoned

Popularity: Niche

Abandoned. Last published 10 years ago.

hirobumi-chocolate_disco-jruby v0.1.5

Provides methods to extract texts from various file formats like Microsoft Office (<= 2002, as well as >= 2007,) PDF and HTML.

Maintenance: Abandoned

Popularity: Niche

Abandoned. Last published 11 years ago.

Generate HTML tables which popular spreadsheet software packages know how to read

Maintenance: Abandoned

Popularity: Niche

Abandoned. Last published 11 years ago.

escapement v2.0.0

Given a HTML formatted string, escapement will extract descendant tags into a device agnostic attributes array that can be used for formatting the text anywhere.

Maintenance: Abandoned

Popularity: Niche

Abandoned. Last published 9 years ago.

Pismo extracts and retrieves content-related metadata from HTML pages - you can use the resulting data in an organized way, such as a summary/first paragraph, body text, keywords, RSS feed URL, favicon, etc.

Maintenance: Abandoned

Popularity: Niche

Abandoned. Last published 13 years ago.

kreuzberg v4.9.8

Kreuzberg is a high-performance document intelligence library with a Rust core and native Ruby bindings via Magnus. Extract text, metadata, and structured data from 75+ file formats including PDF, DOCX, PPTX, XLSX, HTML, RTF, images (with OCR), email, archives, and more. Features async/sync APIs, text chunking, language detection, and keyword extraction.

Maintenance: Healthy

Popularity: Niche

Maintained. Niche but actively maintained.

nddrylliog_pismo v0.7.4

Pismo extracts and retrieves content-related metadata from HTML pages - you can use the resulting data in an organized way, such as a summary/first paragraph, body text, keywords, RSS feed URL, favicon, etc.

Maintenance: Abandoned

Popularity: Niche

Abandoned. Last published 13 years ago.

langextract v0.1.0

Extract structured information from text with source grounding, deterministic serialization, and HTML visualization.

Maintenance: Healthy

Popularity: Niche

Maintained. Niche but actively maintained.

NuGet matches

Showing 12 of 254 · .NET

aspose.pdf v26.5.0

No description provided.

Maintenance: Healthy

Popularity: Unknown

Maintained. Actively maintained.

bitmiracle.docotic.pdf v9.9.19928

No description provided.

Maintenance: Healthy

Popularity: Unknown

Maintained. Actively maintained.

No description provided.

Maintenance: Healthy

Popularity: Unknown

Maintained. Actively maintained.

pdftron.net.x64 v11.13.0

No description provided.

Maintenance: Healthy

Popularity: Unknown

Maintained. Actively maintained.

No description provided.

Maintenance: Healthy

Popularity: Unknown

Maintained. Actively maintained.

textdiscovery v1.0.3

No description provided.

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 9 years ago.

select.pdf.netcore v26.2.0

No description provided.

Maintenance: Healthy

Popularity: Unknown

Maintained. Actively maintained.

aspose.pdf.drawing v26.5.0

No description provided.

Maintenance: Healthy

Popularity: Unknown

Maintained. Actively maintained.

freespire.pdf v12.4.0

No description provided.

Maintenance: Healthy

Popularity: Unknown

Maintained. Actively maintained.

groupdocs.parser v26.4.0

No description provided.

Maintenance: Healthy

Popularity: Unknown

Maintained. Actively maintained.

apitron.pdf.kit v2.0.57

No description provided.

Maintenance: Abandoned

Popularity: Unknown

Abandoned. Last published 2 years ago.

select.pdf v26.2.0

No description provided.

Maintenance: Healthy

Popularity: Unknown

Maintained. Actively maintained.