BETAmodules.com is in beta — open to partnerships & joint ventures.Build with us

Home Search Compare Equivalents

One search box and one honest, consistent read on every open-source library — across every ecosystem.

npmPyPIcrates.ioRubyGemsGoMavenNuGet

Discover

Tools

Compare Equivalents

Data

deps.dev OSV advisories npm registry PyPI

About

Methodology Partner with us

© 2026 Modules · A precision instrument for picking dependencies.Data refreshed continuously from public registries, deps.dev & OSV

cross-ecosystem search · live

Results for text-extract

Found in 4 of 7 ecosystemsnpm 1–24 of 313,063 · 1196 matches across other registries

npm313063 crates.io54 RubyGems12 NuGet1130

How we search: free-text on npm, crates.io, RubyGems, NuGet and Maven. PyPI and Go do exact-name lookup only. Tip: click an ecosystem chip below to filter; click Show all ecosystems to come back.

Sort

Auto-load on scroll

npm matches

Showing 24 of 313,063 · JavaScript

See all npm →

text-extractv1.0.5

A robust Node.js utility for extracting text from PDF, DOCX, DOC, XLSX, and TXT buffers.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

pdf-text-extractv1.5.0

Extract text from pdfs that contain searchable pdf text

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 9 years ago.

ai-localize-vscodev2.0.1

Highlight hardcoded text, extract locale keys, and validate translations

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@effishai/pdf-extract-node-win32-arm64-msvcv0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

expo-pdf-text-extractv1.1.0

Native PDF text extraction for React Native and Expo. Extract text content from PDF files using platform-native APIs (PDFKit on iOS, PDFBox on Android). Works with Expo development builds.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@effishai/pdf-extract-nodev0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@effishai/pdf-extract-node-linux-arm64-muslv0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@effishai/pdf-extract-node-win32-x64-msvcv0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@paul_sizon/expo-pdf-text-extractv1.0.1

Native PDF text extraction for React Native and Expo. Extract text content from PDF files using platform-native APIs (PDFKit on iOS, PDFBox on Android). Works with Expo development builds.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

mini-css-extract-pluginv2.10.2

extracts CSS into separate files

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@effishai/pdf-extract-node-linux-x64-muslv0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@effishai/pdf-extract-node-darwin-arm64v0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@effishai/pdf-extract-node-linux-x64-gnuv0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@effishai/pdf-extract-node-darwin-universalv0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@effishai/pdf-extract-node-darwin-x64v0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@effishai/pdf-extract-node-linux-arm64-gnuv0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

summarizer-ai-mcpv1.0.0

MCP server for summarizer ai. Features summarize text, extract key points, generate abstract. From MEOK AI Labs.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

optimize-css-assets-webpack-pluginv6.0.1

A Webpack plugin to optimize \ minimize CSS assets.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 4 years ago.

@hyper.fun/fluentui-icon-document-text-extract-filledv2.0.0

👉 https://hyper.fun/c/fluentui-icon-document-text-extract-filled/2.0.0

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 3 years ago.

PDF extraction and rendering across all JavaScript runtimes

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

node-fetchv3.3.2

A light-weight module that brings Fetch API to node.js

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

postcss-modules-extract-importsv3.1.0

A CSS Modules transform to extract local aliases for inline imports

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

unicode-emoji-utilsv1.3.1

A collection of utilities for emojis

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

extract-zipv2.0.1

unzip a zip file into a directory using 100% javascript

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 5 years ago.

1 2 3 4 5…13045

crates.io matches

Showing 12 of 54 · Rust

See all crates.io →

spectre_pdfv1.0.0

Native Rust PDF extraction engine: text, markdown for RAG, AcroForm widgets, image decoding, and encrypted PDFs. Lazy parser, persistent Document handle, no C dependencies.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

pdf-text-extractv0.2.0

Extract text, tables, and structured content from PDF files

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

keyword_extractionv1.5.0

Collection of algorithms for keyword extraction from text

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published over a year ago.

papyrus-corev0.1.0

PDF-to-Markdown conversion engine with smart heading detection, bold/italic text extraction, and CommonMark output. Pure Rust, best-effort parsing for corrupted PDFs.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

pdf_oxidev0.3.61

The fastest Rust PDF library with text extraction: 0.8ms mean, 100% pass rate on 3,830 PDFs. 5× faster than pdf_extract, 17× faster than oxidize_pdf. Extract, create, and edit PDFs.

MaintenanceHealthy

PopularityRising

Worth a look. Actively maintained and growing, actively maintained.

arabic_pdf_to_textv0.1.0

A CLI tool to convert Arabic PDFs to text using Google's Gemini API

MaintenanceAging

PopularityNiche

Aging — last published 11 months ago — check before adopting.

bolivar-cliv1.7.0

PDF text extraction CLI tools

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

bolivar-corev1.7.0

PDF content extraction library

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

deformatv0.15.1

Extract plain text from HTML, PDF, and other document formats

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

docx-litev0.2.0

Lightweight, fast DOCX text extraction library with minimal dependencies

MaintenanceAging

PopularityNiche

Aging — last published 8 months ago — check before adopting.

elizaos-plugin-pdfv2.0.0

elizaOS PDF Plugin - PDF reading and text extraction

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

epub-parserv0.3.4

A Rust library for extracting metadata, table of contents, text, cover, and images from EPUB files.

MaintenanceHealthy

PopularityRising

Worth a look. Actively maintained and growing, actively maintained.

RubyGems matches

Exact match · Ruby

simple_text_extractv3.0.10

Extract text from various file types before resorting to an OCR solution.

MaintenanceAging

PopularityNiche

Aging — last published over a year ago — check before adopting.

Grim is a simple gem for extracting a page from a pdf and converting it to an image as well as extract the text from the page as a string. It basically gives you an easy to use api to ghostscript, imagemagick, and pdftotext specific to this use case.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 2 years ago.

Treat is a natural language processing framework for Ruby.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 11 years ago.

textractv0.0.22

Extracts article text from a URL

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 11 years ago.

text_extractorv0.6.0

Easily extract data from text

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 6 years ago.

detabulatorv0.1.0

Extract columnar data from tabulated fixed-width text

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 15 years ago.

mongoid_fulltextv0.8.2

Full-text search for the Mongoid ORM, using n-grams extracted from text.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 8 years ago.

date_extractorv0.1.1

Extract dates from a text.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 8 years ago.

plaintextv0.3.7

Extract text from common office files. Based on the file's content type a command line tool is selected to do the job.

MaintenanceAging

PopularityNiche

Aging — last published 6 months ago — check before adopting.

ddr-extractionv0.3.0

File text and metadata extraction service.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 11 years ago.

adjectifierv0.0.2

The aim of this gem is to simply extract adjectives, or 'describing words' from arbitrary free text.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 15 years ago.

text-extractorv0.1.0

Extract text from common office files. Based on the file's content type a command line tool is selected to do the job.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 8 years ago.

NuGet matches

Showing 12 of 1,130 · .NET

See all NuGet →

itextsharpv5.5.13.5

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

azure.ai.textanalyticsv5.3.0

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

tikaondotnet.textextractorv1.17.1

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 8 years ago.

itextsharp.xmlworkerv5.5.13.5

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

microsoft.programsynthesis.extraction.textv10.16.5

No description provided.

MaintenanceAging

PopularityUnknown

Aging — last published 8 months ago — check before adopting.

pdfsharptextextractorv1.0.2

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 7 years ago.

uipathv9.0.6736.25739

No description provided.

MaintenanceDeprecated

PopularityUnknown

Deprecated. Don't start a new project on this.

bitmiracle.docotic.pdfv9.9.19928

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

pdf-extractv1.0.1

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 10 years ago.

evopdf.pdftotextv14.0.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

bytescout.textrecognitionv2.6.1.323

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

bytescout.pdfextractorv13.4.1.4801

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.