BETAmodules.com is in beta — open to partnerships & joint ventures.Build with us

Home Search Compare Equivalents

One search box and one honest, consistent read on every open-source library — across every ecosystem.

npmPyPIcrates.ioRubyGemsGoMavenNuGet

Discover

Tools

Compare Equivalents

Data

deps.dev OSV advisories npm registry PyPI

About

Methodology Partner with us

© 2026 Modules · A precision instrument for picking dependencies.Data refreshed continuously from public registries, deps.dev & OSV

cross-ecosystem search · live

Results for pdf-text-extract

Found in 4 of 7 ecosystemsnpm 1–24 of 326,995 · 986 matches across other registries

npm326995 crates.io1 RubyGems12 NuGet973

How we search: free-text on npm, crates.io, RubyGems, NuGet and Maven. PyPI and Go do exact-name lookup only. Tip: click an ecosystem chip below to filter; click Show all ecosystems to come back.

Sort

Auto-load on scroll

npm matches

Showing 24 of 326,995 · JavaScript

See all npm →

pdf-text-extractv1.5.0

Extract text from pdfs that contain searchable pdf text

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 9 years ago.

expo-pdf-text-extractv1.1.0

Native PDF text extraction for React Native and Expo. Extract text content from PDF files using platform-native APIs (PDFKit on iOS, PDFBox on Android). Works with Expo development builds.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@effishai/pdf-extract-node-win32-arm64-msvcv0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@effishai/pdf-extract-node-linux-arm64-muslv0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@effishai/pdf-extract-nodev0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@effishai/pdf-extract-node-win32-x64-msvcv0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@effishai/pdf-extract-node-darwin-arm64v0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@paul_sizon/expo-pdf-text-extractv1.0.1

Native PDF text extraction for React Native and Expo. Extract text content from PDF files using platform-native APIs (PDFKit on iOS, PDFBox on Android). Works with Expo development builds.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@effishai/pdf-extract-node-darwin-universalv0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@effishai/pdf-extract-node-linux-x64-muslv0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@effishai/pdf-extract-node-linux-x64-gnuv0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@effishai/pdf-extract-node-darwin-x64v0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

@effishai/pdf-extract-node-linux-arm64-gnuv0.0.5

PDF text extract

MaintenanceAging

PopularityUnknown

Aging — last published over a year ago — check before adopting.

PDF extraction and rendering across all JavaScript runtimes

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@cantoo/pdf-libv2.7.1

Create and modify PDF files with JavaScript

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

mini-css-extract-pluginv2.10.2

extracts CSS into separate files

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@vestfoldfylke/pdf-text-extractv2.0.3

Node module that extracts metadata, text-content, and styling from readable pdf-files

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@adobe/pdfservices-node-sdkv4.1.0

The Adobe PDF Services Node.js SDK provides APIs for creating, combining, exporting and manipulating PDFs.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published over a year ago.

pdf.js-extractv1.0.1

super-simple async PDF reader that extracts text with x,y page positions based on pdf.js

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

PDF text extraction in TypeScript

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

Create and modify PDF files with JavaScript

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 4 years ago.

react-pdfv10.4.1

Display PDFs in your React app as easily as if they were images.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

@syncfusion/ej2-pdf-data-extractv33.2.8

This repository provides advanced support for data extraction from PDF documents

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

pdf-parsev2.4.5

Pure TypeScript, cross-platform module for extracting text, images, and tabular data from PDFs. Run directly in your browser or in Node!

MaintenanceAging

PopularityUnknown

Aging — last published 7 months ago — check before adopting.

1 2 3 4 5…13625

crates.io matches

1 match · Rust

pdf-text-extractv0.2.0

Extract text, tables, and structured content from PDF files

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

RubyGems matches

Exact match · Ruby

Grim is a simple gem for extracting a page from a pdf and converting it to an image as well as extract the text from the page as a string. It basically gives you an easy to use api to ghostscript, imagemagick, and pdftotext specific to this use case.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 2 years ago.

pdfbox_text_extractionv1.2.0

This gem lets you extract plain text from PDF documents. It is a Jruby wrapper for the Apache PDFBox library.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 9 years ago.

chupa-text-decomposer-pdfv1.1.1

This is a ChupaText decomposer plugin for to extract text and meta-data from PDF. You can use `pdf` decomposer.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 7 years ago.

textractorv0.2.0

simple wrapper around CLI for extracting text from PDF and Word documents

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 14 years ago.

kreuzbergv4.9.8

Kreuzberg is a high-performance document intelligence library with a Rust core and native Ruby bindings via Magnus. Extract text, metadata, and structured data from 75+ file formats including PDF, DOCX, PPTX, XLSX, HTML, RTF, images (with OCR), email, archives, and more. Features async/sync APIs, text chunking, language detection, and keyword extraction.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

HexaPDF is a pure Ruby library with an accompanying application for working with PDF files. In short, it allows creating new PDF files, manipulating existing PDF files, merging multiple PDF files into one, extracting meta information, text, images and files from PDF files, securing PDF files by encrypting them and optimizing PDF files for smaller file size or other criteria. HexaPDF was designed with ease of use and performance in mind. It uses lazy loading and lazy computing when possible and tries to produce small PDF files by default.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

hirobumi-chocolate_disco-jrubyv0.1.5

Provides methods to extract texts from various file formats like Microsoft Office (<= 2002, as well as >= 2007,) PDF and HTML.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 11 years ago.

act_as_page_extractorv0.7.3

Library (Docsplit wrapper) for text extraction from pdf, doc/x, txt files with OpenOffice

MaintenanceAging

PopularityNiche

Aging — last published 9 months ago — check before adopting.

Extracts text from PDF files using Tesseract, the text is added to the PDF as a background layer.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 3 years ago.

Provides a very simple extraction resource for extracing text from slices of a PDF.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 14 years ago.

pdf_table_extractorv0.1.0

Extracts tables from PDF text using spacing and position heuristics.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

chupa-text-decomposer-libreofficev1.0.2

This is a ChupaText decomposer plugin for to extract text and meta-data from office files such as Microsoft Word file, Microsoft Excel file and OpenDocument Format file. It uses [LibreOffice](https://www.libreoffice.org/). You can use `libreoffice` decomposer. It depends on `pdf` decomposer. Because it converts a office file to PDF file and extracts text and meta-data by `pdf` decomposer.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 7 years ago.

NuGet matches

Showing 12 of 973 · .NET

See all NuGet →

itextsharpv5.5.13.5

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

aspose.pdfv26.5.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

syncfusion.pdf.net.corev33.2.8

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

bitmiracle.docotic.pdfv9.9.19928

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

itextsharp.xmlworkerv5.5.13.5

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

xdoc.pdfv12.6.1

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

spire.pdfv12.5.8

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

freespire.pdfv12.4.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

aspose.pdf.drawingv26.5.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

select.pdf.netcorev26.2.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

select.pdfv26.2.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

apitron.pdf.kitv2.0.57

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.