BETAmodules.com is in beta — open to partnerships & joint ventures.Build with us

cross-ecosystem search · live

Results for pdf-extract-text

Found in 3 of 7 ecosystemsnpm 1–24 of 327,049 · 789 matches across other registries

How we search: free-text on npm, crates.io, RubyGems, NuGet and Maven. PyPI and Go do exact-name lookup only. Tip: click an ecosystem chip below to filter; click Show all ecosystems to come back.

Sort

Auto-load on scroll

npm matches

Showing 24 of 327,049 · JavaScript

See all npm →

pdf-extract-textv0.1.3

npm

A fast, native Node.js module to extract and process text from PDF files using Rust and N-API. Built with [Tokio](https://tokio.rs/), [`pdf-extract`](https://docs.rs/pdf-extract), and [`text-splitter`](https://crates.io/crates/text-splitter), this package

Aging — last published over a year ago — check before adopting.

unpdfv1.6.2

npm

PDF extraction and rendering across all JavaScript runtimes

Maintained. Maintained, actively maintained.

pdf-text-extractv1.5.0

npm

Extract text from pdfs that contain searchable pdf text

Abandoned. Last published 9 years ago.

@cantoo/pdf-libv2.7.1

npm

Create and modify PDF files with JavaScript

Maintained. Maintained, actively maintained.

mini-css-extract-pluginv2.10.2

npm

extracts CSS into separate files

Maintained. Maintained, actively maintained.

@adobe/pdfservices-node-sdkv4.1.0

npm

The Adobe PDF Services Node.js SDK provides APIs for creating, combining, exporting and manipulating PDFs.

Abandoned. Last published over a year ago.

pdf.js-extractv1.0.1

npm

super-simple async PDF reader that extracts text with x,y page positions based on pdf.js

Maintained. Maintained, actively maintained.

pdf-tsv0.0.2

npm

PDF text extraction in TypeScript

Abandoned. Last published 2 years ago.

pdf-libv1.17.1

npm

Create and modify PDF files with JavaScript

Abandoned. Last published 4 years ago.

react-pdfv10.4.1

npm

Display PDFs in your React app as easily as if they were images.

Maintained. Maintained, actively maintained.

@syncfusion/ej2-pdf-data-extractv33.2.10

npm

This repository provides advanced support for data extraction from PDF documents

Maintained. Maintained, actively maintained.

pdf-parsev2.4.5

npm

Pure TypeScript, cross-platform module for extracting text, images, and tabular data from PDFs. Run directly in your browser or in Node!

Aging — last published 7 months ago — check before adopting.

@react-pdf/primitivesv4.3.0

npm

Define uninitialized elements

Maintained. Maintained, actively maintained.

office-text-extractorv4.0.0

npm

Yet another library to extract text from MS Office and PDF files

Maintained. Maintained, actively maintained.

@react-pdf/textkitv6.3.0

npm

An advanced text layout framework

Maintained. Maintained, actively maintained.

optimize-css-assets-webpack-pluginv6.0.1

npm

A Webpack plugin to optimize \ minimize CSS assets.

Abandoned. Last published 4 years ago.

pdf-parse-forkv1.2.0

npm

Pure javascript cross-platform module to extract text from PDFs.

Abandoned. Last published 2 years ago.

scribe.js-ocrv0.12.4

npm

High-quality OCR and text extraction for images and PDFs.

Maintained. Maintained, actively maintained.

n8n-nodes-htmlcsstopdfv3.2.5

npm

n8n community node to convert HTML and CSS to PDF using PdfMunk API - perfect for invoices, reports, certificates, and document generation

Maintained. Maintained, actively maintained.

pdf-extractv1.0.11

npm

Node PDF is a set of tools that takes in PDF files and converts them to usable formats for data processing. The library supports both extracting text from searchable pdf files as well as performing OCR on pdfs which are just scanned images of text

Abandoned. Last published 9 years ago.

@react-pdf/pdfkitv5.1.1

npm

A PDF generation library for Node.js

Maintained. Maintained, actively maintained.

@react-pdf/rendererv4.5.1

npm

Create PDF files on the browser and server

Maintained. Maintained, actively maintained.

expo-pdf-text-extractv1.1.0

npm

Native PDF text extraction for React Native and Expo. Extract text content from PDF files using platform-native APIs (PDFKit on iOS, PDFBox on Android). Works with Expo development builds.

Maintained. Maintained, actively maintained.

pdf2htmlv4.4.0

npm

PDF to HTML or Text conversion using Apache Tika. Also generate PDF thumbnail using Apache PDFBox.

Aging — last published 11 months ago — check before adopting.