BETAmodules.com is in beta — open to partnerships & joint ventures.Build with us

Home Search Compare Equivalents

One search box and one honest, consistent read on every open-source library — across every ecosystem.

npmPyPIcrates.ioRubyGemsGoMavenNuGet

Discover

Tools

Compare Equivalents

Data

deps.dev OSV advisories npm registry PyPI

About

Methodology Partner with us

© 2026 Modules · A precision instrument for picking dependencies.Data refreshed continuously from public registries, deps.dev & OSV

cross-ecosystem search · live

Results for pdf_extractor

Found in 5 of 7 ecosystems · 107 matches across other registries

PyPI1 crates.io47 RubyGems3 Maven3 NuGet53

How we search: free-text on npm, crates.io, RubyGems, NuGet and Maven. PyPI and Go do exact-name lookup only. Tip: click an ecosystem chip below to filter; click Show all ecosystems to come back.

Sort

Auto-load on scroll

PyPI matches

Exact match · Python

pdf_extractorv0.1.0

Extracts text from PDF files, utilises multiple cores.

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 9 years ago.

crates.io matches

Showing 12 of 47 · Rust

See all crates.io →

hanzo-extractv0.1.0

Content extraction with built-in sanitization via hanzo-guard

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

project-ragv0.1.0

RAG-based codebase indexing and semantic search - dual purpose library and MCP server

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

extractousv0.3.0

Extractous provides a fast and efficient way to extract content from all kind of file formats including PDF, Word, Excel CSV, Email etc... Internally it uses a natively compiled Apache Tika for formats are not supported natively by the Rust core

MaintenanceAging

PopularityRising

Aging — last published over a year ago — check before adopting.

indicator-extractorv0.2.0

Extract indicators (IP, domain, email, hashes, etc.) from a string or a PDF file

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published over a year ago.

PDF → Markdown extractor with figure rasterization, table & banner detection. Built on pdfium-render.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

corpus-preprocv0.1.0

A preprocessor for text and HTML corpora

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 4 years ago.

A pure-Rust PDF library — create, parse, and render PDF documents with zero C dependencies

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

deformatv0.15.1

Extract plain text from HTML, PDF, and other document formats

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

pdf-interpretv0.5.6

A crate for interpreting PDF files.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

nosy: various contents summarization tool powered by artificial intelligence

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

Get markdown out of any document — Pandoc + pdfium + platform-native OCR, dispatched per format.

MaintenanceHealthy

PopularityNiche

Maintained. Niche but maintained, actively maintained.

screenplay-doc-parser-rsv0.1.10

Tools to parse Screenplay-formatted documents into semantically-typed structs.

MaintenanceAging

PopularityNiche

Aging — last published 8 months ago — check before adopting.

RubyGems matches

3 matches · Ruby

pdf_extractorv0.1.1

PDFTk wrapper to extract form fiels

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 6 years ago.

nameday_vvc_pdf_extractorv0.1.3

Nameday data extraction from Valsts valodas centrs PDF

MaintenanceAbandoned

PopularityNiche

Abandoned. Last published 4 years ago.

pdf_table_extractorv0.1.0

Extracts tables from PDF text using spacing and position heuristics.

MaintenanceAging

PopularityNiche

Aging — last published 6 months ago — check before adopting.

Maven matches

3 matches · Java

com.beehyv:pdf-extractorv1.0

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 5 years ago.

de.cit-ec.scie:pdf-extractor-guiv2.0.1

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 11 years ago.

de.cit-ec.scie:pdf-extractorv2.0.1

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 11 years ago.

NuGet matches

Showing 12 of 53 · .NET

See all NuGet →

pdfsharptextextractorv1.0.2

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 7 years ago.

evopdf.clientv14.0.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

groupdocs.parserv26.4.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

winnovative.pdfimagesextractorv14.0.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

deftsoft.eextractorv0.1.0.9

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 8 years ago.

evopdf.pdfimagesextractorv14.0.0

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

textextractorv1.0.0

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 8 years ago.

bytescout.pdfextractorv13.4.1.4801

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

melville.pdf.imageextractorv0.6.5

No description provided.

MaintenanceHealthy

PopularityUnknown

Maintained. Maintained, actively maintained.

pdftextextractorv1.0.1

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 2 years ago.

pdftextractv1.0.4961.18500

No description provided.

MaintenanceAbandoned

PopularityUnknown

Abandoned. Last published 12 years ago.

melville.pdf.textextractorv0.6.4

No description provided.

MaintenanceAging

PopularityUnknown

Aging — last published 12 months ago — check before adopting.