Extract pages from a PDF into canvas elements on the client side
A library to extract content from PDFs
Fast Rust CLI wrapper around pdf_oxide for LLM-friendly PDF extraction
The fastest Rust PDF library with text extraction: 0.8ms mean, 100% pass rate on 3,830 PDFs. 5× faster than pdf_extract, 17× faster than oxidize_pdf. Extract, create, and edit PDFs.
Fast pure-Rust PDF extraction library and CLI — ~10-50x faster than pdfplumber for text, word, table, layout, image, and metadata extraction from PDFs. By Clark Labs Inc.
Extract text from email attachments (PDF + image OCR). PDF text via `pdf-extract` (pure Rust); OCR via the `tesseract` CLI subprocess (not linked as a C library). Two-stage fallback for scanned PDFs: try embedded text first, fall back to OCR on the raw bytes if the text is too short. Returns `ExtractionResult` with text + language + confidence + page count + JSON metadata.
Content extraction with built-in sanitization via hanzo-guard
Extract text, tables, and structured content from PDF files
A Rust toolkit for detecting and extracting metadata, text, and content from various file formats
Extract footprint/land-pattern drawings from PDF datasheets.
PDF content extraction tool and library.
A command-line utility for extracting annotation and field metadata from a PDF in JSON format.
Extract all images, with format conversions, based on the PDF::Reader library
PDFTk wrapper to extract form fields
No description provided.