Ultra-fast, offline, and free PDF OCR using native macOS Vision Framework and PDFKit. Supports Vietnamese & English.
Fast PDF classification and text extraction. Detect text-based vs scanned PDFs, extract text by region with quality checks. Native Rust performance via napi-rs.
High-quality OCR and text extraction for images and PDFs.
The Adobe PDF Services Node.js SDK provides APIs for creating, combining, exporting and manipulating PDFs.
A Node.js wrapper for the opendataloader-pdf Java CLI.
Apple Vision OCR + image/PDF analysis for Node.js, with optional Ollama-driven Markdown pipeline — native, fast, offline
Read text and parse tables from PDF files. Supports tabular data with automatic column detection, and rule-based parsing.
A robust, strictly-typed Node.js and Browser library for parsing office files (.docx, .pptx, .xlsx, .odt, .odp, .ods, .pdf, .rtf, .csv, .md, .html) and generating high-fidelity outputs in Markdown, HTML, CSV, RTF, and RAG-focused chunks.
Fast PDF classification, text extraction, and image extraction. Native Rust performance via napi-rs.
PDF parsing library using Docling SDK with OCR support for macOS
A Node.js wrapper for the Tesseract OCR API
Nuktaa helps AI teams turn public or private source material into usable knowledge for LLM applications.
Node PDF is a set of tools that takes in PDF files and converts them to usable formats for data processing. The library supports both extracting text from searchable PDF files and performing OCR on PDFs that are just scanned images of text.
n8n community node to convert HTML and CSS to PDF using PdfMunk API - perfect for invoices, reports, certificates, and document generation
Asynchronous Node.js wrapper for the Poppler PDF rendering utilities
Pure TypeScript, cross-platform module for extracting text, images, and tabular data from PDFs. Run directly in your browser or in Node!
Display PDFs in your React app as easily as if they were images.
A simple wrapper around command-line utils to assist in PDF / Image OCR (Optical Character Recognition) processing using Tesseract.
MCP server wrapping Apple Vision Framework for local OCR and image analysis — no cloud, no API keys.
Create and modify PDF files with JavaScript
Pi extension: Zero-setup multi-backend OCR — MinerU (free cloud), Ollama (local GPU, LaTeX formulas), Pix2Text (local Python). Extract text, formulas, and tables from images and PDFs. Default: zero config, works out of the box.
OCR addon for qvac
Small, fast and advanced PNG / APNG encoder and decoder