Fast, zero-dependency PDF toolkit for Node.js, browsers, and edge runtimes — text extraction, markdown/HTML conversion, search, form filling, creation, and editing. Rust core compiled to WebAssembly.
[FIPS 140-3 validated build] High-performance PDF parsing and text extraction library — prebuilt native bindings, no build toolchain required
High-performance PDF parsing and text extraction library — prebuilt native bindings, no build toolchain required
The fastest Rust PDF library with text extraction: 0.8ms mean, 100% pass rate on 3,830 PDFs. 5× faster than pdf_extract, 17× faster than oxidize_pdf. Extract, create, and edit PDFs.
CLI for pdf-oxide — the fastest PDF toolkit. 22 commands: text extraction, PDF to markdown, search, merge, split, images, compress, encrypt, watermark, forms, and more.
MCP server for PDF extraction — gives Claude, Cursor, and AI assistants the ability to read PDFs locally. Text, markdown, and HTML output. Powered by pdf_oxide.
REST API for oxidizePdf (Community edition)
Pure Rust PDF library for AI/RAG: structure-aware chunking with bounding boxes, heading context, and token estimates. No Python, no ML, no C bindings.
Command-line interface for oxidizePdf
Extract plain text from HTML, PDF, and other document formats
Fast Rust CLI wrapper around pdf_oxide for LLM-friendly PDF extraction
High-performance document intelligence library for Rust. Extract text, metadata, and structured data from PDFs, Office documents, images, and 90+ formats and 300+ programming languages via tree-sitter code intelligence with async/sync APIs.
The PDF Operating Layer for AI Agents — 57 tools for inspect, extract, generate, convert, manipulate, secure, and fill PDFs
Token-aware, multi-format text chunking library with language-aware semantic splitting
Source line analysis tool with CLI, web UI, HTML/PDF reports, and CI/CD integration
No description provided.
No description provided.