Extracts text from files of various types, including html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf, text/*, and various OpenOffice formats.
Convert any document (PDF, DOC, DOCX) to clean Markdown for RAG
Search PDF, DOC and DOCX files
High-level PDF manipulation and generation module for Node.js and browser
This allows you to render documents like PDF, DOC, XLS and PPT.
Generate PDF tables with JavaScript (jsPDF plugin)
The Adobe PDF Services Node.js SDK provides APIs for creating, combining, exporting and manipulating PDFs.
Image structural similarity (SSIM). In TypeScript/JavaScript. For browser/server.
Create and modify PDF files with JavaScript
A render engine for Node and the browser
Print Commands & Files, Manage Printers & Scan Docs from Javascript. JSPrintManager Solution allows you to print RAW Printer Commands as well as known File Formats (PDF, JPG, PNG, TIFF, etc.) from Javascript right to any printer available at the client ma
Angular document viewer.
Pure TypeScript, cross-platform module for extracting text, images, and tabular data from PDFs. Run directly in your browser or in Node!
Office preview plugin for React Native (supports png, pdf, doc, docx, xls, txt...)
Display PDFs in your React app as easily as if they were images.
Atom A28 — Multi-Modal PDF/Resume Parser with Vision LLM. PDF/Doc content to structured extraction via Claude vision or OpenAI. Sprint 11 Cross-cutting.
Small, fast and advanced PNG / APNG encoder and decoder
Create a writing document and save to PDF with Rust.
Compatibility document facade types for GraphitePDF.
PDF to DOCX conversion with text, tables, and images
Turn documentation stored in the doc folder of your Rails app into a PDF file using wkhtmltopdf
Converts a folder structure containing .doc/.docx files into a folder structure of .pdf files
Transform Markdown docs into two-column PDFs.
Scrape text from common file formats (.pdf, .doc, .docx, .sketch, .txt) with a single convenient command.
DANFE and DACTE pdf generator for Brazilian invoices and transportation docs.
A simple wrapper that calls a given command-line tool for each supported input format
Read text and metadata from files and documents (.doc, .docx, .pages, .odt, .rtf, .pdf)
Form filler for PDF and Doc files
Kreuzberg is a high-performance document intelligence library with a Rust core and native Ruby bindings via Magnus. Extract text, metadata, and structured data from 75+ file formats including PDF, DOCX, PPTX, XLSX, HTML, RTF, images (with OCR), email, archives, and more. Features async/sync APIs, text chunking, language detection, and keyword extraction.
Read text and metadata from files and documents using Apache Tika toolkit
Convert PDF docs to beautiful HTML files without losing text or format. This gem uses pdf2htmlEX to do the conversion.