Display PDFs in your React app as easily as if they were images.
Create and modify PDF files with JavaScript
Define uninitialized elements
Pure TypeScript, cross-platform module for extracting text, images, and tabular data from PDFs. Run directly in your browser or in Node!
PDF extraction and rendering across all JavaScript runtimes
Generic build of Mozilla's PDF.js library.
An advanced text layout framework
A PDF generation library for Node.js
Create PDF files on the browser and server
Node.js utility to convert PDF file/buffer pages to PNG files/buffers. No build-time compilation required — pre-built native binaries included for all major platforms.
A PDF generation library for Node.js
html table export
Format data using ECMA-376 spreadsheet Format Codes
PDF embed component for Vue
Create and modify PDF files with JavaScript
Formatting user's text input on-the-fly
String formatting library inspired from Python
Syncfusion TypeScript Component
Database to mime-format based on content-type header and content
A robust, strictly-typed Node.js and Browser library for parsing office files (.docx, .pptx, .xlsx, .odt, .odp, .ods, .pdf, .rtf, .csv, .md, .html) and generating high-fidelity outputs in Markdown, HTML, CSV, RTF, and RAG-focused chunks.
Read text and parse tables from PDF files. Supports tabular data with automatic column detection, and rule-based parsing.
The Adobe PDF Services Node.js SDK provides APIs for creating, combining, exporting and manipulating PDFs.
A PDF generation library for Node.js
Dead simple pdf text reader
Scrape text from common file formats (.pdf,.doc,.docx, .sketch, .txt) with a single convenient command.
Kreuzberg is a high-performance document intelligence library with a Rust core and native Ruby bindings via Magnus. Extract text, metadata, and structured data from 75+ file formats including PDF, DOCX, PPTX, XLSX, HTML, RTF, images (with OCR), email, archives, and more. Features async/sync APIs, text chunking, language detection, and keyword extraction.
Convert PDF docs to beautiful HTML files without losing text or format. This gem uses pdf2htmlEX to do the conversion.
A nifty gem, in pure Ruby, to parse PDF files and combine (merge) them with other PDF files, number the pages, watermark them or stamp them, create tables, add basic text objects etc` (all using the PDF file format).
If you have multiple format files like Image files(Eg. .png, .jpg, .jpeg etc.), Document files(Eg. .doc, .docx, text etc.) or PDF files and now you want to make a single PDF with all format of files, this gem will help you to compile multiple format files into single pdf.
Provides methods to extract texts from various file formats like Microsoft Office (<= 2002, as well as >= 2007,) PDF and HTML.
Docsplit is a command-line utility and Ruby library for splitting apart documents into their component parts: searchable UTF-8 plain text, page images or thumbnails in any format, PDFs, single pages, and document metadata (title, author, number of pages...)
Aspose.PDF Cloud is a REST API for creating and editing PDF files. Most popular features proposed by Aspose.PDF Cloud: PDF to Word, Convert PDF to Image, Merge PDF, Split PDF, Add Images to PDF, Rotate PDF. It can also be used to convert PDF files to different formats like DOC, HTML, XPS, TIFF and many more. Aspose.PDF Cloud gives you control: create PDFs from scratch or from HTML, XML, template, database, XPS or an image. Render PDFs to image formats such as JPEG, PNG, GIF, BMP, TIFF and many others. Aspose.PDF Cloud helps you manipulate elements of a PDF file like text, annotations, watermarks, signatures, bookmarks, stamps and so on. Its REST API also allows you to manage PDF pages by using features like merging, splitting, and inserting. Add images to a PDF file or convert PDF pages to images.
This is a ChupaText decomposer plugin for to extract text and meta-data from office files such as Microsoft Word file, Microsoft Excel file and OpenDocument Format file. It uses [LibreOffice](https://www.libreoffice.org/). You can use `libreoffice` decomposer. It depends on `pdf` decomposer. Because it converts a office file to PDF file and extracts text and meta-data by `pdf` decomposer.
This is a ChupaText decomposer plugin for to extract text and meta-data from office files such as Microsoft Word file, Microsoft Excel file and OpenDocument Format file. It uses [LibreOffice](https://www.libreoffice.org/). You can use `libreoffice` decomposer. It depends on `pdf` decomposer. Because it converts a office file to PDF file and extracts text and meta-data by `pdf` decomposer.
Eleanor is a Ruby script and accompanying library for formatting speculative screenplays. It parses plain text written in a simple format and outputs pretty PDF that conforms to standard rules of screenplay layout. Eleanor's primary goal is to create PDF that is indistinguishable from PDF produced by professional screenwriting software such as Final Draft.
This is a ChupaText decomposer plugin to extract text and meta-data from OpenDocument Text file format file (`.odt` file). This plugin uses [LibreOffice](https://www.libreoffice.org/). You can use `libreoffice-opendocument-text` decomposer. It depends on `pdf` decomposer. Because it converts a office file to PDF file and extracts text and meta-data by `pdf` decomposer.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.