A robust Node.js utility for extracting text from PDF, DOCX, DOC, XLSX, and TXT buffers.
Extract text from pdfs that contain searchable pdf text
Highlight hardcoded text, extract locale keys, and validate translations
PDF text extract
Native PDF text extraction for React Native and Expo. Extract text content from PDF files using platform-native APIs (PDFKit on iOS, PDFBox on Android). Works with Expo development builds.
PDF text extract
PDF text extract
PDF text extract
Native PDF text extraction for React Native and Expo. Extract text content from PDF files using platform-native APIs (PDFKit on iOS, PDFBox on Android). Works with Expo development builds.
extracts CSS into separate files
PDF text extract
PDF text extract
PDF text extract
PDF text extract
PDF text extract
PDF text extract
MCP server for summarizer ai. Features summarize text, extract key points, generate abstract. From MEOK AI Labs.
A Webpack plugin to optimize \ minimize CSS assets.
👉 https://hyper.fun/c/fluentui-icon-document-text-extract-filled/2.0.0
PDF extraction and rendering across all JavaScript runtimes
A light-weight module that brings Fetch API to node.js
A CSS Modules transform to extract local aliases for inline imports
A collection of utilities for emojis
unzip a zip file into a directory using 100% javascript
Native Rust PDF extraction engine: text, markdown for RAG, AcroForm widgets, image decoding, and encrypted PDFs. Lazy parser, persistent Document handle, no C dependencies.
Extract text, tables, and structured content from PDF files
Collection of algorithms for keyword extraction from text
PDF-to-Markdown conversion engine with smart heading detection, bold/italic text extraction, and CommonMark output. Pure Rust, best-effort parsing for corrupted PDFs.
The fastest Rust PDF library with text extraction: 0.8ms mean, 100% pass rate on 3,830 PDFs. 5× faster than pdf_extract, 17× faster than oxidize_pdf. Extract, create, and edit PDFs.
A CLI tool to convert Arabic PDFs to text using Google's Gemini API
PDF text extraction CLI tools
PDF content extraction library
Extract plain text from HTML, PDF, and other document formats
Lightweight, fast DOCX text extraction library with minimal dependencies
elizaOS PDF Plugin - PDF reading and text extraction
A Rust library for extracting metadata, table of contents, text, cover, and images from EPUB files.
Extract text from various file types before resorting to an OCR solution.
Grim is a simple gem for extracting a page from a pdf and converting it to an image as well as extract the text from the page as a string. It basically gives you an easy to use api to ghostscript, imagemagick, and pdftotext specific to this use case.
Treat is a natural language processing framework for Ruby.
Extracts article text from a URL
Easily extract data from text
Extract columnar data from tabulated fixed-width text
Full-text search for the Mongoid ORM, using n-grams extracted from text.
Extract dates from a text.
Extract text from common office files. Based on the file's content type a command line tool is selected to do the job.
File text and metadata extraction service.
The aim of this gem is to simply extract adjectives, or 'describing words' from arbitrary free text.
Extract text from common office files. Based on the file's content type a command line tool is selected to do the job.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.