plain text extractor plugin for markdown-it markdown parser
A text extractor library for Node.js
A PDF to Text Extractor
Yet another library to extract text from MS Office and PDF files
Easy key-based text extractor and loader for Nuxt 3
A text extractor for extracting text from HTML, PDF, Image and other files.
A React Native library for efficient text recognition (OCR) using MLKit and Vision, enabling seamless text extraction from images.
Production-grade SEC EDGAR toolkit — XBRL parser, filing text extractor, and financial analysis utilities
Fork of office-text-extractor with unreleased changes that include browser support
A universal text extractor for files.
Directus Text Extractor
Text Extractor for nodejs
lightweight heading text extractor from markdown files
Very Fast Pure JS Text Extractor For Your Office Files
Xcrap Image Text Extractor is a package of the Xcrap framework that abstracts the extraction of texts from images using the node-tesseract-ocr library.
A text extractor for extracting text from HTML, PDF, Image and other files.
No description provided.
Node.js package to read Word .doc files
SWC plugin for extracting inline messages.
a JS/TS text extractor
A helper library for loading and saving the .api.json files created by API Extractor
Analyze the exported API for a TypeScript library and generate reviews, documentation, and .d.ts rollups
Gettext extractor for JavaScript, TypeScript, JSX and HTML
A library for bundling selected files and dependencies into a deployable package.
Basic structured text extraction using mupdf-rs.
Extracts gettext strings from Javascript/TypeScript files
Easily extract data from text
Extract text from common office files. Based on the file's content type a command line tool is selected to do the job.
twitter text extraction utilities
ChupaText is an extensible text extractor. You can plug your custom text extractor in ChupaText. You can write your plugin by Ruby.
document converter and plain text extractor
Website crawler and fulltext indexer.
Ruby bindings for the hwarang Rust library. Extracts text from HWP and HWPX documents.
PDF::Extractor is a library that provides high level access to the text objects of a PDF document.
SelectPdf Online REST API is a professional solution for managing PDF documents online. SelectPdf cloud API consists of the following: HTML to PDF REST API – SelectPdf HTML To PDF Online REST API is a professional solution that lets you create PDF from web pages and raw HTML code in your applications. PDF to TEXT REST API – SelectPdf Pdf To Text REST API is an online solution that lets you extract text from your PDF documents or search your PDF document for certain words. PDF Merge REST API – SelectPdf Pdf Merge REST API is an online solution that lets you merge local or remote PDFs into a final PDF document.
Using a variety of APIs (Yahoo term Extractor and Alchemy are currently supported), semantic_extraction can automatically return a collection of keywords for an arbitrary block of text. If using Alchemy, it can also return named entities.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.