Document classification using tesseract.js and string-similarity-js.
A Node.js wrapper for the Tesseract OCR API
A robust, strictly-typed Node.js and Browser library for parsing office files (.docx, .pptx, .xlsx, .odt, .odp, .ods, .pdf, .rtf, .csv, .md, .html) and generating high-fidelity outputs in Markdown, HTML, CSV, RTF, and RAG-focused chunks.
The official TypeScript library for the Llama Cloud API
A React Native Vision Camera plugin for real-time OCR (Optical Character Recognition). This package enables seamless integration of on-device OCR by using Google ML Kit on Android and Apple's Vision Framework on iOS. It provides fast, efficient, and cross
FrankieOne OneSDK
High-quality OCR and text extraction for images and PDFs.
OCR library built on Tesseract
n8n community node for Deep-OCR document processing API
Fast PDF classification and text extraction. Detect text-based vs scanned PDFs, extract text by region with quality checks. Native Rust performance via napi-rs.
PDF to Markdown and DOCX conversion powered by Mistral OCR.
Apple Vision OCR + image/PDF analysis for Node.js, with optional Ollama-driven Markdown pipeline — native, fast, offline
IDVerse Identity Verification SDK for web.
The JS version of DdddOcr
Lightweight, probably the fastest PaddleOCR SDK in TypeScript. Runs anywhere JavaScript runs: Node.js, Bun, Deno, web browsers, and browser extensions. Docker & CLI supported. The official SDK is browser-only. Accurate text detection and recognition for d
A WebdriverIO service that is using Tesseract OCR for Desktop/Mobile Web and Mobile Native App tests.
An Appium 2.0 plugin that uses Tesseract to find screen regions by visual text
Guten OCR is a high accurate text detection (OCR) Javascript/Typescript library that runs on Node.js, Browser, React Native and C++. Based on PaddleOCR and ONNX runtime
`@contracthero/common` contains utility functions that are used across the ContractHero platform.
Node.js wrapper for ocr.space APIs.
MCP server wrapping Apple Vision Framework for local OCR and image analysis — no cloud, no API keys.
Aspose.Words Cloud SDK for Node.js
Official Talonic MCP server. Lets AI agents extract structured, schema-validated data from any document via the Model Context Protocol.
English | [简体中文](README-CN.md) 
No description provided.
No description provided.