Fast, ultra-accurate text extraction from any image or PDF, even challenging ones, with structured markdown output powered by vision models.
Adaptive RAG agent for document analysis — OCR + LLM Vision + structured extraction.
- `OcrLlmSDK` 是一个基于轮询机制的 OCR + LLM 异步任务管理 SDK,提供任务创建、结果获取、状态监听等功能,支持 **图片 URL**、**Base64**、**二进制文件** 方式发起任务,并可自动轮询更新任务状态与结果。
A Node.js wrapper for the Tesseract OCR API
Lightweight, probably the fastest PaddleOCR SDK in TypeScript. Runs anywhere JavaScript runs: Node.js, Bun, Deno, web browsers, and browser extensions. Docker & CLI supported. The official SDK is browser-only. Accurate text detection and recognition for d
A WebdriverIO service that is using Tesseract OCR for Desktop/Mobile Web and Mobile Native App tests.
An Appium 2.0 plugin that uses Tesseract to find screen regions by visual text
Guten OCR is a high accurate text detection (OCR) Javascript/Typescript library that runs on Node.js, Browser, React Native and C++. Based on PaddleOCR and ONNX runtime
Utilitários de ancoragem OCR para extração de biomarcadores de PDFs de resultados laboratoriais
Node.js wrapper for ocr.space APIs.
Framework-agnostic desktop computer-use tool for AI agents: screenshot, mouse, keyboard, OCR, overlay
Nuktaa helps AI teams turn public or private source material into usable knowledge for LLM applications.
Guten OCR is a high accurate text detection (OCR) Javascript/Typescript library that runs on Node.js, Browser, React Native and C++. Based on PaddleOCR and ONNX runtime
Enforce real-time token budgets and spending limits for OpenAI, Anthropic Claude, and Google Gemini API calls in Node.js
React Native Vision Camera plugin for on-device text recognition (OCR) and translation using ML Kit. Maintained fork of react-native-vision-camera-text-recognition
A React Native Vision Camera plugin for real-time OCR (Optical Character Recognition). This package enables seamless integration of on-device OCR by using Google ML Kit on Android and Apple's Vision Framework on iOS. It provides fast, efficient, and cross
This project is designed to facilitate the extraction of text and their corresponding bounding box coordinates from images, using the Tesseract.js library. It supports various input formats such as file paths, URLs, or Base64 encoded strings and can handl
Lightweight, zero-dependency LLM API cost & token usage tracker for OpenAI, Anthropic, Gemini, Mistral, Groq, and DeepSeek
Google on-device MLKit text recognition for React Native
Official Talonic MCP server. Lets AI agents extract structured, schema-validated data from any document via the Model Context Protocol.
OCR addon for qvac
OCR via system provided API
Typescript bindings for langchain
Display language model outputs in your React project.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.