BETAmodules.com is in beta — open to partnerships & joint ventures.Build with us

cross-ecosystem search · live

Results for ocr-llm

Found in 3 of 7 ecosystemsnpm 1–24 of 21,492 · 11 matches across other registries

How we search: free-text on npm, crates.io, RubyGems, NuGet and Maven. PyPI and Go do exact-name lookup only. Tip: click an ecosystem chip below to filter; click Show all ecosystems to come back.

Sort

Auto-load on scroll

npm matches

Showing 24 of 21,492 · JavaScript

See all npm →

ocr-llmv0.4.15

npm

Fast, ultra-accurate text extraction from any image or PDF, even challenging ones, with structured markdown output powered by vision models.

Aging — last published over a year ago — check before adopting.

plugin-docpixiev1.1.1

npm

Adaptive RAG agent for document analysis — OCR + LLM Vision + structured extraction.

Maintained. Maintained, actively maintained.

ocr-llm-sdkv1.1.1

npm

- `OcrLlmSDK` 是一个基于轮询机制的 OCR + LLM 异步任务管理 SDK，提供任务创建、结果获取、状态监听等功能，支持 **图片 URL**、**Base64**、**二进制文件** 方式发起任务，并可自动轮询更新任务状态与结果。

Aging — last published 9 months ago — check before adopting.

node-tesseract-ocrv2.2.1

npm

A Node.js wrapper for the Tesseract OCR API

Has 1 high-severity advisory. Verify a patched version exists before using.

ppu-paddle-ocrv5.8.3

npm

Lightweight, probably the fastest PaddleOCR SDK in TypeScript. Runs anywhere JavaScript runs: Node.js, Bun, Deno, web browsers, and browser extensions. Docker & CLI supported. The official SDK is browser-only. Accurate text detection and recognition for d

Maintained. Maintained, actively maintained.

@wdio/ocr-servicev2.2.9

npm

A WebdriverIO service that is using Tesseract OCR for Desktop/Mobile Web and Mobile Native App tests.

Maintained. Maintained, actively maintained.

appium-ocr-pluginv0.3.0

npm

An Appium 2.0 plugin that uses Tesseract to find screen regions by visual text

Aging — last published 9 months ago — check before adopting.

@gutenye/ocr-modelsv1.4.2

npm

Guten OCR is a high accurate text detection (OCR) Javascript/Typescript library that runs on Node.js, Browser, React Native and C++. Based on PaddleOCR and ONNX runtime

Abandoned. Last published 2 years ago.

@precisa-saude/fhir-ocr-utilsv0.15.5

npm

Utilitários de ancoragem OCR para extração de biomarcadores de PDFs de resultados laboratoriais

Maintained. Maintained, actively maintained.

ocr-space-api-wrapperv2.4.7

npm

Node.js wrapper for ocr.space APIs.

Maintained. Maintained, actively maintained.

@atomicbotai/computer-usev0.1.12

npm

Framework-agnostic desktop computer-use tool for AI agents: screenshot, mouse, keyboard, OCR, overlay

Maintained. Maintained, actively maintained.

nuktaav0.1.24

npm

Nuktaa helps AI teams turn public or private source material into usable knowledge for LLM applications.

Maintained. Maintained, actively maintained.

@gutenye/ocr-commonv1.4.8

npm

Guten OCR is a high accurate text detection (OCR) Javascript/Typescript library that runs on Node.js, Browser, React Native and C++. Based on PaddleOCR and ONNX runtime

Aging — last published over a year ago — check before adopting.

llm-spend-guardv2.0.6

npm

Enforce real-time token budgets and spending limits for OpenAI, Anthropic Claude, and Google Gemini API calls in Node.js

Maintained. Maintained, actively maintained.

react-native-vision-camera-ocr-plusv2.0.0

npm

React Native Vision Camera plugin for on-device text recognition (OCR) and translation using ML Kit. Maintained fork of react-native-vision-camera-text-recognition

Maintained. Maintained, actively maintained.

@bear-block/vision-camera-ocrv1.1.2

npm

A React Native Vision Camera plugin for real-time OCR (Optical Character Recognition). This package enables seamless integration of on-device OCR by using Google ML Kit on Android and Apple's Vision Framework on iOS. It provides fast, efficient, and cross

Maintained. Maintained, actively maintained.

@fnet/ocr-text-coordsv0.1.1

npm

This project is designed to facilitate the extraction of text and their corresponding bounding box coordinates from images, using the Tesseract.js library. It supports various input formats such as file paths, URLs, or Base64 encoded strings and can handl

Abandoned. Last published over a year ago.

ai-cost-meterv1.0.0

npm

Lightweight, zero-dependency LLM API cost & token usage tracker for OpenAI, Anthropic, Gemini, Mistral, Groq, and DeepSeek