English | [简体中文](README-CN.md) 
Guten OCR is a high accurate text detection (OCR) Javascript/Typescript library that runs on Node.js, Browser, React Native and C++. Based on PaddleOCR and ONNX runtime
React Native Vision Camera plugin for on-device text recognition (OCR) and translation using ML Kit. Maintained fork of react-native-vision-camera-text-recognition
This project is designed to facilitate the extraction of text and their corresponding bounding box coordinates from images, using the Tesseract.js library. It supports various input formats such as file paths, URLs, or Base64 encoded strings and can handl
TypeScript SDK for Docling - Bridge between Python Docling ecosystem and JavaScript/TypeScript. Supports both CLI and API modes with dual publishing.
Digital onboarding document capture
OCR via system provided API
Google on-device MLKit text recognition for React Native
OCR addon for qvac
ocr documents using gpt-4o-mini
document extension for tiptap
A minimal DOM implementation
Categorized data on third party entities on the web.
Fast, lightweight PDF and document parsing with spatial text extraction
Guten OCR is a high accurate text detection (OCR) Javascript/Typescript library that runs on Node.js, Browser, React Native and C++. Based on PaddleOCR and ONNX runtime
Provides expert functionality to convert, optimize, compress, produce, merge, split, ocr, enrich, archive, print documents and PDFs.
Distills a series of editing steps into deleted and added ranges
Fast PDF classification, text extraction, and image extraction. Native Rust performance via napi-rs.
Classifies text into intent groups or word matches
React Native Plugin for Genius Scan SDK
A simple text document implementation for Node LSP servers
A simple wrapper around command-line utils to assist in PDF / Image OCR (Optical Character Recognition) processing using Tesseract.
The Adobe PDF Services Node.js SDK provides APIs for creating, combining, exporting and manipulating PDFs.
Anyline Web SDK