Webcomponent ocr following open-wc recommendations
A powerful React Native OCR (Optical Character Recognition) module powered by Google ML Kit. Supports multiple languages and scripts with selective model loading for optimized app size.
Tesseract OCR wrapper for React Native
Fork of tesseract.js used for scribe.js. Pure Javascript Multilingual OCR
Load node modules according to tsconfig paths, in run-time or via API.
## **Content:**
Link → clean text → summary.
Node.js wrapper for Tesseract OCR CLI.
Cross platform child_process#spawn and child_process#spawnSync
**This plugin is still in development stage.Everything may change, but some features are available now.**
Fast, lightweight PDF and document parsing with spatial text extraction
Fast PDF classification and text extraction. Detect text-based vs scanned PDFs, extract text by region with quality checks. Native Rust performance via napi-rs.
Ignore is a manager and filter for .gitignore rules, the one used by eslint, gitbook and many others.
The_powerful_Optical_Character_Recognition__OCR_APIs_let_you_convert_scanned_images_of_pages_into_recognized_text_
High-performance client-side OCR with ONNX Runtime, RapidOCR and PPU PaddleOCR integration. 100+ language support. Process text from images entirely in the browser with state-of-the-art accuracy and complete privacy.
Nuktaa helps AI teams turn public or private source material into usable knowledge for LLM applications.
`@contracthero/common` contains utility functions that are used across the ContractHero platform.
Provides a way to make requests
Determines if an object can be used as an array
Build tool and bindings loader for node-gyp that supports prebuilds
[中文版](./README_cn.md)
TypeScript execution environment and REPL for node.js, with source map support
Guten OCR is a high accurate text detection (OCR) Javascript/Typescript library that runs on Node.js, Browser, React Native and C++. Based on PaddleOCR and ONNX runtime
Image to markdown (OCR) with Llama 3.2 Vision.