Powerful and flexible web scraper with YAML configuration, supporting pagination, data transformations, caching, and multiple output formats
Fast extractive text summarizer in Rust (with 30-70% compression).
Lightweight, fast DOCX text extraction library with minimal dependencies
Vision/OCR connector for OxiFY workflows
A high-performance library for extracting HWP/HWPX documents into structured Markdown
A simple CLI tool for extracting HTML tags from HTML text based on a CSS selector
Simple Rust library to extract readable text from specific document formats such as Word documents (docx). Currently only a few formats are supported; others are coming soon.
Basic structured text extraction using mupdf-rs.
High-performance PDF text extraction library for vectorization pipelines
HTML manipulation and tools plugin for the Lava language
AI/Human task management system with file-based storage
A highly parallel Perl 5 interpreter written in Rust
No description provided.