Get markdown out of any document — Pandoc + pdfium + platform-native OCR, dispatched per format.
Simple Rust library to extract readable text from specific document format like Word Document (docx). Currently only support several format, other format coming soon.
A blazingly fast command line tool written in pure safe Rust to automatically extract email addresses from files in a given path.
A Rust toolkit for detecting and extracting metadata, text, and content from various file formats
Fast PGS subtitle extraction, encoding, and round-trip transformation for MKV and M2TS containers
No description provided.
No description provided.
No description provided.