Extract text from PDF files with support for multiple output formats
File format parsers for PDF Office (docx, xlsx, pptx, odt)
A library for extracting text from various file formats including PDF, DOCX, XLSX, PPTX, images via OCR, and more
Pure-Rust AV1 codec — orphan-rebuild scaffold pending clean-room re-implementation.
A highly parallel Perl 5 interpreter written in Rust
Universal data comprehension engine — understands structure, infers meaning, tracks lineage
Add --describe flag to clap subcommands for structured command schema output
Tools to parse Screenplay-formatted documents into semantically-typed structs.
Web API for extracting text from various file formats
Command-line interface for extracting text from various file formats
A Rust toolkit for detecting and extracting metadata, text, and content from various file formats
Portable RAG system on a USB drive. Single binary, local embeddings, MCP protocol.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.