Extract tables from PDF files
PDF → Markdown extractor with figure rasterization, table & banner detection. Built on pdfium-render.
A pure-Rust PDF library — create, parse, and render PDF documents with zero C dependencies
RAG-based codebase indexing and semantic search; a dual-purpose library and MCP server
Four formats, one engine. PDF, DOCX, XLSX, HTML → Markdown and typed JSON. 15–40× faster than equivalent-quality OSS tools, with pipeline pre-flight and element-level provenance.
Extract plain text from HTML, PDF, and other document formats
Semantic memory with SQLite and Qdrant for Zeph agent
A preprocessor for text and HTML corpora
Kowalski Academic Agent: A Rust-based agent for interacting with Ollama models
HTML main-content extraction (article body, title, metadata) — Rust ports of Mozilla Readability, Trafilatura, and htmldate.
A Rust toolkit for detecting and extracting metadata, text, and content from various file formats
deepwiki-rs (also known as Litho) is a high-performance engine, written in Rust, that automatically generates C4 architecture documentation. It analyzes project structures, identifies core components, parses dependency relationships, and uses large language models (LLMs) to generate professional architecture documentation.
A web article downloader
No description provided.