Fast pure-Rust PDF extraction library and CLI — ~10-50x faster than pdfplumber for text, word, table, layout, image, and metadata extraction from PDFs. By Clark Labs Inc.
A Ruby gem that wraps the pdfsink-rs CLI, a fast pure-Rust PDF extraction tool, providing text, word, object, table, link, and regex-search extraction from PDFs for use in Rails applications.