
kreuzberg

v4.9.8 · RubyGems · Ruby

Kreuzberg is a high-performance document intelligence library with a Rust core and native Ruby bindings via Magnus. Extract text, metadata, and structured data from 75+ file formats including PDF, DOCX, PPTX, XLSX, HTML, RTF, images (with OCR), email, archives, and more. Features async/sync APIs, text chunking, language detection, and keyword extraction.

The verdict
Maintained. Niche but actively maintained.
Live from the RubyGems registry · derived rules, not AI
How it scores
Maintenance: Healthy
Popularity: Niche
Security: Clean
License: Other
Deps: Zero deps
Maintenance
Last published this month.
Popularity
91 downloads / week
Security
No known advisories for this version (OSV).
License
Elastic-2.0
Dependencies
No runtime dependencies
Recent releases
  • 5.0.0.pre.rc.1 · this month
  • 4.9.8 · this month
  • 4.9.7 · this month
  • 4.9.6 · this month
  • 4.10.0.pre.rc.15 · 1 month ago
  • 4.10.0.pre.rc.14 · 1 month ago
  • 4.10.0.pre.rc.12 · 1 month ago
  • 4.10.0.pre.rc.11 · 1 month ago
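
The release list mixes stable versions with pre-release candidates, so it may help to see how RubyGems itself tells them apart. The sketch below uses only `Gem::Version` and `Gem::Requirement` from Ruby's bundled RubyGems library, not Kreuzberg's own API. The version strings are copied from the list above; the `~> 4.9.8` pin is an illustrative assumption, not a recommendation from the registry.

```ruby
require "rubygems" # bundled with Ruby; normally already loaded

# Version strings copied from the release list on this page.
RELEASES = %w[
  5.0.0.pre.rc.1 4.9.8 4.9.7 4.9.6
  4.10.0.pre.rc.15 4.10.0.pre.rc.14 4.10.0.pre.rc.12 4.10.0.pre.rc.11
].map { |v| Gem::Version.new(v) }

# RubyGems treats any version containing a letter segment as a pre-release.
PRERELEASES, STABLE = RELEASES.partition(&:prerelease?)

# Newest stable release by RubyGems ordering (not by publish date).
LATEST_STABLE = STABLE.max

# Hypothetical pin: allow 4.9.x patch releases at or above 4.9.8.
PIN = Gem::Requirement.new("~> 4.9.8")

puts "latest stable:  #{LATEST_STABLE}"
puts "pre-releases:   #{PRERELEASES.sort.map(&:to_s).join(', ')}"
puts "pin accepts it: #{PIN.satisfied_by?(LATEST_STABLE)}"
```

In a Gemfile, the same constraint would be written as `gem "kreuzberg", "~> 4.9.8"`. Bundler skips pre-release versions unless the requirement itself names one.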