pikuri-pdf

v0.0.6 · RubyGems · Ruby

pikuri-pdf plugs PDF → text extraction into pikuri-core's +Pikuri::Extractor+ registry. The bundled +Pikuri::Extractors::PDF+ extractor wraps the pure-Ruby pdf-reader gem and extracts lazily: paged reads (the +read+ tool's windows) parse only the pages the window needs, so the first page of a 500-page PDF never pays for the other 499. Shipped separately from pikuri-core so the core's dependency tree stays minimal and auditable: pdf-reader and its transitive deps (Ascii85, afm, hashery, ruby-rc4, ttfunk) ride along only for hosts that opt into PDF support. Registration is explicit — +Pikuri::Extractors::PDF.register+ — so requiring the gem changes nothing by itself; the host script picks which extractors it wires in. One registration extends the +read+ tool, +web_scrape+, and the pikuri-vectordb indexer simultaneously.
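The two ideas in the description, explicit registration and lazy per-page extraction, can be illustrated with a small self-contained sketch. Everything named here (`SketchRegistry`, `SketchLazyPdfExtractor`, `read_window`, `parsed_pages`) is hypothetical and stands in for the real classes. It is not the API of pikuri-core or pikuri-pdf, and the "parsing" step is a placeholder rather than a call into pdf-reader.

```ruby
# Toy stand-in for an extractor registry. Hypothetical: NOT pikuri-core's API.
module SketchRegistry
  @extractors = {}

  class << self
    # Explicit opt-in: nothing is registered until the host script calls this.
    def register(mime_type, extractor)
      @extractors[mime_type] = extractor
    end

    # Returns the registered extractor, or nil if none was wired in.
    def lookup(mime_type)
      @extractors[mime_type]
    end
  end
end

# Hypothetical lazy extractor: a page is "parsed" only when a window asks for it.
class SketchLazyPdfExtractor
  attr_reader :parsed_pages

  def initialize(raw_pages)
    @raw_pages = raw_pages # stands in for unparsed page objects
    @cache = {}            # page number => extracted text
    @parsed_pages = []     # records which pages were actually parsed
  end

  # Text for the 1-based page range first..last, parsing only those pages.
  def read_window(first, last)
    (first..last).map { |number| page_text(number) }.join("\n")
  end

  private

  def page_text(number)
    @cache[number] ||= begin
      @parsed_pages << number
      @raw_pages.fetch(number - 1).upcase # placeholder for real text extraction
    end
  end
end

# Before registration the registry knows nothing about PDFs.
SketchRegistry.lookup("application/pdf") # => nil

extractor = SketchLazyPdfExtractor.new(Array.new(500) { |i| "page #{i + 1}" })
SketchRegistry.register("application/pdf", extractor)

SketchRegistry.lookup("application/pdf").read_window(1, 1) # => "PAGE 1"
extractor.parsed_pages                                     # => [1]
```

Under these assumptions, reading the first page of a 500-page document touches one page only, and loading the code without calling `register` leaves the registry unchanged.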

The verdict

Maintained. Niche, but actively maintained.

Live from the RubyGems registry · derived rules, not AI

How it scores

Maintenance: Healthy

Popularity: Niche

Security: Clean

License: Permissive

Deps: Zero deps

Maintenance

Last published this month.

Popularity

88 downloads / week

Security

No known advisories for this version (OSV).

License

MIT

Dependencies

No runtime dependencies

Recent releases

0.0.6 · this month