File system handling wrapper to safely ingest raw external data sources.
Data Processing & ETL for HazelJS framework
Node-based file parsers that return Contextrie core source instances.
A source for Écoule that import data from a Comma-separated values-file
Converts a CSV file to SQL Insert Statements.
CSV file source connector for the faucet-stream ecosystem
A composable, deterministic text data pipeline for ML. Ingest, denoise, chunk, split, and sample multi-source corpora into reproducible training triplets.
High-performance embeddable OLAP cube library built on Apache Arrow and DataFusion, with support for dynamic aggregations, calculated fields, and incremental updates
Source components for Fluxus stream processing engine
A tiny Rust query engine that supports SQL-like filters, CSV scanning, projections, and a custom DSL powered by Pest.