A Tokio-based runtime library for data integration apps
Config-driven HTTP scraping API built on Actix Web, with optional Athena and Redis support
Multi-user, multi-model memory system for distributed AI agents (Halldyll ecosystem)
Common types and utilities for the Argus web crawler
Configuration management for the Argus web crawler
A production-ready web crawler capable of handling billions of URLs
Content deduplication utilities for web crawling
HTTP fetching utilities with retry logic for web crawling
URL frontier implementations for web crawling
HTML and sitemap parsing utilities for web crawling
Robots.txt parsing and caching for web crawling
Storage backends for crawled web data