Shared model download and cache layer for Blazen local-inference backends
GPU-accelerated ML inference server with Stripe billing, Hugging Face model caching, and SSE streaming
End-to-end llama.cpp toolkit: API client, HuggingFace Hub, server orchestration, benchmarks
Tauri plugin to interact with LEAP & Liquid LFMs
High-performance inference engine for BitNet models
HTTP API server for local LLM inference
Local GraphRAG memory for LLMs in a single SQLite file
CoreML inference engine for Candle tensors - provides Apple CoreML/ANE integration with real tokenization, safety fixes, and model calibration awareness
A minimal memory layer for AI agents
Command-line interface for Kreuzberg document intelligence
Core library for AI-powered audio stem separation
Fast license plate OCR inference in pure Rust - a port of fast-plate-ocr with ONNX model support
CachedModel caches simple (by id) finds in memcached reducing the amount of work the database needs to perform for simple queries.
Rails gem that gives you the ability to transparently cache single active record objects using memcached.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.