Right-size LLM models to your system hardware. Interactive TUI and CLI to match models against available RAM, CPU, and GPU.
Core library for llmfit — hardware detection, model fitting, and provider integration
Genetic algorithm library for n-dimensional optimization problems
LLM provider implementations (Anthropic, OpenAI Chat + Responses, Google Gemini, Ollama, Bedrock, Vertex AI, local llama.cpp) for the Brainwires Agent Framework. Speech (TTS/STT) providers live in `brainwires-provider-speech`.
CNN feature extraction for image embeddings with SIMD acceleration
Mesh builder for tile maps using using texture atlases
Small, sweet, easy framework for full-stack web application
Stretto is a high performance thread-safe memory-bound Rust cache.
Tiny LLM inference for ESP32 microcontrollers with INT8/INT4 quantization, multi-chip federation, RuVector semantic memory, and SNN-gated energy optimization
Pure-Rust inference for VoxCPM2 on top of the Burn framework (Vulkan + CPU).
Senior SysAdmin, Network Admin, Data Analyst, and Software Engineer living in your terminal. A high-precision local AI agent harness for LM Studio, Ollama, and other local OpenAI-compatible runtimes that runs 100% on your own silicon. Reads repos, edits files, runs builds, inspects full network state and workstation telemetry, and runs real Python/JS for data analysis.
turbolite - SQLite VFS with sub-50ms cold queries from S3 + page-level compression and encryption