Chunk package for the Linux x64 CUDA fallback backend used by node-llama-cpp (2/6)
Chunk package for the Windows x64 CUDA fallback backend used by node-llama-cpp (2/6)
Extension of @node-llama-cpp/linux-x64-cuda - prebuilt binary for node-llama-cpp for Linux x64 with CUDA support
Prebuilt binary for node-llama-cpp for Linux x64
Prebuilt binary for node-llama-cpp for Linux x64 with CUDA support
Prebuilt binary for node-llama-cpp for Linux armv7l
Prebuilt binary for node-llama-cpp for Linux x64 with Vulkan support
Prebuilt binary for node-llama-cpp for Linux arm64
Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level
Extension of @node-llama-cpp/win-x64-cuda - prebuilt binary for node-llama-cpp for Windows x64 with CUDA support
Prebuilt binary for node-llama-cpp for Windows x64 with CUDA support
Prebuilt binary for node-llama-cpp for Windows x64
Prebuilt binary for node-llama-cpp for Windows x64 with Vulkan support
Prebuilt binary for node-llama-cpp for macOS arm64 with Metal support
Prebuilt binary for node-llama-cpp for Windows arm64
Prebuilt binary for node-llama-cpp for macOS x64
Prebuilt binary for node-llama-cpp for Linux armv7l
Prebuilt binary for node-llama-cpp for Windows arm64
Prebuilt binary for node-llama-cpp for Linux arm64
Prebuilt binary for node-llama-cpp for Linux x64
Prebuilt binary for node-llama-cpp for Linux x64 with Vulkan support
Prebuilt binary for node-llama-cpp for Windows x64
Prebuilt binary for node-llama-cpp for macOS arm64 with Metal support
Prebuilt binary for node-llama-cpp for Windows x64 with Vulkan support
llama.cpp bindings for Rust
LLM provider implementations (Anthropic, OpenAI Chat + Responses, Google Gemini, Ollama, Bedrock, Vertex AI, local llama.cpp) for the Brainwires Agent Framework. Speech (TTS/STT) providers live in `brainwires-provider-speech`.
A crate for running llama.cpp in Rust, based on llama-cpp-2
An opinionated, simple Rust interface for local LLMs, powered by llama-cpp-2
rqmd: search engine core (store, db, chunking, collections, LLM integration)
Pull-based image-generation worker for the minis.gg studio.
Local on-device LLM inference for swink-agent using llama.cpp
A3S Power — Privacy-preserving LLM inference for TEE environments
Rig completion provider for local GGUF models via llama.cpp, with streaming, tool calling, reasoning, and multimodal (mtmd) support.
Simplified zero-cost wrapper over llama.cpp powered by llama-cpp-2.
Low-level bindings to llama.cpp
Low-level bindings to llama.cpp
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.