Chunk package for the Linux x64 CUDA fallback backend used by node-llama-cpp (4/6)
Chunk package for the Windows x64 CUDA fallback backend used by node-llama-cpp (4/6)
Extension of @node-llama-cpp/linux-x64-cuda - prebuilt binary for node-llama-cpp for Linux x64 with CUDA support
Prebuilt binary for node-llama-cpp for Linux x64
Prebuilt binary for node-llama-cpp for Linux x64 with Vulkan support
Prebuilt binary for node-llama-cpp for Linux arm64
Prebuilt binary for node-llama-cpp for Linux x64 with CUDA support
Prebuilt binary for node-llama-cpp for Linux armv7l
Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level
Prebuilt binary for node-llama-cpp for Windows x64 with Vulkan support
Prebuilt binary for node-llama-cpp for Windows arm64
Prebuilt binary for node-llama-cpp for Windows x64
Prebuilt binary for node-llama-cpp for macOS arm64 with Metal support
Extension of @node-llama-cpp/win-x64-cuda - prebuilt binary for node-llama-cpp for Windows x64 with CUDA support
Prebuilt binary for node-llama-cpp for Windows x64 with CUDA support
Prebuilt binary for node-llama-cpp for macOS x64
Prebuilt binary for node-llama-cpp for Linux armv7l
Prebuilt binary for node-llama-cpp for Windows arm64
Prebuilt binary for node-llama-cpp for Windows x64 with Vulkan support
Prebuilt binary for node-llama-cpp for Windows x64
Prebuilt binary for node-llama-cpp for macOS arm64 with Metal support
Extension of @realtimex/linux-x64-cuda - prebuilt binary for node-llama-cpp for Linux x64 with CUDA support
Prebuilt binary for node-llama-cpp for Linux arm64
Prebuilt binary for node-llama-cpp for Linux x64 with Vulkan support
llama.cpp bindings for Rust
Rust implementation of OpenTSLM using Burn, WGPU, and llama.cpp
Rust port of NeuTTS — on-device voice-cloning TTS with GGUF backbone and NeuCodec decoder
Low Level Bindings to llama.cpp
A flexible, multi-backend, customizable AI agent framework, entirely based on Rust.
Procedural macros for Ambi
LLM provider implementations (Anthropic, OpenAI Chat + Responses, Google Gemini, Ollama, Bedrock, Vertex AI, local llama.cpp) for the Brainwires Agent Framework. Speech (TTS/STT) providers live in `brainwires-provider-speech`.
Sub-microsecond exact phrase matching for LLM context retrieval using Roaring bitmaps
A3S Power — Privacy-preserving LLM inference for TEE environments
Functional wrapper around Llama.cpp with Rust Dynamic datatypes and Vector store support for creating RAG applications
High-performance Rust library for generating text embeddings using llama-cpp
Pull-based image-generation worker for the minis.gg studio.
No description provided.
No description provided.