Node.js native addon for llama.cpp — direct in-process inference
llama-cpp-node-api native binary for Linux x64
llama-cpp-node-api native binary for macOS x64
llama-cpp-node-api native binary for Windows x64
llama-cpp-node-api native binary for macOS ARM64
Extension of @node-llama-cpp/linux-x64-cuda - prebuilt binary for node-llama-cpp for Linux x64 with CUDA support
Prebuilt binary for node-llama-cpp for Linux x64 with Vulkan support
Prebuilt binary for node-llama-cpp for Linux arm64
Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level
Prebuilt binary for node-llama-cpp for Linux x64
Prebuilt binary for node-llama-cpp for Linux x64 with CUDA support
Prebuilt binary for node-llama-cpp for Linux armv7l
Prebuilt binary for node-llama-cpp for Windows x64 with Vulkan support
Prebuilt binary for node-llama-cpp for Windows arm64
Prebuilt binary for node-llama-cpp for Windows x64
Prebuilt binary for node-llama-cpp for macOS arm64 with Metal support
Extension of @node-llama-cpp/win-x64-cuda - prebuilt binary for node-llama-cpp for Windows x64 with CUDA support
Prebuilt binary for node-llama-cpp for Windows x64 with CUDA support
Prebuilt binary for node-llama-cpp for macOS x64
Prebuilt binary for node-llama-cpp for Windows x64 with Vulkan support
Prebuilt binary for node-llama-cpp for Windows x64
Prebuilt binary for node-llama-cpp for macOS arm64 with Metal support
Extension of @realtimex/linux-x64-cuda - prebuilt binary for node-llama-cpp for Linux x64 with CUDA support
Prebuilt binary for node-llama-cpp for Linux armv7l