Prebuilt binary for node-llama-cpp for Windows x64 with CUDA support
Prebuilt binary for node-llama-cpp for Linux x64 with CUDA support
Speech-to-text recognition API for Tauri with multi-language support
Native module for another Node binding of llama.cpp (win32-x64-cuda)
Native module for another Node binding of llama.cpp (linux-arm64-cuda)
Whisper.cpp Node.js binding with auto model offloading strategy.
Native module for another Node binding of whisper.cpp (win32-x64-cuda)
Project-scoped memory layer for Claude Code, Codex, Hermes, MCP, and dashboards
Native module for another Node binding of whisper.cpp (linux-arm64-cuda)
Scrypted ONNX Object Detection
High-performance CUDA to WebAssembly/WebGPU transpiler with Rust safety - Run GPU kernels in browsers and Node.js
Sweet Search native binaries for Linux arm64 (glibc) with NVIDIA CUDA backend (candle-cuda + flash-attn) — Jetson Orin, Grace Hopper, and arm64 server GPUs
Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level
Sweet Search native binaries for Linux x64 (glibc) with NVIDIA CUDA backend (candle-cuda + flash-attn, SM 7.0+)
Drop-in replacement for onnxruntime-node with DirectML and CUDA support
Another Node binding of whisper.cpp that keeps its API as close to whisper.rn as possible.
Gemini CLI Workspace Kit
ESLint configuration for CUDA
Another Node binding of llama.cpp
High-performance CUDA to WebAssembly/WebGPU transpiler with Rust safety - Run GPU kernels in browsers and Node.js
EVM and Tron vanity wallet generator powered by CUDA
Linux x64 provanity-worker standalone binary for external automation
xInfer — a high-performance LLM inference engine in Rust with CUDA/Metal acceleration
Free cloud GPUs for learning CUDA