A minimal llama.cpp provider for the Vercel AI SDK implementing LanguageModelV3 and EmbeddingModelV3
Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level
Extension of @node-llama-cpp/linux-x64-cuda - prebuilt binary for node-llama-cpp for Linux x64 with CUDA support
Prebuilt binary for node-llama-cpp for Linux x64 with Vulkan support
Prebuilt binary for node-llama-cpp for Linux arm64
Prebuilt binary for node-llama-cpp for Linux x64
Prebuilt binary for node-llama-cpp for Linux x64 with CUDA support
Prebuilt binary for node-llama-cpp for Linux armv7l
Prebuilt binary for node-llama-cpp for Windows x64 with Vulkan support
Prebuilt binary for node-llama-cpp for Windows arm64
Prebuilt binary for node-llama-cpp for Windows x64
Prebuilt binary for node-llama-cpp for macOS arm64 with Metal support
Extension of @node-llama-cpp/win-x64-cuda - prebuilt binary for node-llama-cpp for Windows x64 with CUDA support
This package provides a foundation for implementing providers that expose an OpenAI-compatible API.
Prebuilt binary for node-llama-cpp for Windows x64 with CUDA support
Prebuilt binary for node-llama-cpp for macOS x64
The **[Together.ai provider](https://ai-sdk.dev/providers/ai-sdk-providers/togetherai)** for the [AI SDK](https://ai-sdk.dev/docs) contains language model support for the [Together.ai](https://together.ai) platform.
A native Capacitor plugin that embeds llama.cpp directly into mobile apps, enabling offline AI inference with chat-first API design. Complete iOS and Android support: text generation, chat, multimodal, TTS, LoRA, embeddings, and more.
The **[DeepInfra provider](https://ai-sdk.dev/providers/ai-sdk-providers/deepinfra)** for the [AI SDK](https://ai-sdk.dev/docs) contains language model support for the DeepInfra API, giving you access to models like Llama 3, Mixtral, and other state-of-th
Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level
Prebuilt binary for node-llama-cpp for Windows x64 with Vulkan support
Prebuilt binary for node-llama-cpp for Windows x64
Prebuilt binary for node-llama-cpp for macOS arm64 with Metal support
Extension of @realtimex/linux-x64-cuda - prebuilt binary for node-llama-cpp for Linux x64 with CUDA support