Node.js bindings for LlamaCPP, a C++ library for running language models.
Typescript client for the Hugging Face Inference Providers and Inference Endpoints
A native Capacitor plugin that embeds llama.cpp directly into mobile apps, enabling offline AI inference with chat-first API design. Complete iOS and Android support: text generation, chat, multimodal, TTS, LoRA, embeddings, and more.
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
Pi extension for llama.cpp integration. Supports both router and single modes.
The repo is for one of the backend: [llama.cpp](https://github.com/ggerganov/llama.cpp)
React Native binding of llama.cpp
An another Node binding of llama.cpp
a GGUF parser that works on remotely hosted files
llama.cpp gguf file parser for javascript
Utility functions for working with TypeScript's API. Successor to the wonderful tsutils. 🛠️️
React Native binding of llama.cpp
Eliza mobile llama.cpp adapter — wraps llama-cpp-capacitor and maps its contextId-based API onto Eliza's LocalInferenceLoader contract.
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
Use self-hosted LLMs with an OpenAI compatible API
CLI coding agent for complex agentic workflows and long-horizon development. Local models (llama.cpp / Ollama), BYOK across 17 providers, parallel agents, persistent memory, scanned skills.
Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level
Node.js bindings for llama.cpp with QBNN architecture support
Turn your device into a GEIANT Hive compute node. Earn GNS tokens.
OpenCode plugin for enhanced llama.cpp support with auto-detection and dynamic model discovery
React Native binding of llama.cpp
TypeScript compiler wrapper for static analysis and code manipulation.
Perform async work synchronously in Node.js using `worker_threads` with first-class TypeScript support.
A Jest transformer with source map support that lets you use Jest to test projects written in TypeScript
LFM2.5-VL embedding provider for the Mimirswell knowledge graph
Rust implementation of OpenTSLM using Burn, WGPU, and llama.cpp
Tauri plugin to interact with LEAP & Liquid LFMs
Pull-based image-generation worker for the minis.gg studio.
Lightweight Ollama-compatible inference server with native SafeTensors support. No Python dependencies, cross-platform WebGPU acceleration via Airframe.
A context compiler for AI coding agents
Lua-first Agent Runtime built on AgentMesh
Fast local semantic search for codebases and knowledge bases with AI-powered features
Fast local semantic search for codebases and knowledge bases with AI-powered features
KnishIO validator orchestration CLI — Docker control, cell management, benchmarks, and health checks