A CLI speech recognition tool built on OpenAI Whisper that supports audio file transcription and near-real-time microphone input.
MCP server for audio transcription with OpenAI API, whisper-cli, or whisper.cpp
Local speech-to-text with the Whisper CLI (no API key).
Work with the output of the OpenAI Whisper API
Helpers for installing and using Whisper.cpp
Node bindings for OpenAI's Whisper. Optimized for CPU.
Run Whisper on Node.js
Run Whisper on Node.js
Node.js bindings for OpenAI's Whisper. Runs locally on CPU.
Helpers for using Whisper.cpp in the browser via WASM
Syncs phone notification data to local storage in the background so an Agent can query and consume it
AI-powered security scanner to secure codebases
Speech-to-text recognition API for Tauri with multi-language support
Run Whisper on Node.js
Windows-native MCP server for local audio transcription using whisper.cpp with Vulkan GPU acceleration
Whisper.cpp Node.js binding with auto model offloading strategy.
NEURO skill: Local speech-to-text with the Whisper CLI (no API key).
Terminal-first relay for paired AI coding agents (Claude + Codex), driven by structured workflows.
An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
A task-based automation app. Leiningen style.
Browser-native audio transcription powered by WebGPU Whisper — zero server, fully local.
Local audio/video transcription with speaker diarization and live audio support. No API keys. Powered by faster-whisper.
A GPU accelerated .node addon for whisper.cpp with prebuilt binaries
TypeScript port of SYSTRAN/faster-whisper for Node.js, built on CTranslate2, Koffi, FFmpeg, and ONNX Runtime.
A command line interface for whisper-rs
Native speech-to-text voice dictation for Hyprland (Rust implementation)
Push-to-talk speech-to-text daemon for Wayland (Hyprland)
CLI control client for the kloyce speech-to-text daemon
AVI/video → SRT subtitle generator using whisper.cpp via whisper-rs
[DO NOT USE — UNDER ACTIVE DEVELOPMENT, NOT PRODUCTION-READY] Captcha solver scaffolding for chromiumoxide-driven browsers. The architecture is in place (vendor solvers, retry-loop iframe walking, VLM provider abstraction, real-WAF bench harness) but the live-vendor success rate is still 0% — Cloudflare Turnstile / hCaptcha / reCAPTCHA detect us at a TLS / CDP fingerprint layer that no flag-based stealth has cleared. Watch the repo; do not depend on this for any real workload.
The post-processing layer Whisper should have shipped with: segment dedup, foreign-script rejection, noise-marker collapse, voice-command strip
No description provided.