A library for doing speech recognition using a Coqui STT model
Voice pipeline for Cloudflare Agents — STT, TTS, VAD, streaming, and SFU utilities
Voice capabilities: TTS, STT, and conversational AI
Mastra Inworld AI voice integration — streaming TTS and batch STT
Storybook for stt using Create-React-app
React Native wrapper for sherpa-onnx TTS and STT capabilities
Real-time streaming STT via Deepgram WebSocket API for AgentOS voice pipeline
Local STT via Kyutai moshi (on-device, GPU, Python/uv)
SIP telephony plugin for OpenClaw. Single-process pure TypeScript. STT + agent + TTS over real SIP+RTP.
STT manager and browser adapters for Charivo
Mozilla Voice STT NodeJS bindings
A polyfill for SpeechRecognition utilizing a SEPIA STT server.
Chunked sliding-window streaming STT via OpenAI Whisper HTTP API for AgentOS voice pipeline
Official VoicePilot JavaScript SDK — TTS, STT, Agents, and real-time conversations.
Voice input/output plugin for Zhin.js — STT via Whisper + TTS via edge-tts
GENSHI Works STT SDK — high-accuracy domain-specific speech-to-text
BitGo SDK coin library for Somnia
Voice Connect — standalone OpenClaw voice channel plugin (SPA + STT + TTS)
Native Node addon for @genshiai/stt (linux x64)
Klarisent STT SDK
A simple and lightweight proxy for seamless integration with multiple STT (Speech-to-Text) providers including Whisper.cpp
Pi package for speech-first interaction with pluggable STT/TTS providers, defaulting to Sarvam AI.
Voiceapp CLI for STT and TTS workflows
STT.
Library for transcription using whisper ai model
The autonomous, self-improving AI agent. Single Rust binary. Every channel. Install with: cargo install opencrabs
Audio intelligence and pipeline orchestration for ADK-Rust agents
A Rust library for speech-to-text processing using Whisper
AI-native macOS menu bar dictation for developers.
Type-safe Rust client for ElevenLabs Speech-To-Text API
STT harness: an agentic streaming pipeline that diarizes, transcribes, and accumulates a conversation record aligned to agentic structures.
Optional axum backend + embedded React SPA for reviewing and editing diarized STT conversations.
AssemblyAI STT backend (REST + Universal-Streaming WebSocket) for atomr-agents.
Deepgram STT backend (REST + WebSocket) for atomr-agents.
Plug-and-play speech-to-text for Rust. Add local transcription to any app in a few lines, with automatic GPU acceleration and zero configuration.
Audio decoding (symphonia) and microphone capture (cpal) for atomr-agents speech-to-text.
API Wrapper for the Microsoft Translator Text API 3.1 (Cognitive Services)
The AssemblyAI Ruby SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async, audio intelligence models, as well as the latest LeMUR models. The Ruby SDK does not support Streaming STT at this time.
Ruby client library for HINOW AI - Access LLMs, image generation, TTS, STT, video generation, and embeddings.
lib to set and get terminal line settings
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.