Voice synthesis and transcription tools for AgentOS via OpenAI, ElevenLabs, Deepgram, and local Ollama/Whisper-compatible runtimes
Voice input/output plugin for Zhin.js — STT via Whisper + TTS via edge-tts
MCP server for fal.ai - Run 600+ AI models for image, video, audio generation and more
Node.js library for writing TJBot recipes. Special package for updated packages.
Soniox integration for LangChain.js
Provider-agnostic AI gateway with capability-based routing, in-memory rate limiting, and observability hooks.
Rust SDK for Sarvam AI APIs — chat, translation, speech-to-text, text-to-speech, transliteration, and language identification
AI-native macOS menu bar text-to-speech and MCP server for agents.
ElevenLabs provider for the LLM Kit - text-to-speech and speech-to-text
Safe Rust wrapper for sherpa-onnx speech recognition toolkit
AI-native macOS menu bar dictation for developers.
Local-first floating voice-to-text (STT) and text-to-speech (TTS) tool for Linux, macOS, and Windows
Native text-to-speech plugin for Tauri with multi-language and voice selection
A general-purpose voice <-> crate — text-to-speech, speech-to-text, and audio-to-audio transformations. Also supports realtime conversations.
Audio I/O, speech-to-text, and text-to-speech for the Brainwires Agent Framework
Command-line interface for the ElevenLabs API
Comprehensive async Rust SDK for ElevenLabs API with TTS, STT, voice management, and WebSocket streaming
A Rust implementation of Kokoro TTS (Text-to-Speech) synthesis
Speech-to-text using different services
Text-to-Speech converts text or Speech Synthesis Markup Language (SSML) input into audio data of natural human speech.
Text-to-Speech converts text or Speech Synthesis Markup Language (SSML) input into audio data of natural human speech. Note that google-cloud-text_to_speech-v1 is a version-specific client library. For most uses, we recommend installing the main client library google-cloud-text_to_speech instead. See the readme for more details.
Text-to-Speech converts text or Speech Synthesis Markup Language (SSML) input into audio data of natural human speech. Note that google-cloud-text_to_speech-v1beta1 is a version-specific client library. For most uses, we recommend installing the main client library google-cloud-text_to_speech instead. See the readme for more details.
No description provided.