Speech / Language tools
Provides text-to-speech functionality.
TypeScript definitions for dom-speech-recognition
Cloud Speech Client Library for Node.js
Speech recognition for your React app
Speech Recognition for React Native Expo projects
Microsoft Cognitive Services Speech SDK for JavaScript
TypeScript definitions for react-speech-recognition
A standalone speech rule engine for XML structures, based on the original engine from ChromeVox.
Cloud Text-to-Speech API client for Node.js
n8n node for integrating Palatine Speech API into workflow
Polyfill Web Speech API with Cognitive Services Speech-to-Text service
A native plugin for speech recognition
Capacitor plugin for synthesizing speech from text.
React hooks for in-browser Speech Recognition and Speech Synthesis.
Cross browser Speech Synthesis
Web components for Attendi's speech service.
Add real-time speech to text functionality into your website with no effort
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Speech-command recognizer in TensorFlow.js
An easy-to-use React.js library that leverages the Web Speech API to convert text to speech.
OCI NodeJS client for Ai Speech Service
Capacitor plugin for comprehensive on-device speech recognition with live partial results.
Javascript client library for Soniox Speech-to-Text websocket API
Safe Rust bindings for Apple's Speech framework — on-device speech recognition (SFSpeechRecognizer) on macOS
Safe Rust wrapper for sherpa-onnx speech recognition toolkit
Rust bindings for Microsoft Speech SDK.
Raw FFI bindings to the sherpa-onnx C API
Rust SDK for Sarvam AI APIs — chat, translation, speech-to-text, text-to-speech, transliteration, and language identification
A Tauri plugin for cross-platform device AI capabilities including speech, vision, and text processing
Azure AI Speech adapter helpers for Vona
Shared native MLX speech model loading utilities for Vona
Cross-platform text translator: global hotkey, interactive terminal, and CLI modes using Google Translate
Native text-to-speech plugin for Tauri with multi-language and voice selection
Umbrella crate for Vona real-time speech-to-speech runtime contracts and optional adapters
Core traits, event types, session driver, skill registry and runtime policy surface for real-time speech-to-speech runtimes
Prepare SSML and GRXML documents with ease
Google Speech-to-Text enables developers to convert audio to text by applying powerful neural network models in an easy-to-use API. The API recognizes more than 120 languages and variants to support your global user base. You can enable voice command-and-control, transcribe audio from call centers, and more. It can process real-time streaming or prerecorded audio, using Google's machine learning technology.
Adventures in the Land of Speech Recognition
This is a gem to call the google speech api.
A Ruby library for consuming v3 of the AT&T Speech API for speech->text, and text->speech. Takes in either .wav or specific other audio files, and returns a text string of the spoken words. Can also take in either a text string or .txt file and returns a string of bytes from which a .wav file can be created of the spoken text.
This software is a SpeechBalloon plugin for PettanR
speech to text using different services
This software is a SpeechBalloon plugin for PettanR
This software is a SpeechBalloon plugin for PettanR
Google Speech-to-Text enables developers to convert audio to text by applying powerful neural network models in an easy-to-use API. The API recognizes more than 120 languages and variants to support your global user base. You can enable voice command-and-control, transcribe audio from call centers, and more. It can process real-time streaming or prerecorded audio, using Google's machine learning technology. Note that google-cloud-speech-v1 is a version-specific client library. For most uses, we recommend installing the main client library google-cloud-speech instead. See the readme for more details.
Google Speech-to-Text enables developers to convert audio to text by applying powerful neural network models in an easy-to-use API. The API recognizes more than 120 languages and variants to support your global user base. You can enable voice command-and-control, transcribe audio from call centers, and more. It can process real-time streaming or prerecorded audio, using Google's machine learning technology. Note that google-cloud-speech-v1p1beta1 is a version-specific client library. For most uses, we recommend installing the main client library google-cloud-speech instead. See the readme for more details.
Provides a simple interface for using the AVSpeechSynthesizer related classes available natively in iOS 7.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.