An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
This is an MCP server that allows you to directly download transcripts of YouTube videos.
n8n node for integrating the Palatine Speech API into workflows
A React component that makes correcting automated transcriptions of audio and video easier and faster, built on the Slate editor
TypeScript implementation of YouTube Transcript API
Extract clean, timestamped YouTube captions, subtitles, transcripts, and video metadata for AI summaries, RAG, search, and slide-ready workflows.
Voice pipeline for Cloudflare Agents — STT, TTS, VAD, streaming, and SFU utilities
configuration files / plugins
Local CLI for turning YouTube captions into rich Markdown context files for agents.