Local audio/video transcription with speaker diarization and live audio support. No API keys. Powered by faster-whisper.
CLI tool to record audio and transcribe it using OpenAI Whisper
AWS SDK for JavaScript Transcribe Client for Node.js, Browser and React Native
[](https://www.npmjs.com/package/@aws-sdk/middleware-sdk-transcribe-streaming) [** for the [AI SDK](https://ai-sdk.dev/docs) contains transcription model support for the Deepgram transcription API and speech model support for the Deepgram text-to-speech
The **[ElevenLabs provider](https://ai-sdk.dev/providers/ai-sdk-providers/elevenlabs)** for the [AI SDK](https://ai-sdk.dev/docs) contains language model support for the ElevenLabs chat and completion APIs and embedding model support for the ElevenLabs em
Wasm build based on whisper.cpp.
Bun + TypeScript CLI foundation for deterministic media-intake workflows
A client for Amazon Transcribe using the websocket interface
CLI tool to transcribe audio/video files to SRT format using OpenAI Whisper API
CLI tool for Genspark Tool API - search, crawl, analyze images, generate media
Simple cross-browser speech to text using react hooks.
Transcribe speech to text in the browser.
Creates searchable transcripts from text tracks
Windows-native MCP server for local audio transcription using whisper.cpp with Vulkan GPU acceleration
Offline transcription of iPhone voice memos on macOS — 3 engines (mlx/cpp/gigaam) optimized for Russian
Agent-first CLI for UNIR (Universidad Internacional de La Rioja) campus online. Surfaces course materials from Moodle + LTI hub + Panopto, downloads, transcribes, summarizes, and publishes to a personal Starlight site.
Generate Markdown documentation from code comments
convert an AWS transcribe JSON body into a .vtt file