Speech & Transcription
21 skills in this category
assemblyai-transcribe
SafeTranscribe audio/video with AssemblyAI (local upload.
audio-gen
SafeGenerate audiobooks, podcasts, or educational audio content on demand.
audio-reply
CautionGenerate audio replies using TTS. Trigger with "read it to me [URL]" to fetch.
edge-tts
Safe|.
gettr-transcribe-summarize
SafeDownload audio from a GETTR post (via HTML og:video), transcribe it locally.
llmwhisperer
SafeExtract text and layout from images and PDFs using LLMWhisperer API.
local-whisper
SafeLocal speech-to-text using OpenAI Whisper. Runs fully offline after model download.
mlx-whisper
SafeLocal speech-to-text with MLX Whisper (Apple Silicon optimized, no API key).
openai-whisper
SafeLocal speech-to-text with the Whisper CLI (no API key).
openai-whisper-api
SafeTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
parakeet-mlx
SafeLocal speech-to-text with Parakeet MLX (ASR) for Apple Silicon (no API key).
parakeet-stt
Safe>-.
pocket-transcripts
SafeRead transcripts and summaries from Pocket AI (heypocket.com) recording devices.
pocket-tts
Safepocket-tts
tts-whatsapp
SafeSend high-quality text-to-speech voice messages on WhatsApp in 40+ languages with automatic delivery.
video-subtitles
SafeGenerate SRT subtitles from video/audio with translation support.
voice-transcribe
SafeTranscribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints.
elevenlabs-voices
SafeElevenLabs voice synthesis: 18 personas, 32 languages, sound effects.
elevenlabs-media
SafeElevenLabs music generation and speech-to-text (Scribe v2).
elevenlabs-agents
SafeCreate and manage ElevenLabs conversational AI agents.
tts
SafeText-to-speech using Hume AI or OpenAI API.