
openai-whisper-api

Transcribe audio via the OpenAI Audio Transcriptions API (Whisper).

SKILL.md

# OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI’s `/v1/audio/transcriptions` endpoint.

## Quick start

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a
```

Defaults:

- Model: `whisper-1`
- Output: `<input>.txt`

## Useful flags

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
```

## API key

Set `OPENAI_API_KEY`, or configure it in `~/.clawdbot/clawdbot.json`:

```json5
{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE"
    }
  }
}
```
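## Example request (sketch)

For readers who want to see what a call to `/v1/audio/transcriptions` might look like without the wrapper script, here is a minimal sketch. It is not the contents of `scripts/transcribe.sh`, which are not shown in this document. The function names `build_curl_args` and `transcribe`, the positional argument order, and the `text` default for `response_format` are all assumptions made for illustration. The form fields (`file`, `model`, `language`, `prompt`, `response_format`) are those of the OpenAI transcription endpoint.

```bash
#!/usr/bin/env bash
# Hypothetical sketch: NOT the actual contents of scripts/transcribe.sh.
set -euo pipefail

# Fill the global array CURL_ARGS with the curl arguments for one request.
# Usage: build_curl_args <audio-file> [model] [response_format] [language] [prompt]
build_curl_args() {
  local file="$1"
  local model="${2:-whisper-1}"   # default model named in this document
  local format="${3:-text}"       # assumption: plain text unless JSON is requested
  local language="${4:-}"
  local prompt="${5:-}"

  CURL_ARGS=(
    -sS --fail
    -H "Authorization: Bearer ${OPENAI_API_KEY:-}"
    -F "file=@${file}"
    -F "model=${model}"
    -F "response_format=${format}"
  )
  # Optional fields are sent only when they are set.
  if [[ -n "$language" ]]; then
    CURL_ARGS+=(-F "language=${language}")
  fi
  if [[ -n "$prompt" ]]; then
    # --form-string sends the value literally (no @file or ;type= parsing).
    CURL_ARGS+=(--form-string "prompt=${prompt}")
  fi
  CURL_ARGS+=("https://api.openai.com/v1/audio/transcriptions")
}

# Send the request and write the response body to <out-file>.
# Usage: transcribe <audio-file> <out-file> [model] [response_format] [language] [prompt]
transcribe() {
  local file="$1" out="$2"
  : "${OPENAI_API_KEY:?Set OPENAI_API_KEY first}"
  build_curl_args "$file" "${@:3}"
  curl "${CURL_ARGS[@]}" -o "$out"
}
```

Under these assumptions, `transcribe /path/to/audio.m4a /tmp/transcript.json whisper-1 json en` would roughly correspond to combining the `--json`, `--language en`, and `--out` flags above. Building the argument list in a separate function keeps the request easy to inspect before anything is sent.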
