clawdbot/skills/openai-whisper-api/SKILL.md
2025-12-20 21:12:57 +01:00


name: openai-whisper-api
description: Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
homepage: https://platform.openai.com/docs/guides/speech-to-text
metadata:
  clawdis:
    emoji: ☁️
    requires:
      bins: curl
      env: OPENAI_API_KEY
    primaryEnv: OPENAI_API_KEY

OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI's /v1/audio/transcriptions endpoint.

Quick start

{baseDir}/scripts/transcribe.sh /path/to/audio.m4a

Defaults:

  • Model: whisper-1
  • Output: <input>.txt
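For orientation, the quick start with its defaults roughly corresponds to a single multipart request against the endpoint. The following is a minimal sketch, not the contents of transcribe.sh: the function names default_out and transcribe_default are hypothetical, and it assumes "<input>.txt" means the input path with its extension replaced by .txt.

```shell
# Sketch only: default_out and transcribe_default are hypothetical names,
# not taken from transcribe.sh.

# Assumption: "<input>.txt" means the input path with its extension replaced by .txt.
default_out() {
  printf '%s\n' "${1%.*}.txt"
}

transcribe_default() {
  # Multipart upload: the audio file plus the model name.
  # The plain-text response is written to the derived output path.
  curl -sS https://api.openai.com/v1/audio/transcriptions \
    -H "Authorization: Bearer ${OPENAI_API_KEY}" \
    -F "file=@$1" \
    -F "model=whisper-1" \
    -F "response_format=text" \
    -o "$(default_out "$1")"
}
```

Calling transcribe_default /path/to/audio.m4a would then send the file and write whatever the API returns next to the input.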

Useful flags

{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
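The flags above plausibly map onto form fields of the transcription request (model, language, prompt, response_format). The sketch below shows one way such a mapping could look. It is an illustration under assumptions, not the actual script: transcribe_with_flags and the CURL override are hypothetical, and treating --json as response_format=json is a guess at the script's behavior.

```shell
# Hypothetical sketch: maps the documented flags onto form fields of the
# /v1/audio/transcriptions request. Not the actual transcribe.sh.
transcribe_with_flags() {
  input="$1"; shift
  model="whisper-1"; language=""; prompt=""; format="text"
  # Assumption: the default output swaps the input extension for .txt.
  out="${input%.*}.txt"

  while [ "$#" -gt 0 ]; do
    case "$1" in
      --model)    model="$2"; shift 2 ;;
      --language) language="$2"; shift 2 ;;
      --prompt)   prompt="$2"; shift 2 ;;
      --out)      out="$2"; shift 2 ;;
      --json)     format="json"; shift ;;   # assumption: --json selects response_format=json
      *) echo "unknown flag: $1" >&2; return 2 ;;
    esac
  done

  # Rebuild the positional parameters as the curl argument list,
  # so values containing spaces (such as prompts) stay intact.
  set -- -sS https://api.openai.com/v1/audio/transcriptions \
    -H "Authorization: Bearer ${OPENAI_API_KEY}" \
    -F "file=@${input}" -F "model=${model}" -F "response_format=${format}"
  if [ -n "$language" ]; then set -- "$@" -F "language=${language}"; fi
  if [ -n "$prompt" ]; then set -- "$@" -F "prompt=${prompt}"; fi

  # CURL can be overridden (for example CURL=echo) to inspect the command without sending it.
  "${CURL:-curl}" "$@" -o "$out"
}
```

Optional fields are only added when a flag supplies them, so a call without flags sends just the file, the model, and the response format.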

API key

Set OPENAI_API_KEY, or configure it in ~/.clawdis/clawdis.json:

{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE"
    }
  }
}
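Either way, the request needs a key in the environment at call time. A small guard like the one below can fail early with a readable message instead of an authentication error from the API. This is a sketch: require_api_key is a hypothetical helper, and it assumes the key configured in clawdis.json reaches the script as OPENAI_API_KEY, which the primaryEnv metadata suggests but this document does not state.

```shell
# Hypothetical guard, not taken from transcribe.sh: fail early when no key is available.
# Assumption: a key configured in clawdis.json is exposed to the script as OPENAI_API_KEY.
require_api_key() {
  if [ -z "${OPENAI_API_KEY:-}" ]; then
    echo "OPENAI_API_KEY is not set; export it or add apiKey to ~/.clawdis/clawdis.json" >&2
    return 1
  fi
}
```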