Peter Steinberger e637bbdfb5 feat: add Deepgram audio transcription

Co-authored-by: Safzan Pirani <safzanpirani@users.noreply.github.com>

2026-01-17 08:53:42 +00:00

summary: Deepgram transcription for inbound voice notes

read_when:
- You want Deepgram speech-to-text for audio attachments
- You need a quick Deepgram config example

Deepgram (Audio Transcription)

Deepgram is a speech-to-text API. Clawdbot uses it to transcribe inbound audio and voice notes, configured under tools.media.audio.

When enabled, Clawdbot uploads the audio file to Deepgram and injects the transcript into the reply pipeline ({{Transcript}} + [Audio] block). This is not streaming; it uses the pre-recorded transcription endpoint.
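The upload step described above can be sketched roughly as follows. This is an illustration under stated assumptions, not Clawdbot's implementation: it targets Deepgram's pre-recorded endpoint `https://api.deepgram.com/v1/listen` with `Token` authentication, and the helper names (`build_request`, `extract_transcript`, `transcribe`), the `audio/ogg` default, and the 60-second timeout are hypothetical choices made for the example.

```python
import json
import urllib.parse
import urllib.request

# Deepgram's pre-recorded (non-streaming) transcription endpoint.
DEEPGRAM_URL = "https://api.deepgram.com/v1/listen"


def build_request(audio: bytes, api_key: str, mime_type: str = "audio/ogg",
                  model: str = "nova-3") -> urllib.request.Request:
    """Build a POST request that uploads raw audio bytes (hypothetical helper)."""
    query = urllib.parse.urlencode({"model": model})
    return urllib.request.Request(
        f"{DEEPGRAM_URL}?{query}",
        data=audio,
        method="POST",
        headers={
            "Authorization": f"Token {api_key}",
            "Content-Type": mime_type,  # assumed default; match the real file type
        },
    )


def extract_transcript(response_body: dict) -> str:
    """Return the first alternative of the first channel, or "" if absent."""
    try:
        return response_body["results"]["channels"][0]["alternatives"][0]["transcript"]
    except (KeyError, IndexError, TypeError):
        return ""


def transcribe(audio: bytes, api_key: str, timeout: float = 60.0) -> str:
    """Upload audio and return the transcript text (performs a network call)."""
    request = build_request(audio, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return extract_transcript(json.load(resp))
```

The whole file goes out in a single request and the transcript comes back in a single JSON response, which is what distinguishes this path from a streaming connection.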

Website: https://deepgram.com
Docs: https://developers.deepgram.com

Quick start

Set your API key:

DEEPGRAM_API_KEY=dg_...

Enable the provider:

{
  tools: {
    media: {
      audio: {
        enabled: true,
        models: [{ provider: "deepgram", model: "nova-3" }]
      }
    }
  }
}

Options

model: Deepgram model id (default: nova-3)
language: language hint (optional)

Example with language:

{
  tools: {
    media: {
      audio: {
        enabled: true,
        models: [
          { provider: "deepgram", model: "nova-3", language: "en" }
        ]
      }
    }
  }
}
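As a rough illustration of how the two options might reach Deepgram, the sketch below turns one `models` entry into query parameters. `model` and `language` are real Deepgram query parameters; the function name `deepgram_query` and the exact mapping are assumptions, not Clawdbot's actual code.

```python
from urllib.parse import urlencode

DEFAULT_MODEL = "nova-3"  # default model id stated in the Options list


def deepgram_query(entry: dict) -> str:
    """Turn one `models` entry into a query string (hypothetical mapping)."""
    params = {"model": entry.get("model") or DEFAULT_MODEL}
    # The language hint is optional; omit the parameter when it is not set.
    if entry.get("language"):
        params["language"] = entry["language"]
    return urlencode(params)


print(deepgram_query({"provider": "deepgram", "model": "nova-3", "language": "en"}))
# model=nova-3&language=en
print(deepgram_query({"provider": "deepgram"}))
# model=nova-3
```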

Notes

Authentication follows the standard provider auth order; DEEPGRAM_API_KEY is the simplest path.
Output follows the same audio rules as other providers (size caps, timeouts, transcript injection).
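For the authentication note, the environment-variable path can be checked with a small helper like the one below. It covers only `DEEPGRAM_API_KEY`; the other steps of the provider auth order are not reproduced here, and the function name `deepgram_api_key` is hypothetical.

```python
import os


def deepgram_api_key(env=None) -> str:
    """Read DEEPGRAM_API_KEY, raising a clear error when it is missing."""
    env = os.environ if env is None else env
    key = (env.get("DEEPGRAM_API_KEY") or "").strip()
    if not key:
        raise RuntimeError("DEEPGRAM_API_KEY is not set")
    return key
```

Failing early with an explicit message makes a missing key easier to diagnose than an authentication error returned by the API.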
