1.4 KiB
1.4 KiB
summary, read_when
| summary | read_when | ||
|---|---|---|---|
| Deepgram transcription for inbound voice notes |
|
Deepgram (Audio Transcription)
Deepgram is a speech-to-text API. In Clawdbot it is used for inbound audio/voice note
transcription via tools.media.audio.
When enabled, Clawdbot uploads the audio file to Deepgram and injects the transcript
into the reply pipeline ({{Transcript}} + [Audio] block). This is not streaming;
it uses the pre-recorded transcription endpoint.
Website: https://deepgram.com
Docs: https://developers.deepgram.com
Quick start
- Set your API key:
DEEPGRAM_API_KEY=dg_...
- Enable the provider:
{
tools: {
media: {
audio: {
enabled: true,
models: [{ provider: "deepgram", model: "nova-3" }]
}
}
}
}
Options
model: Deepgram model id (default:nova-3)language: language hint (optional)
Example with language:
{
tools: {
media: {
audio: {
enabled: true,
models: [
{ provider: "deepgram", model: "nova-3", language: "en" }
]
}
}
}
}
Notes
- Authentication follows the standard provider auth order;
DEEPGRAM_API_KEYis the simplest path. - Output follows the same audio rules as other providers (size caps, timeouts, transcript injection).