POST /v2/audio/transcriptions
curl --request POST \
  --url https://api.sully.ai/v2/audio/transcriptions \
  --header 'Content-Type: multipart/form-data' \
  --header 'X-ACCOUNT-ID: <account-id>' \
  --header 'X-API-KEY: <api-key>' \
  --form language=en \
  --form multichannel=false \
  --form audio=@example-file
{
  "data": {
    "id": "tr_abc123def456",
    "status": "pending",
    "created_at": "2024-01-15T10:30:00Z",
    "updated_at": "2024-01-15T10:30:00Z"
  }
}
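
The example response wraps the new transcription in a "data" object. A minimal sketch of reading the id and status out of that body is shown here; the function name is illustrative and not part of the API, and the field names are taken from the example response rather than from a documented schema.

```python
import json


def parse_transcription_response(body: str) -> tuple[str, str]:
    """Return (id, status) from a response shaped like the example above."""
    payload = json.loads(body)
    data = payload["data"]  # "data" is the only field marked required
    return data["id"], data["status"]


# Sample body copied from the example response.
sample = """
{
  "data": {
    "id": "tr_abc123def456",
    "status": "pending",
    "created_at": "2024-01-15T10:30:00Z",
    "updated_at": "2024-01-15T10:30:00Z"
  }
}
"""

transcription_id, status = parse_transcription_response(sample)
print(transcription_id, status)
```

Keeping the returned id is probably what a caller wants, since the example status is "pending" and the transcript itself is not part of this response.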

Authorizations

X-API-KEY
string
header
required
X-ACCOUNT-ID
string
header
required
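
Both headers are required on every request. The sketch below builds them in one place; the helper name and the environment variable names SULLY_API_KEY and SULLY_ACCOUNT_ID are assumptions made for illustration, not names defined by the API.

```python
import os


def build_auth_headers(api_key: str, account_id: str) -> dict[str, str]:
    """Return the two authentication headers listed above."""
    if not api_key or not account_id:
        raise ValueError("both X-API-KEY and X-ACCOUNT-ID are required")
    return {"X-API-KEY": api_key, "X-ACCOUNT-ID": account_id}


# Hypothetical environment variable names; placeholders are used when unset.
headers = build_auth_headers(
    os.environ.get("SULLY_API_KEY", "<api-key>"),
    os.environ.get("SULLY_ACCOUNT_ID", "<account-id>"),
)
```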

Body

multipart/form-data
audio
file
required

Audio file to transcribe (max 100 MB).
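
Checking the file size before uploading avoids sending a request that would exceed the limit. This is a sketch under two assumptions: the page does not say whether the limit is decimal or binary megabytes, so the smaller decimal value is used, and empty files are rejected as a local choice. The function names are illustrative.

```python
import os

# Assumption: 100 MB read as decimal megabytes, the stricter interpretation.
MAX_AUDIO_BYTES = 100 * 1000 * 1000


def is_within_upload_limit(size_bytes: int) -> bool:
    """True when the size is non-zero and does not exceed the assumed limit."""
    return 0 < size_bytes <= MAX_AUDIO_BYTES


def check_audio_file(path: str) -> int:
    """Return the file size in bytes, or raise if it cannot be uploaded."""
    size = os.path.getsize(path)
    if not is_within_upload_limit(size):
        raise ValueError(f"{path}: {size} bytes is outside the allowed range")
    return size
```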

language
enum<string>
default:en

Language code for the audio content. Different models are used depending on the language.

Available options:
en, en-US, es, de, fr, hi, it, ja, nl, pt, ru, bg, ca, zh, zh-CN, zh-Hans, zh-TW, zh-Hant, zh-HK,
cs, da, da-DK, en-AU, en-GB, en-NZ, en-IN, et, fi, nl-BE, fr-CA, de-CH, el, hu, id, ko, ko-KR,
lv, lt, ms, no, pl, pt-BR, pt-PT, ro, sk, es-419, sv, sv-SE, th, th-TH, tr, uk, vi

Example: "en"

encoding
enum<string>

Audio encoding format. Optional; if not specified, the API auto-detects the encoding. Common formats:

  • linear16: 16-bit linear PCM
  • flac: Free Lossless Audio Codec
  • mulaw: μ-law encoding
  • ogg_opus: Opus codec in OGG container
  • webm_opus: Opus codec in WebM container
Available options:
linear16, flac, mulaw, amr, amr_wb, ogg_opus, speex, webm_opus

Example: "linear16"

multichannel
boolean
default:false

Enable multichannel audio processing.
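
The non-file form fields can be validated locally before the request is sent. The sketch below checks language and encoding against the option lists on this page and serializes multichannel as lowercase text, matching the "multichannel=false" form value in the curl example. The helper name is illustrative, and both the exact-match comparison and the "true" spelling are assumptions; the API may be more lenient.

```python
# Option lists copied from this page.
LANGUAGES = frozenset("""
en en-US es de fr hi it ja nl pt ru bg ca zh zh-CN zh-Hans zh-TW zh-Hant zh-HK
cs da da-DK en-AU en-GB en-NZ en-IN et fi nl-BE fr-CA de-CH el hu id ko ko-KR
lv lt ms no pl pt-BR pt-PT ro sk es-419 sv sv-SE th th-TH tr uk vi
""".split())

ENCODINGS = frozenset(
    {"linear16", "flac", "mulaw", "amr", "amr_wb", "ogg_opus", "speex", "webm_opus"}
)


def build_form_fields(language="en", encoding=None, multichannel=False):
    """Return the text form fields to send alongside the audio file."""
    if language not in LANGUAGES:
        raise ValueError(f"unsupported language: {language!r}")
    fields = {
        "language": language,
        # Assumption: booleans are sent as lowercase "true" / "false".
        "multichannel": "true" if multichannel else "false",
    }
    if encoding is not None:  # omitted so the API can auto-detect
        if encoding not in ENCODINGS:
            raise ValueError(f"unsupported encoding: {encoding!r}")
        fields["encoding"] = encoding
    return fields


print(build_form_fields("es", encoding="flac", multichannel=True))
```

Leaving encoding out of the dictionary when it is None, rather than sending an empty value, keeps the request in line with the documented auto-detect behavior.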

Response

Transcription created successfully

data
object
required