Audio Transcriptions
Create Audio Transcription
Upload an audio file for transcription.
Processing is asynchronous: the endpoint returns immediately with a pending status.
Use the GET endpoint to check status and retrieve results.
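The submit-then-poll flow described above can be sketched as a small helper. This is only an illustration under stated assumptions: the `fetch_status` callable stands in for whatever code performs the GET request, and the `"status"` key name is hypothetical, since the response shape is not specified here. Only the `pending` value comes from the description above.

```python
import time
from typing import Any, Callable, Dict


def wait_for_transcription(
    fetch_status: Callable[[], Dict[str, Any]],
    interval_s: float = 2.0,
    timeout_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> Dict[str, Any]:
    """Call fetch_status() repeatedly until the status is no longer "pending".

    fetch_status is a caller-supplied function (hypothetical) that performs
    the GET request and returns the decoded response as a dict. The "status"
    key is an assumed field name.
    """
    deadline = clock() + timeout_s
    while True:
        result = fetch_status()
        if result.get("status") != "pending":
            # Any non-pending status (success or failure) ends the wait.
            return result
        if clock() >= deadline:
            raise TimeoutError("transcription still pending after timeout")
        sleep(interval_s)
```

Passing `sleep` and `clock` as parameters keeps the helper easy to exercise without real delays; in normal use the defaults are sufficient.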
POST
In addition to the multipart form fields shown below, this endpoint accepts an optional dictation field. Send dictation=true to request dictation-oriented transcript formatting. If omitted, dictation defaults to false.
Body
multipart/form-data
Audio file to transcribe (max 100MB)
Language code for the audio content. Different models are used depending on the language.
Available options:
en, en-US, es, de, fr, hi, it, ja, nl, pt, ru, bg, ca, zh, zh-CN, zh-Hans, zh-TW, zh-Hant, zh-HK, cs, da, da-DK, en-AU, en-GB, en-NZ, en-IN, et, fi, nl-BE, fr-CA, de-CH, el, hu, id, ko, ko-KR, lv, lt, ms, no, pl, pt-BR, pt-PT, ro, sk, es-419, sv, sv-SE, th, th-TH, tr, uk, vi
Example:
"en"
Audio encoding format. Optional; if not specified, the API auto-detects the format. Common formats:
linear16: 16-bit linear PCM
flac: Free Lossless Audio Codec
mulaw: μ-law encoding
ogg_opus: Opus codec in OGG container
webm_opus: Opus codec in WebM container
Available options:
linear16, flac, mulaw, amr, amr_wb, ogg_opus, speex, webm_opus
Example:
"linear16"
Enable multichannel audio processing
Enable dictation-oriented transcript formatting. When sent as multipart form-data, provide true or false.
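As a rough illustration of the body fields above, the following sketch validates values on the client and builds the text part of a multipart form. Only the field name dictation is confirmed by this page; the names language, encoding, and multichannel are assumptions, as is reading "100MB" as 100 × 1024 × 1024 bytes. The allowed values are copied from the option lists above.

```python
from typing import Dict, Optional

# Assumption: "100MB" is read as 100 * 1024 * 1024 bytes.
MAX_FILE_BYTES = 100 * 1024 * 1024

# Allowed values, copied from the option lists in this section.
LANGUAGES = set(
    "en en-US es de fr hi it ja nl pt ru bg ca zh zh-CN zh-Hans zh-TW zh-Hant "
    "zh-HK cs da da-DK en-AU en-GB en-NZ en-IN et fi nl-BE fr-CA de-CH el hu "
    "id ko ko-KR lv lt ms no pl pt-BR pt-PT ro sk es-419 sv sv-SE th th-TH tr "
    "uk vi".split()
)
ENCODINGS = {
    "linear16", "flac", "mulaw", "amr", "amr_wb",
    "ogg_opus", "speex", "webm_opus",
}


def check_file_size(size_bytes: int) -> None:
    """Reject files larger than the documented upload limit."""
    if size_bytes > MAX_FILE_BYTES:
        raise ValueError(f"file is {size_bytes} bytes; limit is {MAX_FILE_BYTES}")


def build_form_fields(
    language: Optional[str] = None,
    encoding: Optional[str] = None,
    multichannel: Optional[bool] = None,
    dictation: Optional[bool] = None,
) -> Dict[str, str]:
    """Build the non-file form fields, omitting any that are left as None.

    Field names other than "dictation" are hypothetical.
    """
    fields: Dict[str, str] = {}
    if language is not None:
        if language not in LANGUAGES:
            raise ValueError(f"unsupported language code: {language!r}")
        fields["language"] = language
    if encoding is not None:
        if encoding not in ENCODINGS:
            raise ValueError(f"unsupported encoding: {encoding!r}")
        fields["encoding"] = encoding
    if multichannel is not None:
        fields["multichannel"] = "true" if multichannel else "false"
    if dictation is not None:
        # Multipart form values are text, so booleans are sent as true/false.
        fields["dictation"] = "true" if dictation else "false"
    return fields
```

The resulting dictionary could be passed as the form data of a multipart request alongside the audio file, using whichever HTTP client is preferred; the endpoint URL and authentication are not covered here.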
Response
Transcription created successfully