transcribe
Convert spoken words from audio to text.
Fast transcription of short audio files in WAV format
TRANSCRIBE FAST
Transcribes WAV formatted audio as quickly as possible.
This endpoint has the following limitations:
- Accepts WAV files only
- Audio must be mono, with a 16 kHz sample rate and a 16-bit sample width
- Maximum audio length of 6 minutes or file size of 11 MB
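The limits above can be checked on the client before uploading. The following is a minimal sketch using Python's standard `wave` module. The function name `check_wav_for_fast` is hypothetical, and reading "11 MB" as binary megabytes is an assumption.

```python
import io
import wave
from typing import List

MAX_SECONDS = 6 * 60          # 6-minute limit from the list above
MAX_BYTES = 11 * 1024 * 1024  # assumption: "11 MB" means binary megabytes


def check_wav_for_fast(data: bytes) -> List[str]:
    """Return the violated limits; an empty list means the file looks acceptable."""
    problems = []
    if len(data) > MAX_BYTES:
        problems.append("file larger than 11 MB")
    try:
        with wave.open(io.BytesIO(data), "rb") as wav:
            rate = wav.getframerate()
            if wav.getnchannels() != 1:
                problems.append("audio is not mono")
            if rate != 16000:
                problems.append("sample rate is not 16 kHz")
            if wav.getsampwidth() != 2:  # 2 bytes per sample = 16-bit
                problems.append("sample width is not 16-bit")
            if rate and wav.getnframes() / rate > MAX_SECONDS:
                problems.append("audio longer than 6 minutes")
    except (wave.Error, EOFError):
        problems.append("not a readable WAV file")
    return problems
```

An empty result only means the documented limits are met locally; the service may still reject the file for other reasons.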
Args
upload
: file content in bytes as part of form data.
language_code
: (optional) the expected language if known - [zul, sot, afr, eng, fra]
Query Parameters
lang_code
Request Body
upload
string · binary · required
Responses
Transcribe an audio file (received as base64) in one request
TRANSCRIBE SYNC
Accepts an audio file in one of the supported formats and returns the recognised speech within the same HTTP request/response cycle.
Supported formats: wav, mp3, flac, aac, ogg, mp4, aiff, opus
Note: processing may take up to 2 minutes 30 seconds, or fail altogether, on long, low-quality or complex audio.
Args
file_name
: name of the audio file
audio_blob
: file byte contents as a base64-encoded string
language_code
: (optional) the expected language if known - [zul, sot, afr, eng, fra]
Request Body
file_name
string · required
audio_blob
required
file_size
integer · required
language_code
channels
sample_rate
frame_rate
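As an illustration of the request body above, here is a minimal sketch that builds the required fields from raw audio bytes. The helper name `build_sync_payload` is hypothetical. It assumes `file_size` is the size of the audio before base64 encoding, which the source does not state.

```python
import base64
from typing import Optional


def build_sync_payload(file_name: str, audio: bytes,
                       language_code: Optional[str] = None) -> dict:
    """Build a JSON-serialisable body with the required fields."""
    payload = {
        "file_name": file_name,
        # audio_blob is the file's bytes as a base64-encoded string
        "audio_blob": base64.b64encode(audio).decode("ascii"),
        # assumption: size of the raw bytes, not of the encoded string
        "file_size": len(audio),
    }
    if language_code is not None:
        payload["language_code"] = language_code  # e.g. "eng"
    return payload
```

Base64 encoding grows the data by about a third, so large recordings may be better suited to the file-based endpoint.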
Responses
Transcribe an audio file (received as a file) in one request
TRANSCRIBE SYNC FILE
Accepts an audio file in one of the supported formats and returns the recognised speech within the same HTTP request/response cycle.
Supported formats: wav, mp3, flac, aac, ogg, mp4, aiff, opus
Note: processing may take up to 2 minutes 30 seconds, or fail altogether, on long, low-quality or complex audio.
Args
file
: file content in bytes as part of form data.
lang_code
: (optional) the expected language if known - [zul, sot, afr, eng, fra]
diarise
: (optional) enable diarisation
detect_music
: (optional) enable music detection
Returns
TranscriptionDto
: a data transfer object containing the transcribed text and other details
Raises
HTTPException 415
: if the file type is not supported
HTTPException 413
: if the file size exceeds the maximum allowed limit (100MB)
HTTPException 503
: if downstream services are not yet available to serve requests.
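A client might treat the three documented errors differently. The sketch below maps each status code to its documented meaning and to a retry hint. Treating 503 as retryable and the other two as permanent is an assumption, and the names `DOCUMENTED_ERRORS` and `should_retry` are hypothetical.

```python
# status code -> (documented meaning, retry hint)
DOCUMENTED_ERRORS = {
    415: ("file type is not supported", False),
    413: ("file size exceeds the 100MB limit", False),
    # assumption: "not yet available" suggests a later attempt may succeed
    503: ("downstream services are not yet available", True),
}


def should_retry(status_code: int) -> bool:
    """Return True only for documented errors that look transient."""
    return DOCUMENTED_ERRORS.get(status_code, ("", False))[1]
```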
Query Parameters
lang_code
diarise
boolean
Enable diarisation
detect_music
boolean
Enable music detection
Request Body
file
string · binary · required
Responses
id
upload_file_size
audio_length_seconds
sample_rate
channels
frame_rate
mime_type
language_code
diarisation_result
transcription_text
transcription_status
error_message
status_datetime
upload_datetime
warnings
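The response fields above can be read as a plain dictionary after JSON decoding. This is a minimal sketch of pulling out the transcript. The helper name `extract_text` is hypothetical, and it assumes a non-empty `error_message` signals failure, which the source does not confirm.

```python
def extract_text(dto: dict) -> str:
    """Return the transcript, or raise if the response carries an error message."""
    error = dto.get("error_message")
    if error:  # assumption: a non-empty error_message means the job failed
        raise RuntimeError(f"transcription failed: {error}")
    # transcription_text may be missing or null; fall back to an empty string
    return dto.get("transcription_text") or ""
```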
List all the batches that have been created
Responses
status
string · enum · required
Enum values: REQUESTED, UPLOADED, PROCESSING, COMPLETE, FAILED, RETRYING
id
customer_id
Services customer id.
project_id
Services project id.
keychain_id
Keychain ID.
language_code
created_at
updated_at
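When polling a batch, the `status` values can be modelled as an enumeration. In the sketch below, treating COMPLETE and FAILED as the only terminal states is an assumption, and the names `BatchStatus` and `is_finished` are hypothetical.

```python
from enum import Enum


class BatchStatus(str, Enum):
    REQUESTED = "REQUESTED"
    UPLOADED = "UPLOADED"
    PROCESSING = "PROCESSING"
    COMPLETE = "COMPLETE"
    FAILED = "FAILED"
    RETRYING = "RETRYING"


# assumption: only these two states mean no further change is expected
TERMINAL = {BatchStatus.COMPLETE, BatchStatus.FAILED}


def is_finished(status: str) -> bool:
    """Return True once a batch is in a terminal state; raises ValueError on unknown values."""
    return BatchStatus(status) in TERMINAL
```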
Get more details of a specific batch
List all the transcripts in a batch
Get usage statistics for the customer
GET TRANSCRIBE USAGE
Accepts a start date, end date, and interval and returns usage statistics for the customer.
Args
start_date_time
: start date and time in ISO format (YYYY-MM-DDTHH:MM:SSZ)
end_date_time
: end date and time in ISO format (YYYY-MM-DDTHH:MM:SSZ)
interval
: interval for aggregating data (e.g., 'minute', 'hour', 'day')
Returns
TranscribeUsageReport
: a report containing usage statistics for the customer
Raises
HTTPException 400
: if the date format is invalid, the interval is invalid, or the start date is after the end date
Query Parameters
start_date_time
string · required
Start date and time in ISO format (YYYY-MM-DDTHH:MM:SSZ)
end_date_time
string · required
End date and time in ISO format (YYYY-MM-DDTHH:MM:SSZ)
interval
string · required
Interval for aggregating data (e.g., 'minute', 'hour', 'day')
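The three conditions that produce a 400 can be mirrored on the client before sending the request. The following is a minimal sketch. The helper name `build_usage_params` is hypothetical, and the accepted interval set is limited to the three examples given, since the full list is not documented here.

```python
from datetime import datetime

ISO_FORMAT = "%Y-%m-%dT%H:%M:%SZ"      # YYYY-MM-DDTHH:MM:SSZ
INTERVALS = {"minute", "hour", "day"}  # assumption: only the documented examples


def build_usage_params(start: str, end: str, interval: str) -> dict:
    """Validate the inputs locally and return the query parameters."""
    # strptime raises ValueError if the date format is invalid
    start_dt = datetime.strptime(start, ISO_FORMAT)
    end_dt = datetime.strptime(end, ISO_FORMAT)
    if start_dt > end_dt:
        raise ValueError("start date is after the end date")
    if interval not in INTERVALS:
        raise ValueError(f"unsupported interval: {interval!r}")
    return {"start_date_time": start, "end_date_time": end, "interval": interval}
```

For example, `build_usage_params("2024-01-01T00:00:00Z", "2024-01-31T23:59:59Z", "day")` returns a dictionary that can be passed as query parameters by an HTTP client.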
Responses
total_invocations
integer · required
total_successful_invocations
integer · required
total_failed_invocations
integer · required
total_pending_invocations
integer · required
total_seconds_transcribed
number · required
total_tokens
integer · required
start_date_time
string · required
end_date_time
string · required
interval
string · required
usage
object[]
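The totals in the report lend themselves to simple derived metrics. As a small sketch, the hypothetical helper `success_rate` below computes the share of successful invocations from a decoded report, assuming the report is available as a dictionary keyed by the field names above.

```python
def success_rate(report: dict) -> float:
    """Fraction of invocations that succeeded; 0.0 when there were none."""
    total = report["total_invocations"]
    if not total:
        return 0.0
    return report["total_successful_invocations"] / total
```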