Generate audio transcription asynchronously.
| Parameter | Type | Description |
|---|---|---|
| code | string | Response code indicating the result status |
| msg | string | Response message describing the operation result |
| data | object | Response data object containing task information |
| data.task_id | string | Unique identifier of the transcription task |
| success | boolean | Indicates whether the operation was successful |
| failed | boolean | Indicates whether the operation failed |
| Parameter | Type | Description |
|---|---|---|
| code | string | Response code (“0000” indicates success) |
| message | string | Status message (e.g., “SUCCESS”) |
| data | object | Response data object containing the transcription result and metadata |
| data.data | object | Inner data object containing transcription segments and usage information |
| data.data.data | array | Array of transcription segments with timestamps |
| data.data.data[].start_time | number | Start time of the segment in seconds |
| data.data.data[].end_time | number | End time of the segment in seconds |
| data.data.data[].text | string | Transcription text for this segment |
| data.data.data[].speaker | string | null | Speaker identifier if speaker=true was requested, otherwise null |
| data.data.usage_metadata | object | Usage statistics for the API call |
| data.data.usage_metadata.duration | number | Audio duration in seconds |
| data.data.usage_metadata.model | string | The model used for transcription (e.g., “whisper-1”) |
| data.data.usage_metadata.output_tokens | integer | Number of output tokens (0 for audio transcription) |
| data.data.usage_metadata.prompt_tokens | integer | Number of prompt tokens (0 for audio transcription) |
| data.msg | string | Detailed message about the operation result |
| data.success | boolean | Indicates whether the transcription was successful |
| task_id | string | The task ID associated with this transcription request |
speaker field in each transcription segment will only contain a speaker identifier (e.g., “SPEAKER_00”) when the request parameter speaker=true is set. Otherwise, it will be null.data.data.
Response Structure:
callback_response.data.data.datacallback_response.data.data.data[0].textcallback_response.data.data.data[0].speaker (will be null if speaker=false)callback_response.data.data.data[0].start_time to callback_response.data.data.data[0].end_timecallback_response.data.data.usage_metadatacallback_response.data.data.usage_metadata.durationcallback_response.data.data.usage_metadata.modelcallback_response.data.successcallback_response.task_idTranscription task information
Response code indicating the result status
"0000"
Response message describing the operation result
"success"
Response data object containing task information
Indicates whether the operation was successful
true
Indicates whether the operation failed
false