Transcribe audio to text using ElevenLabs Scribe V2 model.
asset_id > url > source_url).
sk-mai-xxx).elevenlabs for this endpoint.re_xxx). Resolved to a signed GCS URL.gs:// GCS path or public HTTP URL. GCS paths are converted to signed URLs automatically.en, zh). If omitted, the provider auto-detects the language.none, segment, or word.pcm_s16le_16000).| Parameter | Type | Description |
|---|---|---|
| data.language_code | string | Detected language code (ISO 639-1) |
| data.language_probability | number | Confidence of language detection (0.0–1.0) |
| data.text | string | Full transcription text |
| data.words | array[object] | Word-level transcription with timing |
| data.words[].text | string | The word or spacing text |
| data.words[].start | number | Start time in seconds |
| data.words[].end | number | End time in seconds |
| data.words[].type | string | Token type: word, spacing, or audio_event |
| data.words[].speaker_id | string | Speaker identifier (e.g. speaker_0). Only present when diarize=true. |
0.52).