Speech Translation: start task
POST /api/technology/speech-translation-whisper-enhanced
Start an Enhanced Speech Translation Built on Whisper task for a media file.
Features of Enhanced Speech Translation Built on Whisper
- Multi-channel audio files are supported.
- Channel ID is included in individual translation segments.
- The target language of the translation can be specified with the output_language query parameter.
- The source language for translating the whole media file can be forced with the source_language query parameter; otherwise it is detected automatically.
- Language switching can be activated with the language_switching query parameter. With this feature, the technology automatically identifies the predominant language spoken within each thirty-second segment of the file and uses it as the source language for translating that particular segment.
- The source_language and language_switching parameters are mutually exclusive.
- A language explicitly set as the source_language must be licensed. Otherwise, an error is raised.
- When the source_language is autodetected or language_switching is used, only licensed languages are considered as the source for the translation. If the detected language is not licensed, the closest licensed language is used instead.
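As an illustration of how the query parameters combine, here is a minimal Python sketch that builds the request URL and rejects the mutually exclusive combination before any request is sent. The helper name build_start_url, the base URL, the language codes, and the "true" encoding of language_switching are assumptions made for this example; only the endpoint path and the three parameter names come from this page.

```python
from urllib.parse import urlencode

# Endpoint path as documented on this page.
ENDPOINT = "/api/technology/speech-translation-whisper-enhanced"


def build_start_url(base_url, output_language=None, source_language=None,
                    language_switching=False):
    """Build the URL for starting a task (hypothetical helper).

    Mirrors the documented rule that source_language and
    language_switching cannot be used together.
    """
    if source_language is not None and language_switching:
        raise ValueError(
            "source_language and language_switching are mutually exclusive"
        )

    params = {}
    if output_language is not None:
        params["output_language"] = output_language
    if source_language is not None:
        params["source_language"] = source_language
    if language_switching:
        # Assumption: the flag is sent as the string "true".
        params["language_switching"] = "true"

    url = base_url.rstrip("/") + ENDPOINT
    query = urlencode(params)
    return f"{url}?{query}" if query else url


if __name__ == "__main__":
    # Assumed base URL and language codes, for illustration only.
    print(build_start_url("http://localhost:8000",
                          output_language="en", source_language="cs"))
```

Checking the constraint on the client side only saves a round trip; the server remains the authority on which parameter values and languages are accepted.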
Request
Responses
- 202: Speech Translation task was accepted. Follow the Location header to poll for the task state.
  Response Headers
  - X-Location: ⚠️ Deprecated - use Location header instead.
    Example: /api/technology/speech-translation-whisper-enhanced/123e4567-e89b-12d3-a456-426614174000
  - Location: A URL the client should poll for task state and result.
    Example: /api/technology/speech-translation-whisper-enhanced/123e4567-e89b-12d3-a456-426614174000
- 400: Request payload data was invalid and could not be parsed.
- 403: Request is forbidden.
- 405: Invalid HTTP method.
- 413: The request entity (payload) size exceeds the allowed limit.
- 422: Error during validation of request payload data occurred.
- 507: The storage is full and cannot accept any data.
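A client typically needs to turn the start-task response into either a polling URL or an error. The following Python sketch shows one way to do that from a status code and a header mapping. The function name poll_location and the choice to raise RuntimeError are assumptions for illustration; the status codes, their meanings, and the two header names are taken from this page.

```python
# Error meanings as listed on this page.
ERROR_MEANINGS = {
    400: "Request payload data was invalid and could not be parsed.",
    403: "Request is forbidden.",
    405: "Invalid HTTP method.",
    413: "The request entity (payload) size exceeds the allowed limit.",
    422: "Error during validation of request payload data occurred.",
    507: "The storage is full and cannot accept any data.",
}


def poll_location(status, headers):
    """Return the URL to poll for task state (hypothetical helper).

    Prefers the Location header and falls back to the deprecated
    X-Location header. Raises RuntimeError for any non-202 status.
    """
    if status != 202:
        reason = ERROR_MEANINGS.get(status, "Unexpected response status.")
        raise RuntimeError(f"{status}: {reason}")

    # HTTP header names are case-insensitive, so normalize before lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    location = lowered.get("location") or lowered.get("x-location")
    if location is None:
        raise RuntimeError("202 response did not include a Location header")
    return location


if __name__ == "__main__":
    task_path = ("/api/technology/speech-translation-whisper-enhanced/"
                 "123e4567-e89b-12d3-a456-426614174000")
    print(poll_location(202, {"Location": task_path}))
```

Keeping this logic independent of any particular HTTP library means it can be fed the status and headers from whichever client is in use.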