MiniMax Speech 2.8 HD Synchronous Speech Synthesis
Audio
MiniMax Speech 2.8 HD Synchronous Speech Synthesis
POST
MiniMax Speech 2.8 HD Synchronous Speech Synthesis
Convert text to speech, with support for multiple voices, emotion control, speech rate adjustment, and more. The text length must be less than 10000 characters. If the text length is greater than 3000 characters, streaming output is recommended.
Request Headers
Enum value:
application/jsonBearer authentication format: Bearer {{API Key}}.
Request Body
The text to synthesize into speech. The length must be less than 10000 characters. If the text length is greater than 3000 characters, streaming output is recommended. Supports paragraph switching (line breaks), pause control (
<#x#> markers), and filler/sound effect tags (such as (laughs), (coughs), etc.; supported only by speech-2.8-hd/turbo)Controls whether to use streaming output. The default is false, meaning streaming is not enabled
Parameter that controls the form of the output result. Available values are url and hex. The default value is hex. This parameter only takes effect in non-streaming scenarios; streaming scenarios only support returning in hex form. The returned url is valid for 24 hoursAvailable values:
url, hexControls whether to add an audio rhythm identifier at the end of the synthesized audio. The default value is false. This parameter only takes effect for non-streaming synthesis
Whether to enhance recognition capability for specified low-resource languages and dialects. The default value is null. It can be set to auto to let the model determine automaticallyAvailable values:
Chinese, Chinese,Yue, English, Arabic, Russian, Spanish, French, Portuguese, German, Turkish, Dutch, Ukrainian, Vietnamese, Indonesian, Japanese, Italian, Korean, Thai, Polish, Romanian, Greek, Czech, Finnish, Hindi, Bulgarian, Danish, Hebrew, Malay, Persian, Slovak, Swedish, Croatian, Filipino, Hungarian, Norwegian, Slovenian, Catalan, Nynorsk, Tamil, Afrikaans, autoMixed voice settings. Supports mixing up to 4 voices
Controls whether to enable the subtitle service. The default value is false. This parameter is only valid in non-streaming output scenarios, and only for the speech-2.6-hd, speech-2.6-turbo, speech-02-turbo, speech-02-hd, speech-01-turbo, and speech-01-hd models
Enable this parameter to make transitions between clauses more natural. Only the speech-2.8-hd and speech-2.8-turbo models are supported
Response Information
The returned synthesis data object. It may be null, so a non-null check is required
The id of this session, used to help locate issues during consultation/feedback
The status code and details of this request
Additional information about the audio