Skip to main content
POST
/
v3
/
minimax-speech-2.8-hd
MiniMax Speech 2.8 HD Synchronous Speech Synthesis
curl --request POST \
  --url https://api.highwayapi.ai/v3/minimax-speech-2.8-hd \
  --header 'Authorization: <authorization>' \
  --header 'Content-Type: <content-type>' \
  --data '
{
  "text": "<string>",
  "stream": true,
  "voice_modify": {
    "pitch": 123,
    "timbre": 123,
    "intensity": 123,
    "sound_effects": "<string>"
  },
  "audio_setting": {
    "format": "<string>",
    "bitrate": 123,
    "channel": 123,
    "force_cbr": true,
    "sample_rate": 123
  },
  "output_format": "<string>",
  "voice_setting": {
    "vol": 123,
    "pitch": 123,
    "speed": 123,
    "emotion": "<string>",
    "voice_id": "<string>",
    "latex_read": true,
    "text_normalization": true
  },
  "aigc_watermark": true,
  "language_boost": "<string>",
  "stream_options": {
    "exclude_aggregated_audio": true
  },
  "timber_weights": [
    {
      "weight": 123,
      "voice_id": "<string>"
    }
  ],
  "subtitle_enable": true,
  "continuous_sound": true,
  "pronunciation_dict": {
    "tone": [
      {}
    ]
  }
}
'
{
  "data": {},
  "trace_id": "<string>",
  "base_resp": {},
  "extra_info": {}
}
Convert text to speech, with support for multiple voices, emotion control, speech rate adjustment, and more. The text length must be less than 10000 characters. If the text length is greater than 3000 characters, streaming output is recommended.

Request Headers

Content-Type
string
required
Enum value: application/json
Authorization
string
required
Bearer authentication format: Bearer {{API Key}}.

Request Body

text
string
required
The text to synthesize into speech. The length must be less than 10000 characters. If the text length is greater than 3000 characters, streaming output is recommended. Supports paragraph switching (line breaks), pause control (&lt;#x#&gt; markers), and filler/sound effect tags (such as (laughs), (coughs), etc.; supported only by speech-2.8-hd/turbo)
stream
boolean
default:false
Controls whether to use streaming output. The default is false, meaning streaming is not enabled
voice_modify
object
audio_setting
object
output_format
string
default:"hex"
Parameter that controls the form of the output result. Available values are url and hex. The default value is hex. This parameter only takes effect in non-streaming scenarios; streaming scenarios only support returning in hex form. The returned url is valid for 24 hoursAvailable values: url, hex
voice_setting
object
aigc_watermark
boolean
default:false
Controls whether to add an audio rhythm identifier at the end of the synthesized audio. The default value is false. This parameter only takes effect for non-streaming synthesis
language_boost
string
Whether to enhance recognition capability for specified low-resource languages and dialects. The default value is null. It can be set to auto to let the model determine automaticallyAvailable values: Chinese, Chinese,Yue, English, Arabic, Russian, Spanish, French, Portuguese, German, Turkish, Dutch, Ukrainian, Vietnamese, Indonesian, Japanese, Italian, Korean, Thai, Polish, Romanian, Greek, Czech, Finnish, Hindi, Bulgarian, Danish, Hebrew, Malay, Persian, Slovak, Swedish, Croatian, Filipino, Hungarian, Norwegian, Slovenian, Catalan, Nynorsk, Tamil, Afrikaans, auto
stream_options
object
timber_weights
array
Mixed voice settings. Supports mixing up to 4 voices
subtitle_enable
boolean
default:false
Controls whether to enable the subtitle service. The default value is false. This parameter is only valid in non-streaming output scenarios, and only for the speech-2.6-hd, speech-2.6-turbo, speech-02-turbo, speech-02-hd, speech-01-turbo, and speech-01-hd models
continuous_sound
boolean
default:false
Enable this parameter to make transitions between clauses more natural. Only the speech-2.8-hd and speech-2.8-turbo models are supported
pronunciation_dict
object

Response Information

data
object
The returned synthesis data object. It may be null, so a non-null check is required
trace_id
string
The id of this session, used to help locate issues during consultation/feedback
base_resp
object
The status code and details of this request
extra_info
object
Additional information about the audio