This API supports asynchronous text-to-speech generation. A single request accepts up to 1 million characters of text, and the complete generated audio can be retrieved asynchronously. It supports 100+ system voices and custom cloned voices; intonation, speed, volume, bitrate, sample rate, and output format can also be adjusted as needed. After submitting a long-text speech synthesis request, note that the returned URL is valid for 24 hours from the time it is returned. Make sure to download the result before the URL expires.
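The two limits stated above (1 million characters per request, 24-hour URL validity) can be checked on the client side before submitting and before downloading. The following is a minimal sketch, not part of the API: the function names are hypothetical, and it assumes that characters are counted the way Python's `len()` counts them, which the service may not match exactly.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

MAX_CHARS = 1_000_000               # per-request limit stated above
URL_VALIDITY = timedelta(hours=24)  # validity window stated above


def check_text_length(text: str) -> None:
    """Raise if the text exceeds the per-request limit.

    Assumption: the service counts characters like len() does.
    """
    if len(text) > MAX_CHARS:
        raise ValueError(f"text has {len(text)} characters; limit is {MAX_CHARS}")


def url_expires_at(returned_at: datetime) -> datetime:
    """Return the moment the download URL stops being valid."""
    return returned_at + URL_VALIDITY


def is_url_still_valid(returned_at: datetime, now: Optional[datetime] = None) -> bool:
    """Return True while the 24-hour window is still open."""
    if now is None:
        now = datetime.now(timezone.utc)
    return now < url_expires_at(returned_at)
```

Recording the time at which the URL was returned, rather than the time the task was submitted, matters here because queuing can delay the result considerably.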
This API is suitable for speech generation from long texts such as entire books. Tasks are queued, so results may take a relatively long time to become available. For scenarios such as short sentence generation, voice chat, and online social interaction, we recommend using synchronous speech synthesis.
This parameter supports English text normalization, which can improve performance in number reading scenarios, but will slightly increase latency. If not provided, the default value is false.
Range: [32000, 64000, 128000, 256000]. The bitrate of the generated audio. Optional. The default value is 128000. This parameter only takes effect for audio in mp3 format.
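The bitrate rules above (four allowed values, a default of 128000, effect limited to mp3) can be captured in a small client-side helper. This is an illustrative sketch only: `resolve_bitrate` is a hypothetical name, and returning `None` for non-mp3 formats is a design choice of this example, not documented API behavior.

```python
from typing import Optional

ALLOWED_BITRATES = (32000, 64000, 128000, 256000)  # range stated above
DEFAULT_BITRATE = 128000                           # default stated above


def resolve_bitrate(audio_format: str, bitrate: Optional[int] = None) -> Optional[int]:
    """Return the bitrate to send, or None when the setting has no effect.

    The source only says the bitrate takes effect for mp3; skipping it
    for other formats is this sketch's choice.
    """
    if audio_format.lower() != "mp3":
        return None
    if bitrate is None:
        return DEFAULT_BITRATE
    if bitrate not in ALLOWED_BITRATES:
        raise ValueError(f"bitrate must be one of {ALLOWED_BITRATES}, got {bitrate}")
    return bitrate
```

For example, `resolve_bitrate("mp3")` falls back to the default, while an unlisted value such as 96000 is rejected before any request is made.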
Replaces the pronunciation of text or symbols that require special annotation. Each entry either adjusts tones or substitutes the pronunciation of other characters, in the following format: ["燕少飞/(yan4)(shao3)(fei1)","达菲/(da2)(fei1)","omg/oh my god"]. Tones are represented by numbers: first tone (yinping) is 1, second tone (yangping) is 2, third tone (shangsheng) is 3, fourth tone (qusheng) is 4, and neutral tone is 5.
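Entries in the format shown above can be assembled programmatically instead of by hand, which helps avoid malformed tone numbers. The sketch below is a hedged illustration: both helper names are hypothetical, and rejecting a `/` inside the original text is an assumption based on `/` acting as the separator in the examples.

```python
from typing import List, Tuple

VALID_TONES = (1, 2, 3, 4, 5)  # 1-4 are the four tones, 5 is the neutral tone


def _check_source_text(text: str) -> None:
    # Assumption: "/" separates text from pronunciation, so it cannot
    # appear inside the text being replaced.
    if not text or "/" in text:
        raise ValueError(f"invalid source text: {text!r}")


def pinyin_entry(text: str, syllables: List[Tuple[str, int]]) -> str:
    """Build an entry such as '达菲/(da2)(fei1)' from (pinyin, tone) pairs."""
    _check_source_text(text)
    for pinyin, tone in syllables:
        if tone not in VALID_TONES:
            raise ValueError(f"tone for {pinyin!r} must be 1-5, got {tone}")
    return f"{text}/" + "".join(f"({p}{t})" for p, t in syllables)


def text_entry(text: str, replacement: str) -> str:
    """Build an entry such as 'omg/oh my god'."""
    _check_source_text(text)
    return f"{text}/{replacement}"


entries = [
    pinyin_entry("燕少飞", [("yan", 4), ("shao", 3), ("fei", 1)]),
    pinyin_entry("达菲", [("da", 2), ("fei", 1)]),
    text_entry("omg", "oh my god"),
]
```

The resulting `entries` list reproduces the three example strings from the format description.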
Enhances recognition capabilities for specified minority languages and dialects. After setting this parameter, speech performance can be improved in the specified minority language/dialect scenario. If the minority language type is unclear, you can select "auto", and the model will determine the minority language type on its own. The following values are supported: 'Chinese', 'Chinese,Yue', 'English', 'Arabic', 'Russian', 'Spanish', 'French', 'Portuguese', 'German', 'Turkish', 'Dutch', 'Ukrainian', 'Vietnamese', 'Indonesian', 'Japanese', 'Italian', 'Korean', 'Thai', 'Polish', 'Romanian', 'Greek', 'Czech', 'Finnish', 'Hindi', 'Bulgarian', 'Danish', 'Hebrew', 'Malay', 'Persian', 'Slovak', 'Swedish', 'Croatian', 'Filipino', 'Hungarian', 'Norwegian', 'Slovenian', 'Catalan', 'Nynorsk', 'Tamil', 'Afrikaans', 'auto'
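Because the accepted values form a fixed list, a client can validate the setting before submitting a task. The sketch below is illustrative: `validate_language_value` is a hypothetical helper, and treating the values as case-sensitive is an assumption, since the source does not say whether the service normalizes case.

```python
import difflib

# Values copied from the supported list above.
SUPPORTED_LANGUAGE_VALUES = (
    "Chinese", "Chinese,Yue", "English", "Arabic", "Russian", "Spanish",
    "French", "Portuguese", "German", "Turkish", "Dutch", "Ukrainian",
    "Vietnamese", "Indonesian", "Japanese", "Italian", "Korean", "Thai",
    "Polish", "Romanian", "Greek", "Czech", "Finnish", "Hindi", "Bulgarian",
    "Danish", "Hebrew", "Malay", "Persian", "Slovak", "Swedish", "Croatian",
    "Filipino", "Hungarian", "Norwegian", "Slovenian", "Catalan", "Nynorsk",
    "Tamil", "Afrikaans", "auto",
)


def validate_language_value(value: str) -> str:
    """Return the value unchanged if supported, else raise with suggestions.

    Assumption: matching is case-sensitive, exactly as listed.
    """
    if value in SUPPORTED_LANGUAGE_VALUES:
        return value
    hints = difflib.get_close_matches(value, SUPPORTED_LANGUAGE_VALUES, n=3)
    suffix = f" Did you mean: {', '.join(hints)}?" if hints else ""
    raise ValueError(f"Unsupported language value {value!r}.{suffix}")
```

Note that 'Chinese,Yue' is a single value containing a comma, so it should be passed as one string rather than split into two.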