This API supports asynchronous text-to-speech generation. A single text generation request supports up to 1 million characters for transmission, and the complete generated audio result can be retrieved asynchronously. It supports 100+ system voices and custom cloned voices, as well as independent adjustment of intonation, speed, volume, bitrate, sample rate, and output format.After submitting a long-text speech synthesis request, note that the returned url is valid for 24 hours from the time the url is returned. Please download the information in time.
This is suitable for speech generation of long texts such as entire books, and task queueing may take a relatively long time. For short sentence generation, voice chat, online social scenarios, and similar use cases, we recommend using synchronous speech synthesis.
This parameter supports English text normalization, which can improve performance in digit-reading scenarios, but will slightly increase latency. If not provided, the default value is false.
Range [32000, 64000, 128000, 256000]The bitrate of the generated voice. Optional, default value is 128000. This parameter only takes effect for audio in mp3 format.
Replace text, symbols, and corresponding pronunciations that require special annotation.Replace pronunciation (adjust tone/replace with pronunciations of other characters), in the following format:["燕少飞/(yan4)(shao3)(fei1)","达菲/(da2)(fei1)","omg/oh my god"]Tones are represented by numbers: first tone (yinping) is 1, second tone (yangping) is 2, third tone (shangsheng) is 3, fourth tone (qusheng) is 4, and neutral tone is 5.
Enhances recognition capability for specified low-resource languages and dialects. After setting this, voice performance can be improved in the specified low-resource language/dialect scenarios. If the low-resource language type is unclear, you can select “auto”, and the model will determine the low-resource language type automatically. The following values are supported:'Chinese', 'Chinese,Yue', 'English', 'Arabic', 'Russian', 'Spanish', 'French', 'Portuguese', 'German', 'Turkish', 'Dutch', 'Ukrainian', 'Vietnamese', 'Indonesian', 'Japanese', 'Italian', 'Korean', 'Thai', 'Polish', 'Romanian', 'Greek', 'Czech', 'Finnish', 'Hindi', 'Bulgarian', 'Danish', 'Hebrew', 'Malay', 'Persian', 'Slovak', 'Swedish', 'Croatian', 'Filipino', 'Hungarian', 'Norwegian', 'Slovenian', 'Catalan', 'Nynorsk', 'Tamil', 'Afrikaans', 'auto'