GLM Text-to-Speech
Audio
GLM Text-to-Speech
POST
GLM Text-to-Speech
Use GLM-TTS to convert text into natural speech, with support for multiple voices, emotion control, and intonation adjustment.
Request Headers
Enum value:
application/jsonBearer authentication format: Bearer {{API Key}}.
Request Body
The text to convert to speechLength limit: 0 - 1024
Speech rate, default is 1.0, value range [0.5, 2]Value range: [0.5, 2]
The voice used when generating audio. Supports two types: system voices and cloned voices. System voices include: tongtong (Tongtong, default voice), chuichui (Chuichui), xiaochen (Xiaochen), jam (Dongdong Animal Circle jam voice), kazi (Dongdong Animal Circle kazi voice), douji (Dongdong Animal Circle douji voice), luodo (Dongdong Animal Circle luodo voice)
Volume, default is 1.0, value range (0, 10]Value range: [0, 10]
Audio output format. By default, a file in pcm format is returned.Available values:
wav, pcmControls whether a watermark is added to AI-generated audio. true: Explicit AI-generated watermarking and implicit digital watermarking are enabled by default, in compliance with policy requirements. false: Disables all watermarks, and only takes effect for users who have completed the watermark removal process.
Response Information
The business processing succeeded. The recommended sample rate is 24000. Format:binary