GLM Audio-to-Text
Audio
GLM Audio-to-Text
POST
GLM Audio-to-Text
Use the GLM-ASR-2512 model to transcribe audio files into text, with support for multilingual transcription.
Request Headers
Enum value:
application/jsonBearer authentication format: Bearer {{API Key}}.
Request Body
The URL or Base64-encoded string of the audio file to transcribe. Supported audio file formats: .wav / .mp3. Limits: file size ≤ 25 MB, audio duration ≤ 30 seconds.
In long-text scenarios, you can provide previous transcription results as context. Recommended to be fewer than 8,000 characters.
A hotword list used to improve recognition accuracy for domain-specific vocabulary. Format example: [“person name”,“place name”]. Recommended to include no more than 100 items.Array length: 0 - 100
Response Information
The complete transcribed content of the audio