Kling v3.0 Standard Image-to-Video
Video
Kling v3.0 Standard Image-to-Video
POST
Kling v3.0 Standard Image-to-Video
The Kling v3.0 Standard image-to-video tool can convert static images into dynamic videos, generating natural motion and smooth scene dynamics while maintaining subject consistency. It supports synchronized audio generation and multi-part prompt combinations.
Request Headers
Enum value:
application/jsonBearer authentication format: Bearer {{API Key}}.
Request Body
The first-frame image for the video; supports
.jpg, .jpeg, and .png.
The image file size must not exceed 10MB; both width and height must be >= 300px; the aspect ratio must be between 1:2.5 and 2.5:1.Whether to generate audio while generating the video.
The positive prompt text for generating the video, describing scene motion, camera movement, actions, audio style, atmosphere, and sound effects; must not exceed 2500 characters.
The duration of the generated video in seconds, ranging from 3 to 15.Allowed values:
3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15Controls the flexibility of video generation. Lower values produce more natural motion; higher values make the generated content adhere more closely to the prompt.Value range: [0, 1]
The last-frame image URL, used to guide the transition between the starting frame and the ending frame. The format constraints are the same as for image. Cannot be used together with multi_prompt.
An array of multi-part prompts, used for composing multi-shot videos. Each item contains a prompt and the duration of that segment. Cannot be used together with end_image.
The negative prompt, specifying elements to avoid in the visuals and audio; must not exceed 2500 characters.
Response
Use task_id to request the Get Task Result API to retrieve the generated output.