Wanxiang Wan 2.7 Text-to-Video
Video
Wanxiang Wan 2.7 Text-to-Video
POST
Wanxiang Wan 2.7 Text-to-Video
Wanxiang Wan 2.7 text-to-video model generates smooth videos from text prompts. It supports audio-driven generation or automatic dubbing, supports 720P and 1080P resolutions, durations from 2 to 15 seconds, and is billed by the second. Output includes audio by default.
Request Headers
Enum value:
application/jsonBearer authentication format: Bearer {{API Key}}.
Request Body
Random seed, used to improve the reproducibility of generation results. Value range: [0, 2147483647].Value range: [0, 2147483647]
Output video resolution (widthheight), which affects cost. 720P tier: 1280720 (16:9), 7201280 (9:16), 960960 (1:1), 1088832 (4:3), 8321088 (3:4). 1080P tier: 19201080 (16:9), 10801920 (9:16), 14401440 (1:1), 16321248 (4:3), 1248*1632 (3:4).Allowed values:
1280*720, 720*1280, 960*960, 1088*832, 832*1088, 1920*1080, 1080*1920, 1440*1440, 1632*1248, 1248*1632Text prompt used to describe the elements and visual characteristics expected in the generated video. Chinese and English are supported, up to 1500 characters. Any excess is automatically truncated.Length limit: 0 - 1500
Generated video duration in seconds, billed by the second. An integer in the range [2, 15].Value range: [2, 15]
Audio file URL. The model will use this audio to drive video generation (such as lip-syncing, motion beat matching, etc.). If not provided, the model automatically generates matching background music or sound effects. Supported formats: wav and mp3. Duration: 3–30 seconds; file size no more than 15 MB. If the audio exceeds the video duration, it will be clipped; if it is shorter, the remaining part will be silent.
Whether to add a watermark identifier. The watermark is located in the lower-right corner of the video.
Whether to enable intelligent prompt rewriting. When enabled, a large model is used to intelligently rewrite the input prompt. This significantly improves generation quality for shorter prompts, but increases processing time.
Negative prompt, used to describe content that you do not want to appear in the video frames. Chinese and English are supported, up to 500 characters.Length limit: 0 - 500
Response
Use task_id to request the Get Task Result API to retrieve the generated output.