Wanxiang Wan 2.7 Image-to-Video

curl --request POST \
  --url https://api.highwayapi.ai/v3/async/wan2.7-i2v \
  --header 'Authorization: <authorization>' \
  --header 'Content-Type: <content-type>' \
  --data '
{
  "seed": 123,
  "prompt": "<string>",
  "duration": 123,
  "image_url": "<string>",
  "watermark": true,
  "resolution": "<string>",
  "prompt_extend": true,
  "first_clip_url": "<string>",
  "last_frame_url": "<string>",
  "negative_prompt": "<string>",
  "driving_audio_url": "<string>"
}
'

{
  "task_id": "<string>"
}

POST /v3/async/wan2.7-i2v

The Wanxiang Wan 2.7 image-to-video model accepts multimodal inputs (text, image, audio, video) and supports three tasks: first-frame video generation, first-and-last-frame video generation, and video continuation. It outputs 720P or 1080P video with a duration of 2 to 15 seconds and is billed by the second. Output includes audio by default.

This is an asynchronous API: the response contains only the task_id of the asynchronous task. Pass this task_id to the Get Task Result API to retrieve the generated result.
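The submit step can be sketched in Python using only the standard library. This is a minimal illustration, not an official client: the function names `build_request` and `submit_task` are hypothetical, and the sketch assumes the response body is the JSON object from the sample response, with a `task_id` field. The follow-up call to the Get Task Result API is left out because its endpoint is not described on this page.

```python
import json
import urllib.request

API_URL = "https://api.highwayapi.ai/v3/async/wan2.7-i2v"


def build_request(api_key: str, payload: dict) -> urllib.request.Request:
    """Build the POST request without sending it (hypothetical helper)."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            # Bearer format as described under Request Headers.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


def submit_task(api_key: str, payload: dict, timeout: float = 30.0) -> str:
    """Send the request and return the task_id from the JSON response."""
    request = build_request(api_key, payload)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        result = json.loads(response.read().decode("utf-8"))
    # Assumption: a successful response is {"task_id": "..."}.
    return result["task_id"]
```

Keeping request construction separate from sending makes the headers and body easy to inspect before any network call is made.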

Request Headers

Content-Type

string

required

Enum value: application/json

Authorization

string

required

Bearer authentication format: Bearer {{API Key}}.

Request Body

seed

integer

Random seed used to improve the reproducibility of generated results. Value range: [0, 2147483647].

prompt

string

Text prompt describing the elements and visual characteristics expected in the generated video. Supports Chinese and English. Length limit: 0 to 5000 characters.

duration

integer

default:5

Duration of the generated video in seconds; billing is by the second. Must be an integer. Value range: [2, 15].

image_url

string

required

First-frame image URL. Supported formats: JPEG, JPG, PNG (transparency channels are not supported), BMP, WEBP. Width and height: [240, 8000] pixels; aspect ratio: 1:8 to 8:1; file size must not exceed 20 MB. Provide either image_url or first_clip_url; at least one is required.
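The dimension, aspect-ratio, and size limits for the first-frame image can be checked locally before uploading. The following is a minimal sketch: `check_first_frame` is a hypothetical helper, the caller is assumed to already know the image's pixel dimensions and byte size, and "20 MB" is read as 20 × 1024 × 1024 bytes, which the page does not confirm. File format and transparency are not checked here.

```python
MAX_IMAGE_BYTES = 20 * 1024 * 1024  # assumption: "20 MB" interpreted as 20 MiB


def check_first_frame(width: int, height: int, size_bytes: int) -> list[str]:
    """Return a list of constraint violations; an empty list means it passes."""
    problems = []
    for name, value in (("width", width), ("height", height)):
        if not 240 <= value <= 8000:
            problems.append(f"{name} {value}px is outside [240, 8000]")
    # Aspect ratio must lie between 1:8 and 8:1. Integer comparison avoids
    # floating-point edge cases exactly at the boundary.
    if width > 0 and height > 0 and (width > 8 * height or height > 8 * width):
        problems.append(f"aspect ratio {width}:{height} is outside 1:8 to 8:1")
    if size_bytes > MAX_IMAGE_BYTES:
        problems.append("file is larger than 20 MB")
    return problems
```

Returning every violation at once, rather than raising on the first, lets a caller report all problems with an image in one pass.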

watermark

boolean

default:false

Whether to add a watermark identifier. The watermark is located in the lower-right corner of the video.

resolution

string

default:"1080P"

Output video resolution tier, which affects cost. The video's aspect ratio matches that of the input media. Options: 720P, 1080P.

prompt_extend

boolean

default:true

Whether to enable intelligent prompt rewriting. When enabled, a large model rewrites the input prompt, which can significantly improve generation quality for short prompts but increases processing time.

first_clip_url

string

URL of the first video clip, used for video continuation. The model continues generating from the content of this clip. Supported formats: mp4, mov; duration: 2 to 10 seconds; width and height: [240, 4096] pixels; aspect ratio: 1:8 to 8:1; file size must not exceed 100 MB. Provide either first_clip_url or image_url.

last_frame_url

string

Last-frame image URL. Used together with the first-frame image (image_url) to generate a first-and-last-frame video. Format restrictions are the same as for image_url.
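The way the three URL fields select among the model's tasks can be sketched as a small function. This is an illustration under stated assumptions: `infer_task` is a hypothetical name, the sketch treats image_url and first_clip_url as mutually exclusive (the page says to choose one), and it does not model what happens when last_frame_url is sent together with first_clip_url, which the page does not specify.

```python
def infer_task(payload: dict) -> str:
    """Guess which of the three tasks a request body selects."""
    has_image = bool(payload.get("image_url"))
    has_clip = bool(payload.get("first_clip_url"))
    has_last = bool(payload.get("last_frame_url"))

    # Assumption: exactly one of image_url / first_clip_url is expected.
    if has_image == has_clip:
        raise ValueError("provide exactly one of image_url or first_clip_url")
    if has_clip:
        # A clip as the starting point means video continuation.
        return "video continuation"
    if has_last:
        # First frame plus last frame.
        return "first-and-last-frame"
    return "first-frame"
```

For example, a body containing only image_url maps to first-frame generation, while adding last_frame_url switches it to first-and-last-frame generation.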

negative_prompt

string

Negative prompt describing content that should not appear in the video. Supports Chinese and English. Length limit: 0 to 500 characters.

driving_audio_url

string

Driving audio URL. When provided, the model uses this audio as the driving source for the video (for example, lip sync or beat-matched motion). If omitted, the model automatically generates matching background music or sound effects. Supported formats: wav, mp3; duration: 2 to 30 seconds; file size must not exceed 15 MB.
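The documented ranges for the scalar request-body fields can be validated client-side before a request is submitted. The sketch below is illustrative only: `validate_options` is a hypothetical helper, missing fields fall back to the documented defaults, and string length is measured with Python's `len` (Unicode code points), which may not match how the service counts characters.

```python
def validate_options(payload: dict) -> list[str]:
    """Check seed, duration, resolution, and prompt lengths against the
    documented limits. Returns a list of error messages (empty if valid)."""
    errors = []

    seed = payload.get("seed")
    if seed is not None and not 0 <= seed <= 2147483647:
        errors.append("seed must be in [0, 2147483647]")

    duration = payload.get("duration", 5)  # documented default: 5
    if not isinstance(duration, int) or not 2 <= duration <= 15:
        errors.append("duration must be an integer in [2, 15]")

    if payload.get("resolution", "1080P") not in ("720P", "1080P"):
        errors.append("resolution must be 720P or 1080P")

    if len(payload.get("prompt", "")) > 5000:
        errors.append("prompt exceeds 5000 characters")

    if len(payload.get("negative_prompt", "")) > 500:
        errors.append("negative_prompt exceeds 500 characters")

    return errors
```

The URL fields are deliberately left out, since checking them requires inspecting the referenced media rather than the request body alone.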

Response

task_id

string

Use task_id to request the Get Task Result API to retrieve the generated output.
