POST /v3/async/vidu-q2-pro-startend2video
VIDU Q2 Pro Start and End Frames
curl --request POST \
  --url https://api.highwayapi.ai/v3/async/vidu-q2-pro-startend2video \
  --header 'Authorization: Bearer <API Key>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "bgm": true,
  "seed": 123,
  "images": [
    "<string>",
    "<string>"
  ],
  "is_rec": true,
  "prompt": "<string>",
  "wm_url": "<string>",
  "payload": "<string>",
  "duration": 5,
  "off_peak": true,
  "meta_data": "<string>",
  "watermark": true,
  "resolution": "<string>",
  "wm_position": 3,
  "movement_amplitude": "<string>"
}
'
{
  "task_id": "<string>",
  "provider_request_id": "<string>"
}
Generate a coherent video from the first and last frame images. Supports three resolutions: 540p, 720p, and 1080p.
This API is asynchronous: the response contains only the task_id of the submitted task. Pass this task_id to the Get Task Result API to retrieve the generated result.
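
As a minimal sketch of the submission step, the snippet below builds the POST request with Python's standard library without sending it. The helper names `build_submit_request` and `submit`, the placeholder API key, and the example image URLs are assumptions for illustration; only the endpoint URL, the headers, and the field names come from this page. It also assumes each entry in images is a string (a URL or Base64 data URI).

```python
import json
import urllib.request

ENDPOINT = "https://api.highwayapi.ai/v3/async/vidu-q2-pro-startend2video"


def build_submit_request(api_key, first_frame, last_frame, **options):
    """Build (but do not send) the POST request for a start/end-frame task.

    Hypothetical helper: first_frame and last_frame are image URLs or
    Base64 data URIs; options holds optional body fields such as
    prompt, duration, or resolution.
    """
    body = {"images": [first_frame, last_frame], **options}
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def submit(request):
    """Send the request and return the task_id from the JSON response."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)["task_id"]


req = build_submit_request(
    "YOUR_API_KEY",  # placeholder, not a real key
    "https://example.com/first.png",  # placeholder image URLs
    "https://example.com/last.png",
    prompt="A flower opening at sunrise",
    duration=5,
)
```

Calling `submit(req)` would send the request; the returned task_id is then passed to the Get Task Result API, which is documented separately.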

Request Headers

Content-Type
string
required
Enum value: application/json
Authorization
string
required
Bearer authentication format: Bearer {{API Key}}.

Request Body

bgm
boolean
default:false
Whether to add background music to the generated video. true: The system will automatically select suitable music from the preset BGM library and add it; false: Do not add BGM. BGM has no duration limit; the system automatically adapts it to the video duration.
seed
integer
Random seed. If omitted or set to 0, a random seed is used; otherwise the specified seed is used.
images
array
required
Image array. The first image is used as the first frame and the second image as the last frame; the model generates the video from these two images. Exactly two images must be provided.
Note 1: The two images must have similar resolutions: the resolution of the first-frame image divided by the resolution of the last-frame image must be between 0.8 and 1.25.
Note 2: Each image can be passed as a Base64-encoded string or as an image URL (the URL must be accessible).
Note 3: Supported image formats: png, jpeg, jpg, and webp.
Note 4: Each image must not exceed 50 MB.
Note 5: Base64-encoded images must include the appropriate content type prefix, for example: data:image/png;base64,
Array length: 2 - 2
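
The image notes above can be checked client-side before submitting. The following is a sketch only: the helper names are hypothetical, "resolution" is assumed to mean total pixel count (width times height), and the size limit is interpreted as 50 MiB. The page does not confirm either interpretation.

```python
import base64

ALLOWED_MIME = {"image/png", "image/jpeg", "image/webp"}
MAX_IMAGE_BYTES = 50 * 1024 * 1024  # size limit from the page, read here as 50 MiB


def to_data_uri(image_bytes, mime_type):
    """Encode raw image bytes as a Base64 data URI with its content type."""
    if mime_type not in ALLOWED_MIME:
        raise ValueError(f"unsupported image type: {mime_type}")
    if len(image_bytes) > MAX_IMAGE_BYTES:
        raise ValueError("image exceeds the size limit")
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"


def resolutions_compatible(first_size, last_size):
    """Check the 0.8 to 1.25 ratio rule for two (width, height) pairs.

    Assumption: "resolution" is taken to mean total pixel count.
    """
    first_pixels = first_size[0] * first_size[1]
    last_pixels = last_size[0] * last_size[1]
    return 0.8 <= first_pixels / last_pixels <= 1.25
```

For example, two 1280x720 frames pass the check, while a 1920x1080 first frame paired with a 1280x720 last frame (a ratio of 2.25) does not.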
is_rec
boolean
default:false
Whether to use a recommended prompt. true: Yes. The system will automatically recommend a prompt and use the prompt content to generate the video (each task consumes an additional 10 credits); false: No. Generate the video based on the input prompt.
prompt
string
Text prompt: a textual description used to generate the video. Note 1: The prompt must not exceed 2000 characters. Note 2: If is_rec is set to true, the prompt entered in this parameter is ignored. Length limit: 0 - 2000
wm_url
string
Watermark content, given as an image URL. If not provided, the default watermark is used: Content generated by AI.
payload
string
Pass-through parameter. It is not processed in any way and is used only for data transmission. Maximum 1048576 characters. Length limit: 0 - 1048576
duration
integer
default:5
Video duration in seconds; supports 1-8 seconds. Optional values: 1, 2, 3, 4, 5, 6, 7, 8
off_peak
boolean
default:false
Off-peak mode. true: Generate the video during off-peak hours (consumes fewer credits, generated within 48 hours); false: Generate the video immediately. Note: For tasks submitted in off-peak mode, tasks that cannot be completed will be automatically canceled and credits will be refunded; you can also manually cancel off-peak tasks.
meta_data
string
Metadata identifier: a pass-through field containing a JSON-format string. You can customize the format or use the example format. When this parameter is empty, the metadata identifier generated by Vidu is used by default.
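
Because meta_data is a string field that carries JSON, the metadata has to be serialized to a string rather than nested as an object in the request body. A small sketch, assuming hypothetical metadata keys (this page does not show the example format) and placeholder image URLs:

```python
import json

# Hypothetical metadata keys; the page does not specify the example format.
metadata = {"label": "ai-generated", "content_producer": "example-app"}

body = {
    "images": ["https://example.com/first.png", "https://example.com/last.png"],
    # meta_data is a string field, so the dict is serialized before nesting.
    "meta_data": json.dumps(metadata),
}
```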
watermark
boolean
default:false
Whether to add a watermark. No watermark is added by default. You can obtain video content with a watermark via the watermarked_url parameter.
resolution
string
default:"720p"
Resolution of the output video. The default value is 720p. Optional values: 540p, 720p, 1080p
wm_position
integer
default:3
Watermark position, indicating where the watermark appears on the image. 1: top-left; 2: top-right; 3: bottom-right (default); 4: bottom-left. Optional values: 1, 2, 3, 4
movement_amplitude
string
default:"auto"
Movement amplitude, controls the movement intensity of objects in the video. Optional values: auto, small, medium, large
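
The constraints listed for the request body can be collected into a client-side check before the request is sent. This is a sketch under the limits and defaults stated above; `validate_body` is a hypothetical helper, not part of the API, and the server may apply further checks.

```python
ALLOWED_RESOLUTIONS = {"540p", "720p", "1080p"}
ALLOWED_AMPLITUDES = {"auto", "small", "medium", "large"}


def validate_body(body):
    """Return a list of problems found in a request body dict (empty if none)."""
    problems = []
    images = body.get("images")
    if not isinstance(images, list) or len(images) != 2:
        problems.append("images must contain exactly 2 entries")
    if len(body.get("prompt", "")) > 2000:
        problems.append("prompt must not exceed 2000 characters")
    if len(body.get("payload", "")) > 1048576:
        problems.append("payload must not exceed 1048576 characters")
    # Missing optional fields fall back to the documented defaults.
    if body.get("duration", 5) not in range(1, 9):
        problems.append("duration must be an integer from 1 to 8")
    if body.get("resolution", "720p") not in ALLOWED_RESOLUTIONS:
        problems.append("resolution must be 540p, 720p, or 1080p")
    if body.get("wm_position", 3) not in (1, 2, 3, 4):
        problems.append("wm_position must be 1, 2, 3, or 4")
    if body.get("movement_amplitude", "auto") not in ALLOWED_AMPLITUDES:
        problems.append("movement_amplitude must be auto, small, medium, or large")
    return problems
```

A body with only two image entries passes, while one with a single image and a duration of 123 reports two problems.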

Response Information

task_id
string
required
Use task_id to request the Get Task Result API to retrieve the generated output.
provider_request_id
string
Provider request ID (optional)
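
Since task_id is required and provider_request_id is optional, a response handler can treat the two fields differently. A minimal sketch, assuming the response body is the JSON object described above; `parse_submit_response` is a hypothetical helper name.

```python
import json


def parse_submit_response(raw):
    """Extract (task_id, provider_request_id) from a JSON response body."""
    data = json.loads(raw)
    task_id = data.get("task_id")
    if not task_id:
        raise ValueError("response is missing the required task_id")
    # provider_request_id is optional, so it may be None.
    return task_id, data.get("provider_request_id")
```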