Text to Video AI

Fields marked Required must be set before you generate. Optional settings live under Advanced.

Required

Input video file.

Text prompt describing the desired output video style. Be descriptive.

CRF value for output video quality (0-51). Lower values = better quality.

Default: 19 (range 0-51)
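
CRF (constant rate factor) is the quality setting used by encoders such as x264. The sketch below shows how a CRF of 19 might be passed to ffmpeg, assuming an H.264 encode through libx264. This page does not say which encoder the tool actually uses, and the function name and file paths are hypothetical.

```python
def build_ffmpeg_args(src: str, dst: str, crf: int = 19) -> list[str]:
    """Build an ffmpeg command line that encodes with libx264 at the given CRF.

    Assumption: the output is H.264 via libx264; the tool's real encoder is not documented here.
    """
    if not 0 <= crf <= 51:
        raise ValueError("crf must be between 0 and 51")
    # -crf sets libx264 quality: lower means better quality and a larger file.
    return ["ffmpeg", "-i", src, "-c:v", "libx264", "-crf", str(crf), dst]
```

The returned list can be handed to `subprocess.run` on a machine where ffmpeg is installed.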

Set a seed for reproducibility. Random by default.
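
As a rough illustration of why a fixed seed gives reproducible results, the sketch below uses Python's `random` module as a stand-in for the model's noise generator. The helper name `resolve_seed` and the 32-bit seed range are assumptions, not part of this tool.

```python
import random

def resolve_seed(seed=None):
    """Return the given seed, or draw a random one when none is set."""
    if seed is None:
        # Assumption: seeds are 32-bit unsigned integers.
        seed = random.SystemRandom().randrange(2**32)
    return seed

seed = resolve_seed(1234)
# Two generators built from the same seed produce the same sequence.
rng_a = random.Random(seed)
rng_b = random.Random(seed)
same = [rng_a.random() for _ in range(3)] == [rng_b.random() for _ in range(3)]
```

Recording the resolved seed alongside the other settings is what makes a random run repeatable later.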

Number of sampling (denoising) steps.

Default: 30 (range 1-150)

Output video width (divisible by 16 for best performance).

Default: 768 (range 64-2048)

Output video height (divisible by 16 for best performance).

Default: 768 (range 64-2048)
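
Since width and height work best when divisible by 16, a requested size can be snapped before submitting. This is a minimal sketch, assuming bounds of 64 and 2048 and rounding to the nearest multiple. The function name is hypothetical, and the tool may round differently or not at all.

```python
def snap_dimension(value: int, multiple: int = 16, lo: int = 64, hi: int = 2048) -> int:
    """Clamp value to [lo, hi], then round to the nearest multiple (ties round up)."""
    clamped = max(lo, min(hi, value))
    # Integer arithmetic avoids float rounding surprises.
    snapped = (clamped + multiple // 2) // multiple * multiple
    # Clamp again in case rounding pushed the value past the upper bound.
    return max(lo, min(hi, snapped))
```

For example, a requested width of 770 becomes 768, and 1000 becomes 1008.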

Flow shift for temporal consistency. Adjust to tweak video smoothness.

Default: 9 (range 1-20)

Force a new frame rate on the input video. 0 means no change.

Default: 0 (range 0-240)

Force resize method. 'Disabled' keeps the original size. Any other method resizes to custom_width and custom_height.

Frame rate of the output video.

Default: 24 (range 1-120)

Custom width if force_size is not 'Disabled'.

Default: 512 (range 64-2048)

Custom height if force_size is not 'Disabled'.

Default: 512 (range 64-2048)

Max frames to load from input video.

Embedded guidance scale. Higher values follow the prompt more strictly.

Default: 6 (range 1-20)

Keep the aspect ratio when resizing. If true, the dimensions are adjusted proportionally.
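
One plausible reading of proportional adjustment is to fit the source frame inside the custom width and height without distortion. The sketch below assumes that interpretation. The function name `fit_within` is hypothetical, and the tool's actual resize rule is not stated on this page.

```python
def fit_within(src_w: int, src_h: int, max_w: int, max_h: int) -> tuple[int, int]:
    """Scale (src_w, src_h) to fit inside (max_w, max_h), preserving aspect ratio."""
    if max_w * src_h <= max_h * src_w:
        # Width is the limiting side: pin it and scale the height.
        return max_w, max(1, round(src_h * max_w / src_w))
    # Height is the limiting side: pin it and scale the width.
    return max(1, round(src_w * max_h / src_h)), max_h
```

Under this reading, a 1920x1080 input with a 512x512 target comes out as 512x288.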

Denoise strength (0.0 to 1.0). Higher = more deviation from input content.

Default: 0.85 (range 0-1)
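
In many image-to-image diffusion pipelines, the denoise strength decides how many of the sampling steps are actually run on the input. Whether this tool follows that convention is an assumption. The sketch below only shows the arithmetic, and the function name is hypothetical.

```python
def effective_steps(num_steps: int = 30, denoise: float = 0.85) -> int:
    """Number of denoising steps applied under the common strength convention."""
    if not 0.0 <= denoise <= 1.0:
        raise ValueError("denoise must be between 0.0 and 1.0")
    # A strength of 1.0 runs every step; lower values skip the earliest, noisiest steps.
    return min(int(num_steps * denoise), num_steps)
```

With the defaults shown here (30 steps, strength 0.85), this convention would apply 25 steps.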

Use every nth frame (1 = every frame, 2 = every second frame, etc.).

Number of initial frames to skip from the input video.
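
The three frame-loading options (skip initial frames, use every nth frame, cap the number loaded) can be sketched together as below. The parameter names, the order in which the options are applied, and the use of 0 to mean "no cap" are all assumptions, since this page does not specify them.

```python
from itertools import islice

def select_frames(frames, skip_first=0, every_nth=1, max_frames=0):
    """Pick frames: skip the first few, keep every nth, then cap the count."""
    if every_nth < 1:
        raise ValueError("every_nth must be at least 1")
    # Start at skip_first and step by every_nth.
    selected = islice(frames, skip_first, None, every_nth)
    if max_frames > 0:
        # Assumption: 0 means no limit on the number of frames.
        selected = islice(selected, max_frames)
    return list(selected)
```

For a 10-frame clip with 2 frames skipped, every third frame kept, and a cap of 2, this selects frames 2 and 5.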
