AI Video

Generate, edit, and process videos.

These nodes handle everything from text-to-video generation to post-production tasks like subtitling, upscaling, and background removal.

Node

What it does

Key inputs

Generate Video

Create videos from text prompts or images

Text prompt, optional first frame image

Generate Lipsync

Make characters speak by syncing lip movements to audio

Source video/image + audio/text

Edit Video

Modify videos using text instructions

Source video + edit prompt

Upscale Video

Increase video resolution

Source video

Remove Video Background

Remove the background from a video

Source video

Add Video Subtitles

Generate and burn subtitles into a video

Source video

Extract Video Frame

Pull a specific frame from a video as a still image

Source video + frame selection

Merge Videos

Combine multiple video clips into one seamless video

Two or more source videos

Composer

Layer text, images, graphics, and video into a composed output

Multiple media layers + layout settings

Image-to-video is often better than text-to-video: generate a still image first with Generate Image, then animate it with Generate Video. You get much more control over the visual.
Extract Video Frame is useful as a bridge — pull a frame from a video to use as input for image editing or as a reference for generating new content.
Lipsync works with both video and still images: you can animate a static portrait photo into a speaking character.

Last updated 10 days ago