How to use LTX Video 0.9.5 on ComfyUI


LTX Video 0.9.5 is an improved version of the LTX local video model. The model is very fast: it generates a 4-second video in 17 seconds on a consumer-grade RTX 4090 GPU. It's not quite real-time, but very close.

In this article, I will cover:

  • Improvements over the previous version
  • Text-to-video workflow
  • Image-to-video workflow
  • Fixing the first and last frames of the video

LTXV 0.9.5 Improvements

License

The good news is that the LTXV 0.9.5 version has a new Open RAIL-M license, which allows commercial use. You can host the model and use the generated videos for commercial purposes.

Text-to-video

Like the previous version, LTX Video 0.9.5 supports text-to-video. The video quality has improved.

LTX Text-to-video workflow.

Image-to-video

LTX-Video can use an image as the first frame and turn it into a video.

LTX Video 0.9.5 First frame image
First frame image.
“close up of 25yo beautiful woman face, start smiling”

Some videos from the image-to-video workflows can be quite hideous. I will give you some tips for generating good videos.

Fix the first and last frames

You can also set both the first and last frames of the video.

Text-to-video workflow

This workflow generates a 4-second video from a text description.

LTX Text-to-video workflow.

Step 0: Update ComfyUI

Before loading the workflow, make sure your ComfyUI is up-to-date. The easiest way to do this is to use ComfyUI Manager.

Click the Manager button on the top toolbar.

Select Update ComfyUI.

comfyui manager - update comfyui

Restart ComfyUI.

Step 1: Download models

Download ltx-video-2b-v0.9.5.safetensors and put it in ComfyUI > models > checkpoints.

Download t5xxl_fp16.safetensors and put it in ComfyUI > models > text_encoders.

Step 2: Load the workflow

Download the workflow below.

Drop it in ComfyUI.

Step 3: Install missing nodes

This workflow uses the Video Combine node to save the video as MP4. If you see red blocks, you don’t have the custom node that this workflow needs.

Click Manager > Install missing custom nodes and install the missing nodes.

Restart ComfyUI.

Step 4: Revise the prompt

Change the prompt to what you want to generate. LTXV works better with long and descriptive prompts. (You can use ChatGPT to expand a prompt. Put in “Expand the following video AI prompt:…”)

Step 5: Generate a video

Click the Queue button to generate the video.

queue button comfyui

Change the noise_seed value in the SamplerCustom node to generate a different video.
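If you want to generate many variations from a script instead of clicking Queue repeatedly, note that the Queue button simply POSTs the workflow JSON to ComfyUI's `/prompt` endpoint (port 8188 by default). A minimal sketch is below; it assumes you have exported your workflow with ComfyUI's "Save (API Format)" option, and that the seed lives under a SamplerCustom node's `noise_seed` input, as in this workflow.

```python
import json
import random
import urllib.request

def set_noise_seed(workflow: dict, seed: int) -> dict:
    """Set noise_seed on every SamplerCustom node in an API-format workflow."""
    for node in workflow.values():
        if node.get("class_type") == "SamplerCustom":
            node["inputs"]["noise_seed"] = seed
    return workflow

def queue_prompt(workflow: dict, host: str = "http://127.0.0.1:8188") -> None:
    """POST the workflow to ComfyUI's /prompt endpoint (default local port)."""
    data = json.dumps({"prompt": workflow}).encode("utf-8")
    req = urllib.request.Request(
        f"{host}/prompt",
        data=data,
        headers={"Content-Type": "application/json"},
    )
    urllib.request.urlopen(req)

# Usage sketch ("workflow_api.json" is a placeholder filename for your export):
#   with open("workflow_api.json") as f:
#       wf = json.load(f)
#   queue_prompt(set_noise_seed(wf, random.randint(0, 2**32 - 1)))
```

Each call with a fresh seed queues one new video on the running ComfyUI instance.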

Image-to-video workflow

This workflow takes an input image and uses it as the first frame to generate a video. You also need to describe the video in the prompt.

“close up of 25yo beautiful woman face, start smiling”

Step 0: Update ComfyUI

Before loading the workflow, make sure your ComfyUI is up-to-date. The easiest way to do this is to use ComfyUI Manager.

Click the Manager button on the top toolbar.

Select Update ComfyUI.

comfyui manager - update comfyui

Restart ComfyUI.

Step 1: Download models

Download ltx-video-2b-v0.9.5.safetensors and put it in ComfyUI > models > checkpoints.

Download t5xxl_fp16.safetensors and put it in ComfyUI > models > text_encoders.

Step 2: Load the workflow

Download the workflow below.

Drop it in ComfyUI.

Step 3: Install missing nodes

This workflow uses the Video Combine node to save the video as MP4. If you see red blocks, you don’t have the custom node that this workflow needs.

Click Manager > Install missing custom nodes and install the missing nodes.

Restart ComfyUI.

Step 4: Upload an image

Upload an image in the Load Image node. The image will be used as the first frame of the video.

Step 5: Revise the prompt

Change the prompt to what you want to generate. The prompt should match the uploaded image and describe what will happen in the next 4 seconds.

LTXV works better with long and descriptive prompts. (You can use ChatGPT to expand a prompt. Put in “Expand the following video AI prompt:…”)

Step 6: Generate a video

Click the Queue button to generate the video.

queue button comfyui
“close up of 25yo beautiful woman face, start smiling”

Fix the first and last frames in the video

This workflow fixes the first and last frames of the video. To use it, you need two input images and a prompt.

Step 0: Update ComfyUI

Before loading the workflow, make sure your ComfyUI is up-to-date. The easiest way to do this is to use ComfyUI Manager.

Click the Manager button on the top toolbar.

Select Update ComfyUI.

comfyui manager - update comfyui

Restart ComfyUI.

Step 1: Download models

Download ltx-video-2b-v0.9.5.safetensors and put it in ComfyUI > models > checkpoints.

Download t5xxl_fp16.safetensors and put it in ComfyUI > models > text_encoders.

Step 2: Load the workflow

Download the workflow below.

Drop it in ComfyUI.

Step 3: Install missing nodes

This workflow uses the Video Combine node to save the video as MP4. If you see red blocks, you don’t have the custom node that this workflow needs.

Click Manager > Install missing custom nodes and install the missing nodes.

Restart ComfyUI.

Step 4: Upload an image

Upload the image of the first frame in the upper Load Image node and the image of the last frame in the lower Load Image node.

Step 5: Revise the prompt

Change the prompt to what you want to generate. The prompt should match the uploaded images and describe what will happen in the next 4 seconds.

LTXV works better with long and descriptive prompts. (You can use ChatGPT to expand a prompt. Put in “Expand the following video AI prompt:…”)

Step 6: Generate a video

Click the Queue button to generate the video.

queue button comfyui

Tips

Generate a new video

Change the noise_seed in the SamplerCustom node to generate a new video.

Change video size

The default resolution is 768 x 512.

You can swap the width and height to generate a portrait video.

I don’t recommend changing the resolution. You will get lower-quality videos.

Video length

Change the length setting (measured in frames) to adjust the length of the video.
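As a rule of thumb, the LTXV latent compresses time by a factor of 8, so valid frame counts have the form 8k + 1 (the default of 97 frames at 24 fps is roughly 4 seconds). Assuming those defaults, a small helper can convert a target duration into the nearest valid length value:

```python
def ltx_length(seconds: float, fps: int = 24) -> int:
    """Nearest valid LTXV frame count (a multiple of 8, plus 1) for a duration.

    Assumes the default 24 fps and a minimum of 9 frames (8 * 1 + 1).
    """
    frames = round(seconds * fps)
    return max(9, round((frames - 1) / 8) * 8 + 1)

# A 4-second clip at 24 fps maps to the default length of 97 frames.
```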

Generate more than 1 video at a time

The batch_size setting in the EmptyLTXVLatentVideo node controls how many videos are generated at a time. Change it to generate multiple videos.

Avoid difficult motions

It is too much to ask the LTXV 0.9.5 model to generate complex motions, like a person putting on a jacket, or videos with large body movements. You will be disappointed!

Long prompts work better

The more you write in the prompt, the better the video. Use ChatGPT to expand the prompt with something like:

Expand the following AI video prompt: A very cool car transforms into a birthday cake

You may get details that do not match the input image, e.g., a blue car instead of a red one. Adjust accordingly.
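If you'd rather do this expansion from a script than in the ChatGPT web UI, here is a minimal sketch against the OpenAI chat completions endpoint. The instruction wording and the model name are just examples, and you need an `OPENAI_API_KEY` environment variable set:

```python
import json
import os
import urllib.request

INSTRUCTION = "Expand the following AI video prompt into a long, descriptive prompt:"

def build_request_body(short_prompt: str, model: str = "gpt-4o-mini") -> dict:
    """Chat-completions request body asking the model to expand a short prompt."""
    return {
        "model": model,  # example model name; use whatever you have access to
        "messages": [{"role": "user",
                      "content": f"{INSTRUCTION} {short_prompt}"}],
    }

def expand_prompt(short_prompt: str) -> str:
    """Call the OpenAI chat completions endpoint and return the expanded prompt."""
    req = urllib.request.Request(
        "https://api.openai.com/v1/chat/completions",
        data=json.dumps(build_request_body(short_prompt)).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}",
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]
```

Paste the returned text into the prompt box, then trim any details that contradict your input image.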

Generate a few videos and pick the best one

Sometimes, the video is just bad. It is not your fault. Change the seed and generate a new one. It is fast.

Use FP8 text encoders to save space

If you use an NVIDIA RTX 4000-series GPU or newer, you can use the FP8 version of the text encoder to save storage space. It is about 5 GB instead of 10 GB.

t5xxl_fp8_e4m3fn_scaled.safetensors



By Andrew

Andrew is an experienced software engineer with a specialization in Machine Learning and Artificial Intelligence. He is passionate about programming, art, and education. He has a doctorate degree in engineering.
