
A state-of-the-art Diffusion Transformer (DiT) foundation model with 22 billion parameters. Unlike traditional video models, LTX-2 is natively designed for synchronized audio-video generation within a single unified latent space. It excels at maintaining temporal consistency and high-fidelity motion, making it a powerful backend for creative AI pipelines that require seamless audiovisual coherence.
LTX-2 is a 22B-parameter video generation model capable of producing high-fidelity videos with synchronized audio from text prompts or reference images. The API is asynchronous — submit a request, receive a job ID, poll for status, and download the result when complete.
Billed per second of generated video. The tier is determined by resolution and frame rate.
| Tier | Resolution | Cost |
|---|---|---|
sd |
≤ 1280×720 | $0.02/s |
hd |
> 1280×720 | $0.04/s |
The SD/HD boundary is 921,600 total pixels (width × height).
POST /v1/text-to-video with Authorization: Bearer YOUR_API_KEY.POST /v1/jobs/{id} until status is completed.POST /v1/jobs/{id}/download.| Method | Path | Description |
|---|---|---|
POST |
/v1/text-to-video |
Generate a video from a text prompt. |
POST |
/v1/image-to-video |
Generate a video from an image and text prompt. |
POST |
/v1/jobs/{id} |
Check job status. |
POST |
/v1/jobs/{id}/download |
Download the completed video. |
See API: Video Generation and API: Job Management for full endpoint documentation.
Text-to-Video (POST /v1/text-to-video)
| Parameter | Required | Default | Description |
|---|---|---|---|
model |
Yes | — | Model identifier. Use "Lightricks/LTX-2". |
prompt |
Yes | — | Text description of the desired video. |
duration |
No | 5 | Length in seconds (4–10). |
resolution |
No | 1280x704 |
Output resolution as WIDTHxHEIGHT. |
fps |
No | 25 | Frames per second. 25 or 50. |
Image-to-Video (POST /v1/image-to-video)
| Parameter | Required | Default | Description |
|---|---|---|---|
model |
Yes | — | Model identifier. Use "Lightricks/LTX-2". |
prompt |
Yes | — | Text describing how the image should animate. |
image |
Yes* | — | Uploaded image file. |
image_uri |
Yes* | — | Image URL or base64 data URI. |
duration |
No | 5 | Length in seconds (4–10). |
fps |
No | 25 | Frames per second. 25 or 50. |
resolution |
No | (input dims) | Output resolution. Defaults to input image dimensions. |
* Either image or image_uri must be provided.
The API never rejects a request due to resolution or duration constraints. Instead, it automatically adjusts parameters to fit within GPU limits, in this order:
When adjustments occur, the response adjustments field lists what changed. If nothing was adjusted, the field is null.
| Status Code | Description |
|---|---|
400 Bad Request |
Invalid input (e.g., missing prompt, unsupported image format). |
401 Unauthorized |
Invalid or missing API key. |
404 Not Found |
Job ID does not exist or has expired. |
409 Conflict |
Video download requested before generation is complete. |
500 Internal Error |
Internal failure during processing. |
All error responses follow this format:
{
"type": "error",
"error": {
"type": "not_found",
"message": "Job '550e8400-...' not found"
}
}
| Constraint | Value |
|---|---|
| Duration | 4–10 seconds per video |
| FPS | 25 or 50 |
| Max resolution | 1920 × 1088 |
| Job expiration | 24 hours |
| Concurrency | Queued, one per GPU |
Run models at scale with our fully managed GPU infrastructure, delivering enterprise-grade uptime at the industry's best rates.