Try LTX 2.3 Quality: Text to Video in the Workbench
Run this model interactively, tune parameters, and compare outputs.
ltx-2-3-quality-text-to-video
LTX 2.3 Quality (Text to Video) is the high-quality preset of Lightricks LTX-2.3 on fal, generating video with synchronized native audio directly from a text prompt. It runs a distilled DiT workflow with a quality preset control, supporting up to 481 frames at 1 to 60 FPS and flexible output resolutions.
The model generates sound effects, ambient noise, and dialogue alongside the visuals in a single pass, with optional prompt expansion and a configurable quality preset (low, medium, high, maximum). It is well suited for cinematic clips, stylized scenes, and short-form content where audio-visual synchronization matters. Audio output can be disabled to return a silent MP4.
| Metric | Value |
|---|---|
| Parameter Count | 22 billion |
| Mixture of Experts | No |
| Context Length | Unknown |
| Multilingual | Unknown |
| Quantized* | Unknown |
Example request
- Sync
- Async
- Async with SSE
This blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation.See the video generation reference for more details.
- Minimal
- All parameters
Fetch model details
The models endpoint returns the full model object, including itsjson_request_schema.
Request parameters
Required parameters
| Field | Type | Default | Description |
|---|---|---|---|
prompt | string | — | The prompt to generate the video from. |
Optional parameters
| Field | Type | Default | Description |
|---|---|---|---|
num_frames | integer | 121 | The number of frames to generate. Range: 9 – 481. |
resolution | string | "landscape_16_9" | The size of the generated video. |
frames_per_second | number | 24 | Frames per second of the generated video. Range: 1 – 60. |
generate_audio | boolean | true | Whether to include audio in the returned video. When disabled, the final MP4 is returned without an audio track. |
video_quality | string | "high" | The quality preset of the generated video. One of: low, medium, high, maximum. |
negative_prompt | string | "color distortion, overexposure, static, blurry details, subtitles, style, artwork, painting, frame, still, dim overall tone, worst quality, low quality, JPEG compression artifacts, ugly, mutilated, extra fingers, poorly drawn hands, poorly drawn face, deformed, disfigured, malformed limbs, fused fingers, motionless frame, cluttered background, three legs, crowded background, walking backwards" | The negative prompt to steer generation away from. |
seed | integer | — | Random seed for reproducibility. If None, a random seed is chosen. |