Try Kling O3 4K: Text-to-Video in the Workbench
Run this model interactively, tune parameters, and compare outputs.
kling-video-o3-4k-text-to-video
Kling O3 4K Text-to-Video is the first AI video model with native 4K (3840x2160) output, generating cinema-grade footage directly from a text prompt with no upscaling step. It supports clips from 3 to 15 seconds, multiple aspect ratios (16:9, 9:16, 1:1), and optional native synchronized audio in English and Chinese. Built for production-ready visuals, large-screen displays, and professional creative workflows where clarity and cinematic detail are non-negotiable.
Example request
- Sync
- Async
- Async with SSE
This blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation.See the video generation reference for more details.
- Minimal
- All parameters
Fetch model details
The models endpoint returns the full model object, including itsjson_request_schema.
Request parameters
Required parameters
| Field | Type | Default | Description |
|---|---|---|---|
prompt | string | — | Text prompt describing the scene to generate. |
Optional parameters
| Field | Type | Default | Description |
|---|---|---|---|
duration | integer | 5 | Length of the generated video in seconds. Range: 3 – 15. |
aspect_ratio | string | "16:9" | Aspect ratio of the generated video. One of: 16:9, 9:16, 1:1. |
generate_audio | boolean | false | Generate synchronized native audio (English or Chinese) with the video. |