qwen-image
Qwen/Qwen-Image is an image generation model that excels at complex text rendering, in both alphabetic and logographic scripts such as English and Chinese, and at precise image editing.
It is particularly strong in producing images with high-fidelity embedded text, making it well-suited for tasks where maintaining the integrity and clarity of text within generated images is critical. The model also provides consistent and realistic editing capabilities, such as style transfer, object addition/removal, and detailed attribute editing, with improved preservation of identity for people and products.
Other noteworthy features of Qwen/Qwen-Image include support for image understanding tasks (object detection, semantic segmentation, depth and edge estimation, novel view synthesis, and super-resolution) and native integration with ControlNet for enhanced conditioning and control of outputs.
| Metric | Value |
|---|---|
| Parameter Count | 20 billion |
| Mixture of Experts | No |
| Context Length | Unknown |
| Multilingual | Yes |
| Quantized* | No |
Example request
- Sync
- Async
- Async with SSE
See the image generation reference for more details.
- Minimal
- All parameters
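As a rough illustration of the "Minimal" and "All parameters" variants, the two request bodies might look like the sketch below. It is built only from the parameter tables in this document. The endpoint URL, authentication, and any wrapper object around these fields are not shown on this page, so they are left out rather than guessed. The variable names and the prompt text are hypothetical.

```python
import json

# Minimal variant: only the required field is sent; every optional
# parameter falls back to its documented default.
minimal_body = {
    "prompt": "A storefront sign that reads 'Open 24 Hours' in neon letters",
}

# All-parameters variant: every documented field is set explicitly.
# negative_prompt and seed are illustrative values; the remaining
# options repeat the defaults listed in the parameter table.
all_parameters_body = {
    "prompt": "A storefront sign that reads 'Open 24 Hours' in neon letters",
    "negative_prompt": "blurry, distorted text",
    "aspect_ratio": "16:9",
    "image_size": "optimize_for_quality",
    "num_inference_steps": 30,
    "guidance": 3,
    "seed": 42,
    "output_format": "jpg",
    "output_quality": 80,
    "disable_safety_checker": False,
}

# Serialize to JSON, the form in which a body like this would be sent.
print(json.dumps(minimal_body, indent=2))
print(json.dumps(all_parameters_body, indent=2))
```

Setting `seed` is what makes a generation reproducible; leaving it out lets each request pick its own seed.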
Fetch model details
The models endpoint returns the full model object, including its `json_request_schema`.
Request parameters
Required parameters
| Field | Type | Default | Description |
|---|---|---|---|
| prompt | string | "A beautiful landscape painting of a serene lake with mountains in the background" | Prompt for the generated image. |
Optional parameters
| Field | Type | Default | Description |
|---|---|---|---|
| negative_prompt | string | " " | Negative prompt for the generated image. |
| aspect_ratio | string | "16:9" | Aspect ratio for the generated image. One of: 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3. |
| image_size | string | "optimize_for_quality" | Image size for the generated image. One of: optimize_for_quality, optimize_for_speed. |
| num_inference_steps | integer | 30 | Number of denoising steps. The recommended range is 28-50; fewer steps run faster but produce lower-quality outputs. Range: 1 – 50. |
| guidance | number | 3 | Guidance for the generated image. Lower values can give more realistic images. Good values to try are 2, 2.5, 3, and 3.5. Range: 0 – 10. |
| seed | integer | — | Random seed. Set for reproducible generation. |
| output_format | string | "jpg" | Format of the output images. One of: webp, jpg, png. |
| output_quality | integer | 80 | Quality when saving the output images, from 0 to 100. 100 is best quality, 0 is lowest quality. Not relevant for .png outputs. Range: 0 – 100. |
| disable_safety_checker | boolean | false | Disable the safety checker for generated images. |
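The constraints in the tables above can be checked on the client before a request is sent. The following is a minimal sketch, not part of any official SDK: `build_request` and the constant names are hypothetical helpers, and the allowed values and ranges are copied from the tables. Whether the service itself rejects unknown fields is an assumption made here for strictness.

```python
# Allowed values for enumerated string parameters (from the table above).
ENUMS = {
    "aspect_ratio": {"1:1", "16:9", "9:16", "4:3", "3:4", "3:2", "2:3"},
    "image_size": {"optimize_for_quality", "optimize_for_speed"},
    "output_format": {"webp", "jpg", "png"},
}

# (accepted types, minimum, maximum) for numeric parameters.
RANGES = {
    "num_inference_steps": (int, 1, 50),
    "guidance": ((int, float), 0, 10),
    "output_quality": (int, 0, 100),
}

# Documented parameters that have no enumerated values or numeric range.
OTHER_FIELDS = {"negative_prompt", "seed", "disable_safety_checker"}


def build_request(prompt, **options):
    """Return a request body dict, raising ValueError on invalid input."""
    if not isinstance(prompt, str):
        raise ValueError("prompt is required and must be a string")
    for name, value in options.items():
        if name in ENUMS:
            if value not in ENUMS[name]:
                raise ValueError(f"{name} must be one of {sorted(ENUMS[name])}")
        elif name in RANGES:
            types, low, high = RANGES[name]
            # bool is a subclass of int in Python, so exclude it explicitly.
            if isinstance(value, bool) or not isinstance(value, types):
                raise ValueError(f"{name} has the wrong type")
            if not low <= value <= high:
                raise ValueError(f"{name} must be between {low} and {high}")
        elif name not in OTHER_FIELDS:
            # Assumption: treat undocumented fields as errors.
            raise ValueError(f"unknown parameter: {name}")
    return {"prompt": prompt, **options}
```

For example, `build_request("A poster with the title 'Jazz Night'", aspect_ratio="3:2", num_inference_steps=28)` returns a body that can be serialized as JSON, while `num_inference_steps=51` raises `ValueError`.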