Fast photorealistic text-to-image model with accurate English and Chinese on-image text, ideal for interactive design, marketing visuals, and UI/UX workflows.
Try Z-Image-Turbo in the Workbench
Run this model interactively, tune parameters, and compare outputs.
Model ID: z-image-turbo

Tongyi-MAI/Z-Image-Turbo is an image generation model that excels at fast text-to-image generation with photorealistic outputs and accurate bilingual (English and Chinese) text rendering, while running efficiently on 16 GB VRAM devices.

Other noteworthy features of Tongyi-MAI/Z-Image-Turbo include strong instruction adherence for prompt-following and an 8-step inference design that enables sub-second latency on data-center GPUs.
| Metric | Value |
| --- | --- |
| Parameter Count | 6 billion |
| Mixture of Experts | No |
| Context Length | Unknown |
| Multilingual | Yes |
| Quantized* | Yes |
| Precision* | Unknown |
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.
curl -X POST https://hub.oxen.ai/api/ai/images/generate \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
    "model": "z-image-turbo",
    "prompt": "A woman with red curly hair wearing a red flannel shirt, sitting in front of a campfire, night-time, photorealistic, indy folk vibes."
  }'

curl -X POST https://hub.oxen.ai/api/ai/images/generate \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OXEN_API_KEY" \
  -d '{
    "model": "z-image-turbo",
    "prompt": "A woman with red curly hair wearing a red flannel shirt, sitting in front of a campfire, night-time, photorealistic, indy folk vibes.",
    "num_inference_steps": 8,
    "output_format": "webp"
  }'
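
The same request can be made from Python. The following is a minimal sketch using only the standard library, not an official client. The endpoint, headers, and JSON fields are copied from the curl examples above. The helper names `build_request` and `generate_image` and the 60-second timeout are assumptions made for illustration. This page does not describe the response format, so the sketch returns the raw response body without decoding it.

```python
import json
import os
import urllib.request

# Endpoint taken from the curl examples above.
API_URL = "https://hub.oxen.ai/api/ai/images/generate"


def build_request(prompt, api_key, **options):
    """Build the POST request without sending it (hypothetical helper).

    Extra keyword arguments, such as num_inference_steps=8 or
    output_format="webp", are merged into the JSON body unchanged.
    """
    payload = {"model": "z-image-turbo", "prompt": prompt, **options}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def generate_image(prompt, **options):
    """Send the request and return the raw response body as bytes.

    Assumption: the response format is not documented on this page,
    so the body is returned undecoded for the caller to inspect.
    """
    request = build_request(prompt, os.environ["OXEN_API_KEY"], **options)
    # The 60-second timeout is an arbitrary choice for this sketch.
    with urllib.request.urlopen(request, timeout=60) as response:
        return response.read()
```

With `OXEN_API_KEY` set in the environment, calling `generate_image("A campfire at night, photorealistic", num_inference_steps=8, output_format="webp")` would send a body equivalent to the second curl example. Keeping `build_request` separate from the network call makes it possible to inspect the payload before anything is sent.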