Z-Image Turbo
Sub-second generation at 1 credit per image. The #1 ranked open-source text-to-image model — built on a 6B-parameter S3-DiT architecture with 8-step inference, delivering photorealistic quality at unmatched speed and minimal cost.
Powered by Alibaba's Tongyi MAI open-source foundation model
Lightning Speed, Rock-Bottom Cost
Z-Image Turbo distills a 6B-parameter foundation model down to just 8 inference steps using Decoupled-DMD plus reinforcement learning. The result: sub-second latency on enterprise GPUs, with photorealistic output that rivals models taking 10x longer to generate.
What Makes It Special
Four core strengths that set Z-Image Turbo apart
Ultra-Fast Inference
Only 8 inference steps via the S3-DiT (Scalable Single-Stream Diffusion Transformer) architecture. Sub-second latency on enterprise GPUs, and it runs comfortably on consumer devices with 16GB of VRAM.
Bilingual Text Rendering
Accurately renders complex Chinese and English text within generated images. Ideal for marketing banners, posters, social media graphics, and any scenario requiring precise in-image typography.
Prompt Reasoning Enhancement
Built-in reasoning capability that goes beyond surface-level descriptions. The model taps into world knowledge to understand context, spatial relationships, and implicit details in your prompts.
Fully Open Source
Released under the Apache 2.0 license with full model weights on Hugging Face. At 6B parameters, the model fits on consumer-grade GPUs (16GB VRAM). Free to use, modify, and deploy commercially.
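Since the weights are public on Hugging Face, local generation can be sketched with the diffusers library. This is a minimal sketch, not verified against the official model card: the repo id "Tongyi-MAI/Z-Image-Turbo" and the exact pipeline class the weights resolve to are assumptions — check the Hugging Face model page for the canonical loading code.

```python
# Hypothetical local-inference sketch for Z-Image Turbo via diffusers.
# Assumes: a CUDA GPU with ~16GB VRAM, and that the repo id below is correct.
import torch
from diffusers import DiffusionPipeline

# DiffusionPipeline.from_pretrained auto-resolves the pipeline class
# declared in the repo's model_index.json.
pipe = DiffusionPipeline.from_pretrained(
    "Tongyi-MAI/Z-Image-Turbo",   # assumed repo id
    torch_dtype=torch.bfloat16,   # half-precision to fit in 16GB VRAM
)
pipe.to("cuda")

# Turbo distillation means only 8 denoising steps are needed.
image = pipe(
    prompt="A neon-lit street market at night, photorealistic",
    num_inference_steps=8,
).images[0]
image.save("z_image_turbo_sample.png")
```

The 8-step setting is the key practical difference from non-distilled diffusion models, which typically need 25–50 steps per image.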
Start Creating with Z-Image Turbo
The fastest AI image generation at the lowest cost: 1 credit per image, sub-second generation.