One thing I noticed is that you tested it with very short prompts; Z-Image Turbo really likes long prompts, and recommends using an LLM as a prompt enhancer, even providing a prompt template. [0] I have had pretty good look using an English translation that was posted on Reddit [1] with Qwen3-4B-Instruct locally sometimes modified somewhat for particular tasks; it seems biased to adding text to some images as-is) with short prompts.

[0] https://huggingface.co/Tongyi-MAI/Z-Image-Turbo/discussions/...

[1] https://www.reddit.com/r/StableDiffusion/comments/1p87xcd/zi...