Qwen-Image is here
Qwen-Image is here
π Meet Qwen-Image β a 20B MMDiT model for next-gen text-to-image generation. Especially strong at creating stunning graphic posters with native text. Now open-source.
π Key Highlights:
πΉ SOTA text rendering β rivals GPT-4o in English, best-in-class for Chinese
πΉ In-pixel text generation β no overlays, fully integrated
πΉ Bilingual support, diverse fonts, complex layouts
π¨ Also excels at general image generation β from photorealistic to anime, impressionist to minimalist. A true creative powerhouse.
Blog: https://qwenlm.github.io/blog/qwen-image/
Hugging Face: https://huggingface.co/Qwen/Qwen-Image
Model Scope: https://modelscope.cn/models/Qwen/Qwen-Image/summary
GitHub: https://github.com/QwenLM/Qwen-Image
Technical Report: https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-Image/Qwen_Image.pdf
WaveSpeed Demo: https://wavespeed.ai/models/wavespeed-ai/qwen-image/text-to-image
Demo: https://modelscope.cn/aigc/imageGeneration?tab=advanced