Qwen-Image is here
Qwen-Image is here
π Meet Qwen-Image β a 20B MMDiT model for next-gen text-to-image generation. Especially strong at creating stunning graphic posters with native text. Now open-source.
π Key Highlights:
πΉ SOTA text rendering β rivals GPT-4o in English, best-in-class for Chinese
πΉ In-pixel text generation β no overlays, fully integrated
πΉ Bilingual support, diverse fonts, complex layouts
π¨ Also excels at general image generation β from photorealistic to anime, impressionist to minimalist. A true creative powerhouse.
Blog: https://qwenlm.github.io/blog/qwen-image/
Hugging Face: https://huggingface.co/Qwen/Qwen-Image
Model Scope: https://modelscope.cn/models/Qwen/Qwen-Image/summary
GitHub: https://github.com/QwenLM/Qwen-Image
Technical Report: https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-Image/Qwen_Image.pdf
WaveSpeed Demo: https://wavespeed.ai/models/wavespeed-ai/qwen-image/text-to-image
Demo: https://modelscope.cn/aigc/imageGeneration?tab=advanced
But can it make a Poster of Tank man on Tianamen Square?
In fact it can but it makes Chinese propaganda pics first. Just ask and it makes pictures of a soldier in uniform on a tank with a red star. But ask more specific about the massacre and this will fall out:
How are you running this?
Huh :) the output quality is actually pretty impressive. It rivals Flux for sure.
TBH I haven't used any local image generators like Flux etc in a long time so I'm not even sure how to input this in, I think LM Studio is still a way off
What do you use?
I actually run Qwen locally using LM Studio. Even then it won't say anything that it deems as "controversial". If you want to use Flux, I'll share my LM Studio workflow later when I'm back at my ML Workstation