Qwen-Image β a 20B MMDiT model
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 24 comments
π Meet Qwen-Image β a 20B MMDiT model for next-gen text-to-image generation. Especially strong at creating stunning graphic posters with native text. Now open-source.
π Key Highlights:
πΉ SOTA text rendering β rivals GPT-4o in English, best-in-class for Chinese
πΉ In-pixel text generation β no overlays, fully integrated
πΉ Bilingual support, diverse fonts, complex layouts
π¨ Also excels at general image generation β from photorealistic to anime, impressionist to minimalist. A true creative powerhouse.
24 Comments
ilintar@reddit
Shivacious@reddit
Rich_Artist_8327@reddit
Shivacious@reddit
Capable-Ad-7494@reddit
NickCanCode@reddit
Shivacious@reddit
Temporary_Exam_3620@reddit
taimusrs@reddit
Psychological-Sale-3@reddit
Weltleere@reddit
Rich_Artist_8327@reddit
ihaag@reddit
Agreeable_Cat602@reddit
Any_Pressure4251@reddit
Agreeable_Cat602@reddit
momentcurve@reddit
Equivalent-Word-7691@reddit
stddealer@reddit
jferments@reddit
shokuninstudio@reddit
ilintar@reddit
Xhehab_@reddit (OP)
MrWeirdoFace@reddit