Qwen-Image β€” a 20B MMDiT model

Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 24 comments

πŸš€ Meet Qwen-Image β€” a 20B MMDiT model for next-gen text-to-image generation. Especially strong at creating stunning graphic posters with native text. Now open-source. πŸ” Key Highlights: πŸ”Ή SOTA text rendering β€” rivals GPT-4o in English, best-in-class for Chinese πŸ”Ή In-pixel text generation β€” no overlays, fully integrated πŸ”Ή Bilingual support, diverse fonts, complex layouts 🎨 Also excels at general image generation β€” from photorealistic to anime, impressionist to minimalist. A true creative powerhouse.