sensenova/SenseNova-U1-A3B-MoT · Hugging Face
Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 0 comments
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
🚀 SenseNova U1 is a new series of native multimodal models that unifies multimodal understanding, reasoning, and generation within a monolithic architecture. It marks a fundamental paradigm shift in multimodal AI: from modality integration to true unification. Rather than relying on adapters to translate between modalities, SenseNova U1 models think-and-act across language and vision natively.
Unifying visual understanding and generation in an end-to-end architecture from pixel to word opens tremendous possibilities, enabling highly efficient and strong understanding, generation, and interleaved reasoning in a natively multimodal manner.
| Model | Params | HF Weights |
|---|---|---|
| SenseNova-U1-8B-MoT-SFT | 8B MoT | 🤗 link |
| SenseNova-U1-8B-MoT | 8B MoT | 🤗 link |
| SenseNova-U1-8B-MoT-LoRA-8step-V1.0 | 0.4B | 🤗 link |
| SenseNova-U1-A3B-MoT-SFT | A3B MoT | 🤗 link |
| SenseNova-U1-A3B-MoT | A3B MoT | 🤗 link |
2 weeks ago, they released 8B model mentioned in above table.