Intern-S1-Pro (1T/A22B)

Posted by ResearchCrafty1804@reddit | LocalLLaMA | View on Reddit | 26 comments

Intern-S1-Pro (1T/A22B)
🚀Introducing Intern-S1-Pro, an advanced 1T MoE open-source multimodal scientific reasoning model. \- SOTA scientific reasoning, competitive with leading closed-source models across AI4Science tasks. \- Top-tier performance on advanced reasoning benchmarks, strong general multimodal performance on various benchmarks. \- 1T-A22B MoE training efficiency with STE routing (dense gradient for router training) and grouped routing for stable convergence and balanced expert parallelism. \- Fourier Position Encoding (FoPE) + upgraded time-series modeling for better physical signal representation; supports long, heterogeneous time-series (10\^0–10\^6 points). \- Intern-S1-Pro is now supported by vLLM @vllm\_project and SGLang @sgl\_project @lmsysorg — more ecosystem integrations are on the way. Huggingface: https://huggingface.co/internlm/Intern-S1-Pro GitHub: https://github.com/InternLM/Intern-S1