Intern-S1-Pro (1T/A22B)
Posted by ResearchCrafty1804@reddit | LocalLLaMA | View on Reddit | 26 comments
🚀Introducing Intern-S1-Pro, an advanced 1T MoE open-source multimodal scientific reasoning model.
\- SOTA scientific reasoning, competitive with leading closed-source models across AI4Science tasks.
\- Top-tier performance on advanced reasoning benchmarks, strong general multimodal performance on various benchmarks.
\- 1T-A22B MoE training efficiency with STE routing (dense gradient for router training) and grouped routing for stable convergence and balanced expert parallelism.
\- Fourier Position Encoding (FoPE) + upgraded time-series modeling for better physical signal representation; supports long, heterogeneous time-series (10\^0–10\^6 points).
\- Intern-S1-Pro is now supported by vLLM @vllm\_project and SGLang @sgl\_project @lmsysorg — more ecosystem integrations are on the way.
Huggingface: https://huggingface.co/internlm/Intern-S1-Pro
GitHub: https://github.com/InternLM/Intern-S1
26 Comments
pulse77@reddit
Karyo_Ten@reddit
crantob@reddit
Former-Ad-5757@reddit
pulse77@reddit
Lissanro@reddit
sine120@reddit
sine120@reddit
bene_42069@reddit
crantob@reddit
InternationalNebula7@reddit
Healthy-Nebula-3603@reddit
Lissanro@reddit
VoidAlchemy@reddit
Aggressive-Bother470@reddit
pigeon57434@reddit
JustSayin_thatuknow@reddit
JustSayin_thatuknow@reddit
lan-devo@reddit
ZestyCheeses@reddit
Daemontatox@reddit
Karyo_Ten@reddit
_Anime_Anuradha@reddit
Alternative-Theme885@reddit
Signature97@reddit
SlowFail2433@reddit