ServiceNow-AI/SuperApriel-15B-Instruct · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 6 comments

A 15B-parameter token-mixer supernet with 8 optimized deployment presets spanning 1.0× to 10.7× decode throughput at 32K sequence length, all from a single checkpoint. Derived from Apriel-1.6 through stochastic distillation and targeted supervised fine-tuning.

Highlights