Why doesn't DeepSeek release a smaller Air-style model? Is it because they are focused on research?

Posted by power97992@reddit | LocalLLaMA | 13 comments

Why doesn't DeepSeek release a smaller Air-style model, like a 120B A10B MoE model or a 32B dense model? It seems like they are mainly focused on research and don't release small models as frequently as GLM and Qwen do.