Russian LLMs
Posted by RhubarbSimilar1683@reddit | LocalLLaMA | View on Reddit | 30 comments
Here's one example: [https://huggingface.co/ai-sage/GigaChat-20B-A3B-instruct](https://huggingface.co/ai-sage/GigaChat-20B-A3B-instruct) it has a MoE architecture, I'm guessing from the parameter count that it's based on qwen3 architecture. They released a paper so I don't think it's a fine tune [https://huggingface.co/papers/2506.09440](https://huggingface.co/papers/2506.09440)
30 Comments
Shifty_13@reddit
Woof9000@reddit
mana_hoarder@reddit
Woof9000@reddit
mana_hoarder@reddit
Woof9000@reddit
Shifty_13@reddit
Shifty_13@reddit
Alex_L1nk@reddit
Shifty_13@reddit
Alex_L1nk@reddit
Shifty_13@reddit
justicecurcian@reddit
Guardian-Spirit@reddit
__JockY__@reddit
Guardian-Spirit@reddit
__JockY__@reddit
Guardian-Spirit@reddit
__JockY__@reddit
Guardian-Spirit@reddit
__JockY__@reddit
HadHands@reddit
RhubarbSimilar1683@reddit (OP)
HadHands@reddit
FriskyFennecFox@reddit
Own_Suspect5343@reddit
LicensedTerrapin@reddit
RhubarbSimilar1683@reddit (OP)
Alex_L1nk@reddit
justicecurcian@reddit