Hunyuan-MT-7B / Hunyuan-MT-Chimera-7B
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 16 comments
Model Introduction
The Hunyuan Translation Model comprises a translation model, Hunyuan-MT-7B, and an ensemble model, Hunyuan-MT-Chimera. The translation model is used to translate source text into the target language, while the ensemble model integrates multiple translation outputs to produce a higher-quality result. It primarily supports mutual translation among 33 languages, including five ethnic minority languages in China.
Key Features and Advantages
- In the WMT25 competition, the model achieved first place in 30 out of the 31 language categories it participated in.
- Hunyuan-MT-7B achieves industry-leading performance among models of comparable scale
- Hunyuan-MT-Chimera-7B is the industry’s first open-source translation ensemble model, elevating translation quality to a new level
- A comprehensive training framework for translation models has been proposed, spanning from pretrain → cross-lingual pretraining (CPT) → supervised fine-tuning (SFT) → translation enhancement → ensemble refinement, achieving state-of-the-art (SOTA) results for models of similar size
StoryIntrepid9829@reddit
Preliminary En-Ru testing - it adds too much variation compared to source text. Long text blocks gets completely messed. Overall translation feels good, but considering points above, it leaves very mixed fileeng. Tested in Lm studio
Away_Expression_3713@reddit
Better than other open source models?
Formal_Scarcity_7861@reddit
The community has limited interest in MT LLMs, I doubt whether this will ever be supported in llama.cpp.
jacek2023@reddit (OP)
It's just 7B, people are not limited to llama.cpp, you can just use the model "standard way"
Foreign-Beginning-49@reddit
There's always HF bits and bytes too....
mpasila@reddit
GGUF is more accurate than bitsandbytes and not every model supports that (especially multimodal stuff).
kironlau@reddit
already supported,both tested on newest LM Studio and llama.cpp
Chinese <=> English(quite well,better than Gemma3 12b)
Formal_Scarcity_7861@reddit
For the supported model, do you mean Hunyuan-MT-Chimera-7B? The model card describes it as a 'translation ensemble model, elevating translation quality to a new level.' So does that mean it's not intended for translation? (Although it does seem to work.)
ThiccStorms@reddit
MT is very important though.
Capable-Ad-7494@reddit
I’m into it!
Green-Ad-3964@reddit
this is a great "small" model. I'd like to use it to translate some ebooks I have in chinese for personal use. Is there a tool to "continuously" translate very long text using a LLM?
Formal_Scarcity_7861@reddit
Check out LinguaGacha
https://github.com/neavo/LinguaGacha
darkpigvirus@reddit
Wow, it supports Filipino, nice
KoreanPeninsula@reddit
Supported languages include: Chinese, English, French, Portuguese, Spanish, Japanese, Turkish, Russian, Arabic, Korean, Thai, Italian, German, Vietnamese, Malay, Indonesian, Filipino, Hindi, Traditional Chinese, Polish, Czech, Dutch, Khmer, Burmese, Persian, Gujarati, Urdu, Telugu, Marathi, Hebrew, Bengali, Tamil, Ukrainian, Tibetan, Kazakh, Mongolian, Uyghur, and Cantonese.
nullmove@reddit
So what are those languages? Can't find it the readme.
Impressive_Half_2819@reddit
Tencent had quiet for a while.