Hunyuan-MT-7B / Hunyuan-MT-Chimera-7B

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 16 comments

Model Introduction

The Hunyuan Translation Model comprises a translation model, Hunyuan-MT-7B, and an ensemble model, Hunyuan-MT-Chimera. The translation model is used to translate source text into the target language, while the ensemble model integrates multiple translation outputs to produce a higher-quality result. It primarily supports mutual translation among 33 languages, including five ethnic minority languages in China.

Key Features and Advantages

In the WMT25 competition, the model achieved first place in 30 out of the 31 language categories it participated in.
Hunyuan-MT-7B achieves industry-leading performance among models of comparable scale
Hunyuan-MT-Chimera-7B is the industry’s first open-source translation ensemble model, elevating translation quality to a new level
A comprehensive training framework for translation models has been proposed, spanning from pretrain → cross-lingual pretraining (CPT) → supervised fine-tuning (SFT) → translation enhancement → ensemble refinement, achieving state-of-the-art (SOTA) results for models of similar size

[-]

StoryIntrepid9829@reddit

Preliminary En-Ru testing - it adds too much variation compared to source text. Long text blocks gets completely messed. Overall translation feels good, but considering points above, it leaves very mixed fileeng. Tested in Lm studio

Away_Expression_3713@reddit

Better than other open source models?

Formal_Scarcity_7861@reddit

The community has limited interest in MT LLMs, I doubt whether this will ever be supported in llama.cpp.

jacek2023@reddit (OP)

It's just 7B, people are not limited to llama.cpp, you can just use the model "standard way"

Foreign-Beginning-49@reddit

There's always HF bits and bytes too....

mpasila@reddit

GGUF is more accurate than bitsandbytes and not every model supports that (especially multimodal stuff).

kironlau@reddit

already supported，both tested on newest LM Studio and llama.cpp

Chinese <=> English(quite well，better than Gemma3 12b)

For the supported model, do you mean Hunyuan-MT-Chimera-7B? The model card describes it as a 'translation ensemble model, elevating translation quality to a new level.' So does that mean it's not intended for translation? (Although it does seem to work.)

ThiccStorms@reddit

MT is very important though.

Capable-Ad-7494@reddit

I’m into it!

Green-Ad-3964@reddit

this is a great "small" model. I'd like to use it to translate some ebooks I have in chinese for personal use. Is there a tool to "continuously" translate very long text using a LLM?

Check out LinguaGacha

https://github.com/neavo/LinguaGacha

darkpigvirus@reddit

Wow, it supports Filipino, nice

KoreanPeninsula@reddit

Supported languages include: Chinese, English, French, Portuguese, Spanish, Japanese, Turkish, Russian, Arabic, Korean, Thai, Italian, German, Vietnamese, Malay, Indonesian, Filipino, Hindi, Traditional Chinese, Polish, Czech, Dutch, Khmer, Burmese, Persian, Gujarati, Urdu, Telugu, Marathi, Hebrew, Bengali, Tamil, Ukrainian, Tibetan, Kazakh, Mongolian, Uyghur, and Cantonese.

nullmove@reddit

It primarily supports mutual translation among 33 languages

So what are those languages? Can't find it the readme.

Impressive_Half_2819@reddit

Tencent had quiet for a while.