Tencent Hy 30B/7B/1.8B
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 15 comments
from tencent:
Hy-MT2 is a family of “fast-thinking” multilingual translation models designed for complex real-world scenarios. It includes three model sizes: 1.8B, 7B, and 30B-A3B (MoE), all of which support translation among 33 languages and effectively follow translation instructions in multiple languages. For on-device deployment, AngelSlim 1.25-bit extreme quantization reduces the storage requirement of the 1.8B model to only 440 MB and improves inference speed by 1.5x. Multi-dimensional evaluations show that Hy-MT2 delivers outstanding performance across general, real-world business, domain-specific, and instruction-following translation tasks. The 7B and 30B-A3B models outperform open-source models such as DeepSeek-V4-Pro and Kimi K2.6 in fast-thinking mode, while the lightweight 1.8B model also surpasses mainstream commercial APIs from providers such as Microsoft and Doubao overall.
In this release, we also open-source IFMTBench, a benchmark for evaluating translation instruction-following capabilities.
We also welcome everyone to use our released Hy-MT2-Translator Skill, which makes it easy to integrate Hy-MT2 series models for translation tasks. Download links: ClawHub and SkillHub.
Now, Tencent Hy is officially partnering with WMT26 for the "Video Subtitle Translation Task" (https://www2.statmt.org/wmt26/video-subtitle-translation.html). Participants who use the Hy-MT model series to compete in the "General Machine Translation Task" (https://www2.statmt.org/wmt26/translation-task.html) and the "Video Subtitle Translation Task" will have the chance to win special awards sponsored by Hunyuan. We sincerely invite everyone to participate and jointly push the boundaries of machine translation technology!

https://huggingface.co/tencent/Hy-MT2-30B-A3B
Icy-Degree6161@reddit
This is interesting, currently using Gemma4 - mostly European languages though, so this one might not be as good as gemma4 for me.
Ok_Profit8783@reddit
Question: How do you create those graph timings with all the AI's operational information? I always see them here in the community.
taking_bullet@reddit
I wish they would compare it to other translation models like Aya-expanse or TranslateGemma. Comparison with Chat GPT or Gemini makes no sense.
urarthur@reddit
its better than translategemma, but translategemma can do 55 and this only 33.
urarthur@reddit
its way better than any small model i tested. only gemini flash and flash lite are close. closer to gemini 3.1 pro model but 30x faster and free.
Beginning-Window-115@reddit
im pretty sure gemma 4 beats translate gemma so this beating gemma 4 would mean that its sota
Borkato@reddit
I didn’t realize this was a translation model and got so excited for a second
urarthur@reddit
you dont qant a translation model that does instant translation in33 at gemini 3.1 pro level anmt no cost on potato gpu?
FullOf_Bad_Ideas@reddit
Damn I just finished translating ~10B tokens with Hy MT 1.5 1.8B yesterday, I wish I'd have waited it out and started this week with MT 2.0 1.8B instead.
celsowm@reddit
Any space or chat to test a few prompts on they online?
pmttyji@reddit
But their license ....
Inside-Chance-320@reddit
I checkt, and thats crazy. If i would build a translator with that model in the EU, it would go against the license. It dosent matter if I'm the only one, using it.
DeltaSqueezer@reddit
I don't blame them. I wouldn't get caught under stupid EU AI laws either.
DeltaSqueezer@reddit
Thanks for sharing. I was impressed with how capable the 1.5 series was and look forward to testing the new ones.
Interesting_Key3421@reddit
Where is Hy-MT2-30B-A3B gguf?