Distilling Qwen3 TTS

Posted by Reasonable_Friend_77@reddit | LocalLLaMA | View on Reddit | 12 comments

Hi all,

I've made a few attempts to distill Qwen3 TTS without much success. I'm trying to create a model that is half the size and see what's the quality trade off... but so far I only managed to produce garbage.

Does anyone have experience with distilling TTS models?

Any tips or documentation willing to share?