llamacpp with Gemma4 31B dense and Gemma e4b as draft, plus audio input?

Posted by caetydid@reddit | LocalLLaMA | View on Reddit | 15 comments

Hi,

has anybody succeeded in running llama.cpp with Gemma 31b dense and Gemma e4b as draft model, and simultaneously inhibit the voice recognition feature?

Is it even (theoretically) possible?

thanks