Compression Aware prompting for quantized models Posted by RMCPhoto@reddit | LocalLLaMA | View on Reddit | 1 comments
[-] Extension-Constant47@reddit Funny, this paper was accepted by ICML 2024, but what exactly is the novelty here? Is it just prompt-tuning a compressed model? Reply Submit
1 Comments
Extension-Constant47@reddit