TheaterFire

Compression Aware prompting for quantized models

Posted by RMCPhoto@reddit | LocalLLaMA | View on Reddit | 1 comments

Reply to Post

1 Comments

Extension-Constant47@reddit

Funny, this paper was accepted by ICML 2024, but what exactly is the novelty here? Is it just prompt-tuning a compressed model?
View on Reddit #27559656