Best coding model on RTX 3060

Posted by solimaotheelephant3@reddit | LocalLLaMA

Wondering what’s the best coding model that can fit on an RTX 3060 (12GB). Has anyone been able to do something useful with it?

Also wondering about the best setup (vLLM? llama.cpp?) and quantization.

Thanks a lot, this community is great.