What is the best value card I could buy for decent performance?

Posted by equinoxel@reddit | LocalLLaMA | View on Reddit | 22 comments

I have a 1080 (ancient) card that I use now with 7b-ish models and I'm thinking of an update mainly to use larger models. My use case is running an embedding model alongside a normal one and I don't mind switching the "normal" models depending on the case (coding vs chatbot). I was looking for a comparator for different cards and their performance but couldn't find one that gives os/gpu/tps and eventually median price. So I wonder about the new 9060/9070 from AMD, the 16g Intel ones. Is it worth getting a gpu vs the 395 max/128g or nvidia's golden box thing?