Is more cores faster ?

Posted by VolkoTheWorst@reddit | LocalLLaMA | View on Reddit | 18 comments

I would like to make an server to run big models (slowly)

I will run on CPU (or maybe add a GPU but it would be mostly offloaded to ram)

I was wondering if I should get an old Xeon (more cores) or a more classic CPU (less cores but each faster)

Basically, is llamacpp using all cores ? Can it suffer from having too much cores ?

Thanks \^\^

PS: I think I will run it on DDR3, I know it will be very very slow but it's just so much cheaper