Is there any way to estimate tokens per second given VRAM and such? The calculators don’t have every model.

Posted by Borkato@reddit | LocalLLaMA | View on Reddit | 7 comments

For example if I find GiggleBox-Super-Cool-Mega-SLOP-XL-24B.Q3629272\_K-x-ULTRA-super-DPO-LX-MegasXLR-Voldemort-GodU-Homelander-XFiles-Hopped-Into-A-Coffee-Shop.gguf and I know I have 24GB VRAM is there a way I can estimate the T/s I’ll get running it?