Is there any way to estimate tokens per second given VRAM and such? The calculators don’t have every model.
Posted by Borkato@reddit | LocalLLaMA | View on Reddit | 7 comments
For example if I find GiggleBox-Super-Cool-Mega-SLOP-XL-24B.Q3629272\_K-x-ULTRA-super-DPO-LX-MegasXLR-Voldemort-GodU-Homelander-XFiles-Hopped-Into-A-Coffee-Shop.gguf and I know I have 24GB VRAM is there a way I can estimate the T/s I’ll get running it?
7 Comments
TokenRingAI@reddit
Borkato@reddit (OP)
nagareteku@reddit
nagareteku@reddit
Borkato@reddit (OP)
Electrical_Tax4470@reddit
Borkato@reddit (OP)