I made another LLM VRAM calculator

Posted by PreferenceAsleep8093@reddit | LocalLLaMA

Most calculators just guess from the parameter count, so I made one that actually pulls the model's config.json from Hugging Face and uses it to calculate the K/V cache and runtime overhead.
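For anyone curious what a config-based K/V cache estimate looks like, here is a rough sketch in Python. It is not the calculator's actual code. It assumes a Llama-style config.json (the fields `num_hidden_layers`, `num_attention_heads`, `num_key_value_heads`, `hidden_size`, and optionally `head_dim`), an fp16 cache by default, and the usual `resolve/main/config.json` URL on the Hub. The helper names `fetch_config` and `kv_cache_bytes` are made up for illustration, and runtime overhead is not modeled here.

```python
import json
import urllib.request


def fetch_config(repo_id: str) -> dict:
    """Download config.json for a public Hugging Face repo.

    Assumption: the file lives on the `main` branch and the repo is not gated
    (gated repos need an auth token, which this sketch does not handle).
    """
    url = f"https://huggingface.co/{repo_id}/resolve/main/config.json"
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)


def kv_cache_bytes(config: dict, context_len: int,
                   batch_size: int = 1, bytes_per_elem: int = 2) -> int:
    """Estimate K/V cache size in bytes for a Llama-style config.

    Size = 2 (K and V) * layers * kv_heads * head_dim
           * context_len * batch_size * bytes_per_elem
    """
    layers = config["num_hidden_layers"]
    attn_heads = config["num_attention_heads"]
    # Models with grouped-query attention store fewer K/V heads than query
    # heads; fall back to the query head count when the field is absent.
    kv_heads = config.get("num_key_value_heads") or attn_heads
    # Some configs give head_dim explicitly; otherwise derive it.
    head_dim = config.get("head_dim") or config["hidden_size"] // attn_heads
    return (2 * layers * kv_heads * head_dim
            * context_len * batch_size * bytes_per_elem)


if __name__ == "__main__":
    # Hand-written config with 7B-class Llama-style dimensions (no download).
    cfg = {"num_hidden_layers": 32, "num_attention_heads": 32,
           "hidden_size": 4096}
    gib = kv_cache_bytes(cfg, context_len=4096) / 1024**3
    print(f"K/V cache at 4096 tokens, fp16: {gib:.2f} GiB")
```

With those dimensions the fp16 cache at 4096 tokens works out to 2 GiB, and setting `num_key_value_heads` to 8 cuts it to a quarter of that. That kind of difference is invisible to a calculator that only looks at the parameter count.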

What it does:

Link: Local AI VRAM Calculator