Compared AMD 7900 XTX to Nvidia RTX 4000 SFF Ada

Posted by badabimbadabum2@reddit | LocalLLaMA

Compared using Ollama with the phi4:latest model.

* architecture: phi3
* parameters: 14.7B
* context length: 16384
* embedding length: 5120
* quantization: Q4\_K\_M

AMD 7900 XTX (300W TDP)

* prompt eval duration: 5ms
* eval rate: 54.26 tokens/s
* total system power during inference: 430W

Nvidia 4000 SFF Ada (70W TDP)

* prompt eval duration: 21ms
* eval rate: 26.45 tokens/s
* total system power during inference: 134W
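For anyone who wants to turn these numbers into an efficiency figure, here is a rough sketch that divides the eval rate by the measured power. It only uses the values posted above. The `Run` class and its method names are made up for illustration, and since the wattage is total system power (not GPU-only), the result says more about the whole box than about the card itself.

```python
from dataclasses import dataclass


@dataclass
class Run:
    """One benchmark result from the post (hypothetical helper type)."""
    name: str
    eval_rate_tps: float    # eval rate in tokens/s, as reported by Ollama
    system_power_w: float   # total system power during inference, in watts

    def tokens_per_joule(self) -> float:
        # (tokens/s) / (J/s) = tokens per joule
        return self.eval_rate_tps / self.system_power_w

    def joules_per_token(self) -> float:
        # Energy drawn by the whole system per generated token
        return self.system_power_w / self.eval_rate_tps


runs = [
    Run("AMD 7900 XTX", eval_rate_tps=54.26, system_power_w=430.0),
    Run("Nvidia 4000 SFF Ada", eval_rate_tps=26.45, system_power_w=134.0),
]

for r in runs:
    print(f"{r.name}: {r.tokens_per_joule():.3f} tokens/J, "
          f"{r.joules_per_token():.2f} J/token")

# Relative comparison between the two cards
speed_ratio = runs[0].eval_rate_tps / runs[1].eval_rate_tps
efficiency_ratio = runs[1].tokens_per_joule() / runs[0].tokens_per_joule()
print(f"7900 XTX is {speed_ratio:.2f}x faster at generation")
print(f"4000 SFF Ada gets {efficiency_ratio:.2f}x more tokens per joule (system level)")
```

On these figures the 7900 XTX generates roughly twice as fast, while the 4000 SFF Ada system uses noticeably less energy per token. Idle power of the rest of the system is included in both measurements, so a GPU-only comparison could look different.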