M4 Max 128GB running Qwen 72B Q4 MLX at 11 tokens/second.

Posted by tony__Y@reddit | LocalLLaMA | 69 comments
