Looking for benchmarks comparing MLX to Metal: speed, RAM usage, speed drop-off at longer context, and, most importantly, output quality. MLX 4-bit versus all the similarly sized GGUFs.
Posted by visionsmemories@reddit | LocalLLaMA | View on Reddit | 0 comments