PracticlySpeaking

Offload settings for unsloth/Gemma-4 on Apple Silicon?

Posted by PracticlySpeaking@reddit | LocalLLaMA | 2 comments
Ollama 0.19 with MLX is the real deal

Posted by PracticlySpeaking@reddit | LocalLLaMA | 3 comments
New - Apple Neural Engine (ANE) backend for llama.cpp

Posted by PracticlySpeaking@reddit | LocalLLaMA | 24 comments
GPT-oss-120b - What is up with GPU Offload setting (LM Studio / Mac)

Posted by PracticlySpeaking@reddit | LocalLLaMA | 4 comments
macOS 26.2 to add full 'Neural Accelerator' support for M5 chips

Posted by PracticlySpeaking@reddit | LocalLLaMA | 32 comments
Is MLX working with new M5 matmul yet?

Posted by PracticlySpeaking@reddit | LocalLLaMA | 20 comments
Anyone with a 64GB Mac and unsloth gpt-oss-120b — Will it load with full GPU offload?

Posted by PracticlySpeaking@reddit | LocalLLaMA | 15 comments
Getting models to load in LM Studio [64GB Mac]

Posted by PracticlySpeaking@reddit | LocalLLaMA | 11 comments
Is it just me, or is LM Studio really pushing the new gpt-oss?

Posted by PracticlySpeaking@reddit | LocalLLaMA | 55 comments
Qwen3-Coder-30B-A3B MLX-DWQ variants like lr9e8

Posted by PracticlySpeaking@reddit | LocalLLaMA | 5 comments
What would you run with 128GB RAM instead of 64GB? (Mac)

Posted by PracticlySpeaking@reddit | LocalLLaMA | 39 comments