24GB M4 Mac - is Qwen 9B the only option while the system is running?

Posted by sagiroth@reddit | LocalLLaMA | 39 comments

I have a Mac at work that I want to use with a local model for prototyping and basic prompts that need to stay on device. What sort of model can I run that fits at least 64k context? Any setup shares or guides welcome.

I need to have Firefox open with at least one tab. The problem I have is all the crap that runs on the Mac itself by default.
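
To frame the question, here is a rough back-of-envelope sketch of the memory budget: quantized weights plus a standard-attention KV cache at 64k context, plus whatever the OS and Firefox need. All the numbers in it are placeholders I made up for illustration (layer count, KV heads, head dim, 4-bit weights, 8 GiB reserved for the system), not the real specs of Qwen or any other model, and the function names are just mine. Models with hybrid or sliding-window attention, or a quantized KV cache, would come out smaller.

```python
GIB = 1024 ** 3

def weights_bytes(n_params, bits_per_weight):
    """Approximate size of the model weights at a given quantization."""
    return n_params * bits_per_weight / 8

def kv_cache_bytes(n_layers, n_kv_heads, head_dim, context_len, bytes_per_elem=2):
    """KV cache for plain (full) attention: one K and one V vector
    per layer, per KV head, per token. bytes_per_elem=2 means fp16."""
    return 2 * n_layers * n_kv_heads * head_dim * context_len * bytes_per_elem

def fits(total_ram_gib, reserved_gib, weights, kv_cache):
    """True if weights + KV cache fit in what is left after the reserve."""
    available = (total_ram_gib - reserved_gib) * GIB
    return weights + kv_cache <= available

# Hypothetical 9B-parameter model, 4-bit weights (placeholder architecture).
w = weights_bytes(9_000_000_000, bits_per_weight=4)
kv = kv_cache_bytes(n_layers=36, n_kv_heads=8, head_dim=128, context_len=65536)

print(f"weights : {w / GIB:.2f} GiB")
print(f"kv cache: {kv / GIB:.2f} GiB")
# Assumes 8 GiB is kept free for macOS and Firefox; adjust to what you measure.
print("fits in 24 GiB with 8 GiB reserved:", fits(24, 8, w, kv))
```

With these made-up numbers the fp16 KV cache at 64k ends up larger than the 4-bit weights, which is why the context length matters as much as the parameter count here.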