Been out of the loop - Will this work for EXO/MLX?

Posted by NoUsual5150@reddit | LocalLLaMA | View on Reddit | 3 comments

Had to sell my AI server and am down to an M4 Macbook Air 16GB.

If I were to buy a used M1 Air with 16GB (run it headless) and connect the two via EXO + Thunderbolt...would it be possible to be able to run a (19.6GB) Qwen 3.5-27B-Q5_K_M.gguf at or around 10 tokens per second?

I have been out of the loop for over a year and trying to see if this proposed configuration would work.