DDTree-MLX — Tree-based speculative decoding for Apple Silicon.

Posted by Recoil42@reddit | LocalLLaMA