DDTree-MLX — Tree-based speculative decoding for Apple Silicon.
Posted by Recoil42@reddit | LocalLLaMA | View on Reddit | 2 comments
Posted by Recoil42@reddit | LocalLLaMA | View on Reddit | 2 comments
MrBIMC@reddit
I wonder whether it will work with strix halo that was mentioned in post of mlx over hip/rocm earlier today.
Recoil42@reddit (OP)
Not my project, fyi.
Author is here: https://x.com/runsonai/status/2044430597251023156