DFlash: Block Diffusion for Flash Speculative Decoding

Posted by Total-Resort-3120@reddit | LocalLLaMA

https://z-lab.ai/projects/dflash/

https://github.com/z-lab/dflash

https://huggingface.co/collections/z-lab/dflash