Fine-tuning LLMs to 1.58bit: extreme quantization experiment

Posted by shing3232@reddit | LocalLLaMA | View on Reddit | 14 comments

[https://github.com/huggingface/blog/blob/main/1\_58\_llm\_extreme\_quantization.md](https://github.com/huggingface/blog/blob/main/1_58_llm_extreme_quantization.md) [https://huggingface.co/blog/1\_58\_llm\_extreme\_quantization](https://huggingface.co/blog/1_58_llm_extreme_quantization)