noneabove1182

Qwen3 Next imatrix GGUFs up!

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 42 comments
Mergekit has been re-licensed under GNU LGPL v3

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 6 comments
DeepSeek V3.1 BASE Q4_K_M available

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 16 comments
LM Studio updated with Gemma 3 GGUF support!

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 63 comments
Any good benchmark sources for raw token performance (especially for CPUs)?

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 6 comments
Llama 4 (Scout) GGUFs are here! (and hopefully are final!) (and hopefully better optimized!)

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 64 comments
Llama 4 Scout sub 50GB GGUF Quantization showdown (aka I did some KLD comparisons)

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 24 comments
Quantization experimentation MMLU pro results

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 72 comments
From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 5 comments
Offering fewer GGUF options - need feedback

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 87 comments
Imatrix quants of DeepSeek R1 (the big one) are up!

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 50 comments
Gemma 2 9B GGUFs are up!

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 111 comments
PSA: Phi-4 GGUFs (from official release) don't work in Ollama (yet)

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 1 comment
Update on Huggingface storage limits ("unlimited" public, limited private)

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 32 comments
Any kind of LLM for OCR?

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 7 comments
GGUF of Llama 3 8B Instruct made with officially supported llama.cpp release and imatrix

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 1 comment
If your Qwen2 GGUF is spitting nonsense, enable flash attention

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 31 comments
Testing new GGUF quant configuration

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 16 comments
Llama 3 8B instruct with fixed BPE tokenizer uploaded

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 39 comments
I must have missed something, how is 2 bit working so well?

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 2 comments