noneabove1182
-
Qwen3 Next imatrix GGUFs up!
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 42 comments
-
Mergekit has been re-licensed under GNU LGPL v3
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 6 comments
-
DeepSeek V3.1 BASE Q4_K_M available
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 16 comments
-
LM Studio updated with Gemma 3 GGUF support!
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 63 comments
-
Any good benchmark sources for raw token performance (especially for CPUs)?
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 6 comments
-
Llama 4 (Scout) GGUFs are here! (and hopefully are final!) (and hopefully better optimized!)
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 64 comments
-
Llama 4 Scout sub 50GB GGUF Quantization showdown (aka I did some KLD comparisons)
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 24 comments
-
Quantization experimentation MMLU pro results
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 72 comments
-
From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 5 comments
-
Offering fewer GGUF options - need feedback
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 87 comments
-
Imatrix quants of DeepSeek R1 (the big one) are up!
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 50 comments
-
Gemma 2 9B GGUFs are up!
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 111 comments
-
PSA: Phi-4 GGUFs (from official release) don't work in Ollama (yet)
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 1 comments
-
Update on Huggingface storage limits ("unlimited" public, limited private)
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 32 comments
-
Any kind of LLM for OCR?
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 7 comments
-
GGUF of Llama 3 8B Instruct made with officially supported llama.cpp release and imatrix
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 1 comments
-
If your Qwen2 GGUF is spitting nonsense, enable flash attention
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 31 comments
-
Testing new GGUF quant configuration
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 16 comments
-
Llama 3 8B instruct with fixed BPE tokenizer uploaded
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 39 comments
-
I must have missed something, how is 2 bit working so well?
Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 2 comments