Pristine_Income9554
Why don't we still have any games with AI agents used as NPC characters?
Posted by Another__one@reddit | LocalLLaMA | View on Reddit | 110 comments
Qwen3.6-35B-A3B Q4 262k context on 8GB 3070 Ti = +30tps
Posted by Alternative-Cat-1347@reddit | LocalLLaMA | View on Reddit | 40 comments
Pristine_Income9554@reddit
Qwen3.6-35B-A3B Q4 262k context on 8GB 3070 Ti = +30tps
Posted by Alternative-Cat-1347@reddit | LocalLLaMA | View on Reddit | 40 comments
Pristine_Income9554@reddit
Qwen3.6-35B-A3B Q4 262k context on 8GB 3070 Ti = +30tps
Posted by Alternative-Cat-1347@reddit | LocalLLaMA | View on Reddit | 40 comments
Pristine_Income9554@reddit
Qwen3.6-35B-A3B Q4 262k context on 8GB 3070 Ti = +30tps
Posted by Alternative-Cat-1347@reddit | LocalLLaMA | View on Reddit | 40 comments
Pristine_Income9554@reddit
Qwen3.6-35B-A3B Q4 262k context on 8GB 3070 Ti = +30tps
Posted by Alternative-Cat-1347@reddit | LocalLLaMA | View on Reddit | 40 comments
Pristine_Income9554@reddit
Qwen3.6-35B - Terrible instruction following when using context files (with vanilla pi-agent). Model issue or am I doing something wrong?
Posted by FusionX@reddit | LocalLLaMA | View on Reddit | 9 comments
Pristine_Income9554@reddit
Qwen 3.6 35 UD 2 K_XL is pulling beyond its weight and quantization (No one is GPU Poor now)
Posted by dreamai87@reddit | LocalLLaMA | View on Reddit | 68 comments
Pristine_Income9554@reddit
Qwen 3.6 35 UD 2 K_XL is pulling beyond its weight and quantization (No one is GPU Poor now)
Posted by dreamai87@reddit | LocalLLaMA | View on Reddit | 68 comments
Pristine_Income9554@reddit
Qwen 3.6 35 UD 2 K_XL is pulling beyond its weight and quantization (No one is GPU Poor now)
Posted by dreamai87@reddit | LocalLLaMA | View on Reddit | 68 comments
Pristine_Income9554@reddit
Qwen 3.6 35 UD 2 K_XL is pulling beyond its weight and quantization (No one is GPU Poor now)
Posted by dreamai87@reddit | LocalLLaMA | View on Reddit | 68 comments
Pristine_Income9554@reddit
Qwen 3.6 35 UD 2 K_XL is pulling beyond its weight and quantization (No one is GPU Poor now)
Posted by dreamai87@reddit | LocalLLaMA | View on Reddit | 68 comments
Pristine_Income9554@reddit
Qwen 3.6 35 UD 2 K_XL is pulling beyond its weight and quantization (No one is GPU Poor now)
Posted by dreamai87@reddit | LocalLLaMA | View on Reddit | 68 comments
Pristine_Income9554@reddit
Qwen3.5-35B-A3B-Heretic running surprisingly fast on RTX 3060 Ti 8GB - is Heretic castrated compared to original?
Posted by Temporary-Lack-1408@reddit | LocalLLaMA | View on Reddit | 47 comments
Pristine_Income9554@reddit
Imrpove Qwen3.5 Performance on Weak GPU
Posted by MarketingGui@reddit | LocalLLaMA | View on Reddit | 22 comments
Pristine_Income9554@reddit
System prompt for Qwen3.5 (27B/35BA3B) to reduce overthinking?
Posted by thigger@reddit | LocalLLaMA | View on Reddit | 27 comments
Pristine_Income9554@reddit
GLM4.7-Flash REAP @ 25% live on HF + agentic coding evals
Posted by ilzrvch@reddit | LocalLLaMA | View on Reddit | 20 comments
Pristine_Income9554@reddit
Fix for GLM 4.7 Flash has been merged into llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 91 comments
Pristine_Income9554@reddit
Fix for GLM 4.7 Flash has been merged into llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 91 comments
Pristine_Income9554@reddit
glm-4.7-flash has the best thinking process with clear steps, I love it
Posted by uptonking@reddit | LocalLLaMA | View on Reddit | 38 comments
Pristine_Income9554@reddit
My gpu poor comrades, GLM 4.7 Flash is your local agent
Posted by __Maximum__@reddit | LocalLLaMA | View on Reddit | 169 comments
Pristine_Income9554@reddit
My gpu poor comrades, GLM 4.7 Flash is your local agent
Posted by __Maximum__@reddit | LocalLLaMA | View on Reddit | 169 comments
Pristine_Income9554@reddit
I fine-tuned a 7B model for reasoning on free Colab with GRPO + TRL
Posted by External-Rub5414@reddit | LocalLLaMA | View on Reddit | 2 comments
Pristine_Income9554@reddit
AI has replaced programmers… totally.
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 297 comments
Pristine_Income9554@reddit
AI has replaced programmers… totally.
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 297 comments
Pristine_Income9554@reddit
ChatGPT stopped lying to me when I started treating it like a scared kid
Posted by Nan0pixel@reddit | LocalLLaMA | View on Reddit | 13 comments
Pristine_Income9554@reddit
Wan 2.1 1.3B fighting video is not as good as the Qwen 2.5 fighting videos I previously posted. I used the Wan 2.1 1.3B from Huge.com. Qwen 2.5 must be using some other type of super model for videos. Because this Wan has lost its' way.
Posted by Extension-Fee-8480@reddit | LocalLLaMA | View on Reddit | 10 comments
Pristine_Income9554@reddit
Wan 2.1 1.3B fighting video is not as good as the Qwen 2.5 fighting videos I previously posted. I used the Wan 2.1 1.3B from Huge.com. Qwen 2.5 must be using some other type of super model for videos. Because this Wan has lost its' way.
Posted by Extension-Fee-8480@reddit | LocalLLaMA | View on Reddit | 10 comments
Pristine_Income9554@reddit
Exceeding VRAM limit with QWQ IQ3XXS i1 quant, no OOM? (LM studio)
Posted by No_Expert1801@reddit | LocalLLaMA | View on Reddit | 7 comments
Pristine_Income9554@reddit
Think Tool Boosts Accuracy by 54%! (+ Ollama integration)
Posted by Straight-Worker-4327@reddit | LocalLLaMA | View on Reddit | 21 comments
Pristine_Income9554@reddit
Qwen LIED TO US
Posted by random-tomato@reddit | LocalLLaMA | View on Reddit | 7 comments
Pristine_Income9554@reddit
Think Tool Boosts Accuracy by 54%! (+ Ollama integration)
Posted by Straight-Worker-4327@reddit | LocalLLaMA | View on Reddit | 21 comments
Pristine_Income9554@reddit
Think Tool Boosts Accuracy by 54%! (+ Ollama integration)
Posted by Straight-Worker-4327@reddit | LocalLLaMA | View on Reddit | 21 comments
Pristine_Income9554@reddit
Think Tool Boosts Accuracy by 54%! (+ Ollama integration)
Posted by Straight-Worker-4327@reddit | LocalLLaMA | View on Reddit | 21 comments
Pristine_Income9554@reddit
Is the DeepSeek model poisoned at the data level?
Posted by aospan@reddit | LocalLLaMA | View on Reddit | 10 comments
Pristine_Income9554@reddit
1 Million Token Context Length 🔥
Posted by CelebrationClean7309@reddit | LocalLLaMA | View on Reddit | 39 comments
Pristine_Income9554@reddit
Opensource 8B parameter test time compute scaling(reasoning) model
Posted by TheLogiqueViper@reddit | LocalLLaMA | View on Reddit | 36 comments
Pristine_Income9554@reddit
It's getting difficult to evaluate models.
Posted by baehyunsol@reddit | LocalLLaMA | View on Reddit | 52 comments
Pristine_Income9554@reddit
KoboldcPP is such a gigantic leap in QoL coming from Oobabooga is just ridiculous.
Posted by pumukidelfuturo@reddit | LocalLLaMA | View on Reddit | 58 comments
Pristine_Income9554@reddit
6 bit quantization
Posted by Ok-Cicada-5207@reddit | LocalLLaMA | View on Reddit | 9 comments
Pristine_Income9554@reddit
Is LLM Studio good?
Posted by Top_Sonic@reddit | LocalLLaMA | View on Reddit | 91 comments
Pristine_Income9554@reddit
Is LLM Studio good?
Posted by Top_Sonic@reddit | LocalLLaMA | View on Reddit | 91 comments
Pristine_Income9554@reddit
Tumera 0.1.0a2 is here!
Posted by Sad-Fix-7915@reddit | LocalLLaMA | View on Reddit | 9 comments
Pristine_Income9554@reddit
Tumera 0.1.0a2 is here!
Posted by Sad-Fix-7915@reddit | LocalLLaMA | View on Reddit | 9 comments
Pristine_Income9554@reddit
Tumera 0.1.0a2 is here!
Posted by Sad-Fix-7915@reddit | LocalLLaMA | View on Reddit | 9 comments
Pristine_Income9554@reddit
Tumera 0.1.0a2 is here!
Posted by Sad-Fix-7915@reddit | LocalLLaMA | View on Reddit | 9 comments
Pristine_Income9554@reddit
Tumera 0.1.0a2 is here!
Posted by Sad-Fix-7915@reddit | LocalLLaMA | View on Reddit | 9 comments
Pristine_Income9554@reddit
Handy calculator for figuring out how much VRAM you need for a specific model + context window
Posted by Porespellar@reddit | LocalLLaMA | View on Reddit | 7 comments
Pristine_Income9554@reddit
Run Qwen 2.5, Qwen 2.5-Coder, Qwen 2.5-Math, and Other LMs in GGUF Format from HF 🤗 Locally
Posted by unseenmarscai@reddit | LocalLLaMA | View on Reddit | 18 comments
Pristine_Income9554@reddit
Run Qwen 2.5, Qwen 2.5-Coder, Qwen 2.5-Math, and Other LMs in GGUF Format from HF 🤗 Locally
Posted by unseenmarscai@reddit | LocalLLaMA | View on Reddit | 18 comments