shbong
I burned a weekend making the models "remember" me. The fix had nothing to do with trying to run bigger models locally
Posted by shbong@reddit | LocalLLaMA | View on Reddit | 19 comments
How much VRAM needed for Qwen 3.6 27B Q8 with 262K context?
Posted by My_Unbiased_Opinion@reddit | LocalLLaMA | View on Reddit | 79 comments
shbong@reddit
I burned a weekend making the models "remember" me. The fix had nothing to do with trying to run bigger models locally
Posted by shbong@reddit | LocalLLaMA | View on Reddit | 19 comments
shbong@reddit (OP)
What memory system are you using for your agents?
Posted by Mr_Moonsilver@reddit | LocalLLaMA | View on Reddit | 55 comments
shbong@reddit
Stop asking what model to run. There are literally only two.
Posted by Wrong_Mushroom_7350@reddit | LocalLLaMA | View on Reddit | 549 comments
shbong@reddit
Anyone else experimenting with memory for LLMs?
Posted by shbong@reddit | LocalLLaMA | View on Reddit | 41 comments
shbong@reddit (OP)
Calling it now Microsoft is buying Unsloth.
Posted by Wrong_Mushroom_7350@reddit | LocalLLaMA | View on Reddit | 289 comments
shbong@reddit
Gemma4 27b vs GPT-OSS 20b -- Has anyone compared them?
Posted by shbong@reddit | LocalLLaMA | View on Reddit | 19 comments
shbong@reddit (OP)
A local agent (that works with local models) that is easy to set up.
Posted by Valuable-Run2129@reddit | LocalLLaMA | View on Reddit | 3 comments
shbong@reddit
4B models on smartphone
Posted by Sudden_Vegetable6844@reddit | LocalLLaMA | View on Reddit | 9 comments
shbong@reddit
If you haven't yet given Gemma 4 a go...do it today
Posted by No-Anchovies@reddit | LocalLLaMA | View on Reddit | 206 comments
shbong@reddit
Why retrieval breaks once documents stop being static
Posted by EnoughNinja@reddit | LocalLLaMA | View on Reddit | 1 comment
shbong@reddit
I'm shocked (Gemma 4 results)
Posted by Potential-Gold5298@reddit | LocalLLaMA | View on Reddit | 78 comments
shbong@reddit
M1 Max vs M4 Max vs M5 Max
Posted by br_web@reddit | LocalLLaMA | View on Reddit | 4 comments
shbong@reddit
which macbook configuration to buy
Posted by Ayuzh@reddit | LocalLLaMA | View on Reddit | 11 comments
shbong@reddit
Thoughts on the almost near release Avocado?
Posted by shbong@reddit | LocalLLaMA | View on Reddit | 6 comments
shbong@reddit (OP)
Built a graph-based "memory layer" for agents - Qwen > LLaMA for us, GPT-OSS 20B fast but tooling issues
Posted by shbong@reddit | LocalLLaMA | View on Reddit | 4 comments
shbong@reddit (OP)
OpenAI pivot investors love
Posted by PaceImaginary8610@reddit | LocalLLaMA | View on Reddit | 124 comments
shbong@reddit
I feel personally attacked
Posted by HeadAcanthisitta7390@reddit | LocalLLaMA | View on Reddit | 219 comments
shbong@reddit
LLMs finally remembering: I’ve built the memory layer, now it’s time to explore
Posted by shbong@reddit | LocalLLaMA | View on Reddit | 4 comments
shbong@reddit (OP)
8x RTX 3090 open rig
Posted by Armym@reddit | LocalLLaMA | View on Reddit | 391 comments