DeltaSqueezer
-
AI assisted music creation
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 6 comments
-
JetBrains open-sources Mellum2 - anyone tried these?
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 10 comments
-
A moment of thanks for DeepSeek
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 22 comments
-
New DeepSWE benchmark finds Claude Opus cheats
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 92 comments
-
Fed up with vibe coders, dev sneaks data-nuking prompt injection into their code
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 135 comments
-
The power of structured workflows and small local models
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 45 comments
-
got my first "rm -rf /" today
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 143 comments
-
CSP and z3 solver
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 2 comments
-
Small local model for questions on German grammar
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 24 comments
-
Prompt injection testing
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 5 comments
-
AI startup Cohere valued at $6.8 billion in latest fundraising, hires Meta exec
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 24 comments
-
Could PC x64 instruction extensions relieve hardware shortage?
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 19 comments
-
What questions have you asked reasoning models to solve that you couldn't get done with non-reasoning models?
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 34 comments
-
Recommendation for open source LLM proxy software
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 11 comments
-
Home-rolled loop agent is surprisingly effective
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 42 comments
-
Warning: do not write your own AI agent if you don't want to get sucked into a blackhole
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 6 comments
-
Warning, if you are on the spectrum, do not write your own AI coding agent - it is too addictive
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 0 comments
-
AI Coding Tabs vs Spaces
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 2 comments
-
How do you stop codebase from degenerating into an un-maintainable AI-slop mess?
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 87 comments
-
What open weight model for diagram generation
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 8 comments
-
How do you monitor what an agent is doing?
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 1 comments
-
Missing a Qwen3.5 model between the 9B and the 27B?
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 66 comments
-
Qwen 3 30B-A3B on P40
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 26 comments
-
ARC-AGI-3 is a fun game
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 5 comments
-
TurboQuant: Redefining AI efficiency with extreme compression
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 0 comments
-
Running Qwen3.5 in vLLM with MTP
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 8 comments
-
Kokoro #1 on TTS leaderboard
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 89 comments
-
Ruthless prompting efficiency
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 4 comments
-
vLLM profiling of prompts
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 1 comments
-
Finally, a real-time low-latency voice chat model
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 463 comments
-
Alibaba Releases OpenSandbox to Provide Software Developers with a Unified, Secure, and Scalable API for Autonomous AI Agent Execution
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 2 comments
-
Reverted from Qwen3.5 27B back to Qwen3 8B
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 39 comments
-
Llama 3.2 1B surprisingly good
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 55 comments
-
Is there a way to cleanly terminate a running inference job/slot with llama.cpp?
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 2 comments
-
Local Text To Speech
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 41 comments
-
Weird Qwen3.5 27B 'rabbit hole' failure mode
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 12 comments
-
OK, now we're at 1T parameter models, what's the 3090 equivalent way to run them locally?
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 62 comments
-
Hardware ASIC 17k tok/s
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 6 comments
-
Could High Bandwidth Flash be Local Inference's saviour?
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 29 comments
-
Re-sizeable BAR (REBAR) impact on performance
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 16 comments
-
China’s AI is quietly making big inroads in Silicon Valley | Technology
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 13 comments
-
GitHub - gruai/koifish: A c++ framework on efficient training & fine-tuning LLMs
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 5 comments
-
Moving beyond vibe-coding
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 10 comments
-
OCR tools for really very bad handwriting!
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 80 comments
-
Transparent LLM logging proxy
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 5 comments
-
GLM 4.7 on 8x3090
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 37 comments
-
Cheap bifurcation
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 20 comments
-
Nvidia CMP 170HX for LLMs?
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 21 comments
-
GLM released 4.6V including the apparent successor to Air. But I'm most interested to test the 9B Flash version
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 22 comments
-
llama.cpp not using kv cache effectively?
Posted by DeltaSqueezer@reddit | LocalLLaMA | View on Reddit | 17 comments