What actually pushed you to commit to running local models full time?

Posted by Necessary-Summer-348@reddit | LocalLLaMA | View on Reddit | 37 comments

Curious what the tipping point was for people who made the switch. For me it was a combination of latency for agentic workflows and not wanting API calls going through a third party for certain use cases. The cost argument got a lot better too once quantized models actually became usable. What was the deciding factor for you?