Historical-Crazy1831

Why Alibaba set high price for coding plan, while releasing powerful open source models?

Posted by Historical-Crazy1831@reddit | LocalLLaMA | View on Reddit | 16 comments
Do the "*Claude-4.6-Opus-Reasoning-Distilled" really bring something new to the original models?

Posted by Historical-Crazy1831@reddit | LocalLLaMA | View on Reddit | 33 comments
With 48gb vram, on vllm, Qwen3.6-27b-awq-int4 has only 120k ctx (fp8), is that normal?

Posted by Historical-Crazy1831@reddit | LocalLLaMA | View on Reddit | 16 comments
What is the best way to deploy LLM on 3x3090?

Posted by Historical-Crazy1831@reddit | LocalLLaMA | View on Reddit | 13 comments
3090 now >1100 usd on ebay??

Posted by Historical-Crazy1831@reddit | LocalLLaMA | View on Reddit | 17 comments
local llm on claude code runs slow, any suggestion?

Posted by Historical-Crazy1831@reddit | LocalLLaMA | View on Reddit | 7 comments
Gemini cli seems to be more user-friendly than claude code to me as a researcher

Posted by Historical-Crazy1831@reddit | LocalLLaMA | View on Reddit | 2 comments