Historical-Crazy1831
-
Why does Alibaba set a high price for its coding plan while releasing powerful open-source models?
Posted by Historical-Crazy1831@reddit | LocalLLaMA | 16 comments
-
Do the "*Claude-4.6-Opus-Reasoning-Distilled" models really add anything new to the original models?
Posted by Historical-Crazy1831@reddit | LocalLLaMA | 33 comments
-
With 48 GB VRAM on vLLM, Qwen3.6-27b-awq-int4 has only 120k ctx (fp8). Is that normal?
Posted by Historical-Crazy1831@reddit | LocalLLaMA | 16 comments
-
What is the best way to deploy an LLM on 3x3090?
Posted by Historical-Crazy1831@reddit | LocalLLaMA | 13 comments
-
3090 now >1100 USD on eBay??
Posted by Historical-Crazy1831@reddit | LocalLLaMA | 17 comments
-
Local LLM on Claude Code runs slow, any suggestions?
Posted by Historical-Crazy1831@reddit | LocalLLaMA | 7 comments
-
Gemini CLI seems more user-friendly than Claude Code to me as a researcher
Posted by Historical-Crazy1831@reddit | LocalLLaMA | 2 comments