zelkovamoon
Have you tried this -> 2x Modded 2080 ti 22GB with Nvlink
Posted by zelkovamoon@reddit | LocalLLaMA | View on Reddit | 20 comments
Have you tried this -> 2x Modded 2080 ti 22GB with Nvlink
Posted by zelkovamoon@reddit | LocalLLaMA | View on Reddit | 20 comments
zelkovamoon@reddit (OP)
Have you tried this -> 2x Modded 2080 ti 22GB with Nvlink
Posted by zelkovamoon@reddit | LocalLLaMA | View on Reddit | 20 comments
zelkovamoon@reddit (OP)
Have you tried this -> 2x Modded 2080 ti 22GB with Nvlink
Posted by zelkovamoon@reddit | LocalLLaMA | View on Reddit | 20 comments
zelkovamoon@reddit (OP)
Have you tried this -> 2x Modded 2080 ti 22GB with Nvlink
Posted by zelkovamoon@reddit | LocalLLaMA | View on Reddit | 20 comments
zelkovamoon@reddit (OP)
Have you tried this -> 2x Modded 2080 ti 22GB with Nvlink
Posted by zelkovamoon@reddit | LocalLLaMA | View on Reddit | 20 comments
zelkovamoon@reddit (OP)
Have you tried this -> 2x Modded 2080 ti 22GB with Nvlink
Posted by zelkovamoon@reddit | LocalLLaMA | View on Reddit | 20 comments
zelkovamoon@reddit (OP)
Qwen3.5-27B Q4 Quantization Comparison
Posted by TitwitMuffbiscuit@reddit | LocalLLaMA | View on Reddit | 116 comments
zelkovamoon@reddit
Liquid AI releases LFM2-2.6B-Transcript, an incredibly fast open-weight meeting transcribing AI model on-par with closed-source giants.
Posted by KaroYadgar@reddit | LocalLLaMA | View on Reddit | 31 comments
zelkovamoon@reddit
Liquid Ai released LFM2.5, family of tiny on-device foundation models.
Posted by Difficult-Cap-7527@reddit | LocalLLaMA | View on Reddit | 59 comments
zelkovamoon@reddit
llama.cpp performance breakthrough for multi-GPU setups
Posted by Holiday-Injury-9397@reddit | LocalLLaMA | View on Reddit | 205 comments
zelkovamoon@reddit
Senator in Tennessee introduces bill to felonize making AI "act as a companion" or "mirror human interactions"
Posted by CanineAssBandit@reddit | LocalLLaMA | View on Reddit | 218 comments
zelkovamoon@reddit
Best Local LLMs - 2025
Posted by rm-rf-rm@reddit | LocalLLaMA | View on Reddit | 219 comments
zelkovamoon@reddit
Stop wasting your MCP context window. LTP (Lazy Tool Protocol) reduces tool-calling overhead by up to 93 percent.
Posted by song-junhyeong@reddit | LocalLLaMA | View on Reddit | 43 comments
zelkovamoon@reddit
llama.cpp - useful flags - share your thoughts please
Posted by mossy_troll_84@reddit | LocalLLaMA | View on Reddit | 34 comments
zelkovamoon@reddit
llama.cpp - useful flags - share your thoughts please
Posted by mossy_troll_84@reddit | LocalLLaMA | View on Reddit | 34 comments
zelkovamoon@reddit
llama.cpp - useful flags - share your thoughts please
Posted by mossy_troll_84@reddit | LocalLLaMA | View on Reddit | 34 comments
zelkovamoon@reddit
Without a connection to a live data source, an LLM faces critical limitations: Hallucinations and Trust
Posted by balianone@reddit | LocalLLaMA | View on Reddit | 5 comments
zelkovamoon@reddit
Key Highlights of Google's New Open Model, FunctionGemma
Posted by Dear-Success-1441@reddit | LocalLLaMA | View on Reddit | 12 comments
zelkovamoon@reddit
8x Radeon 7900 XTX Build for Longer Context Local Inference - Performance Results & Build Details
Posted by Beautiful_Trust_8151@reddit | LocalLLaMA | View on Reddit | 231 comments
zelkovamoon@reddit
8x Radeon 7900 XTX Build for Longer Context Local Inference - Performance Results & Build Details
Posted by Beautiful_Trust_8151@reddit | LocalLLaMA | View on Reddit | 231 comments
zelkovamoon@reddit
I have 4 V100s. What do I do?
Posted by MackThax@reddit | LocalLLaMA | View on Reddit | 18 comments
zelkovamoon@reddit
zai-org/GLM-4.6V-Flash (9B) is here
Posted by Cute-Sprinkles4911@reddit | LocalLLaMA | View on Reddit | 68 comments
zelkovamoon@reddit
Function calling Finetuners?
Posted by zelkovamoon@reddit | LocalLLaMA | View on Reddit | 11 comments
zelkovamoon@reddit (OP)
Function calling Finetuners?
Posted by zelkovamoon@reddit | LocalLLaMA | View on Reddit | 11 comments
zelkovamoon@reddit (OP)
Intel Arc Pro B60 Battlematrix Preview: 192GB of VRAM for On-Premise AI
Posted by reps_up@reddit | LocalLLaMA | View on Reddit | 42 comments
zelkovamoon@reddit
Ministral-3 has been released
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 61 comments
zelkovamoon@reddit
A list of models released or udpated last week on this sub, in case you missed any (3rd Oct)
Posted by aifeed-fyi@reddit | LocalLLaMA | View on Reddit | 30 comments
zelkovamoon@reddit
Junyang Lin is drinking tea
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 30 comments
zelkovamoon@reddit
Why I do not like to see AI tools are implemented to distros!
Posted by BlokZNCR@reddit | linux | View on Reddit | 238 comments
zelkovamoon@reddit
Rise of the linux desktop will be driven by developing economies
Posted by KanonBalls@reddit | linux | View on Reddit | 71 comments
zelkovamoon@reddit
Linus Torvalds used to speak to engineers in 2012 the way I speak to LLMs now.
Posted by underbillion@reddit | linux | View on Reddit | 875 comments
zelkovamoon@reddit
support for the upcoming ERNIE 4.5 0.3B model has been merged into llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 17 comments
zelkovamoon@reddit
NVIDIA’s Highly Anticipated “Mini-Supercomputer,” the DGX Spark, Launches This Month — Bringing Immense AI Power to Your Hands — up to 4000$
Posted by _SYSTEM_ADMIN_MOD_@reddit | LocalLLaMA | View on Reddit | 290 comments
zelkovamoon@reddit
NVIDIA’s Highly Anticipated “Mini-Supercomputer,” the DGX Spark, Launches This Month — Bringing Immense AI Power to Your Hands — up to 4000$
Posted by _SYSTEM_ADMIN_MOD_@reddit | LocalLLaMA | View on Reddit | 290 comments
zelkovamoon@reddit
NVIDIA’s Highly Anticipated “Mini-Supercomputer,” the DGX Spark, Launches This Month — Bringing Immense AI Power to Your Hands — up to 4000$
Posted by _SYSTEM_ADMIN_MOD_@reddit | LocalLLaMA | View on Reddit | 290 comments
zelkovamoon@reddit
NVIDIA’s Highly Anticipated “Mini-Supercomputer,” the DGX Spark, Launches This Month — Bringing Immense AI Power to Your Hands — up to 4000$
Posted by _SYSTEM_ADMIN_MOD_@reddit | LocalLLaMA | View on Reddit | 290 comments
zelkovamoon@reddit
NVIDIA’s Highly Anticipated “Mini-Supercomputer,” the DGX Spark, Launches This Month — Bringing Immense AI Power to Your Hands — up to 4000$
Posted by _SYSTEM_ADMIN_MOD_@reddit | LocalLLaMA | View on Reddit | 290 comments
zelkovamoon@reddit
NVIDIA’s Highly Anticipated “Mini-Supercomputer,” the DGX Spark, Launches This Month — Bringing Immense AI Power to Your Hands — up to 4000$
Posted by _SYSTEM_ADMIN_MOD_@reddit | LocalLLaMA | View on Reddit | 290 comments
zelkovamoon@reddit
NVIDIA’s Highly Anticipated “Mini-Supercomputer,” the DGX Spark, Launches This Month — Bringing Immense AI Power to Your Hands — up to 4000$
Posted by _SYSTEM_ADMIN_MOD_@reddit | LocalLLaMA | View on Reddit | 290 comments
zelkovamoon@reddit
NVIDIA’s Highly Anticipated “Mini-Supercomputer,” the DGX Spark, Launches This Month — Bringing Immense AI Power to Your Hands — up to 4000$
Posted by _SYSTEM_ADMIN_MOD_@reddit | LocalLLaMA | View on Reddit | 290 comments
zelkovamoon@reddit
NVIDIA’s Highly Anticipated “Mini-Supercomputer,” the DGX Spark, Launches This Month — Bringing Immense AI Power to Your Hands — up to 4000$
Posted by _SYSTEM_ADMIN_MOD_@reddit | LocalLLaMA | View on Reddit | 290 comments
zelkovamoon@reddit
NVIDIA’s Highly Anticipated “Mini-Supercomputer,” the DGX Spark, Launches This Month — Bringing Immense AI Power to Your Hands — up to 4000$
Posted by _SYSTEM_ADMIN_MOD_@reddit | LocalLLaMA | View on Reddit | 290 comments
zelkovamoon@reddit
NVIDIA’s Highly Anticipated “Mini-Supercomputer,” the DGX Spark, Launches This Month — Bringing Immense AI Power to Your Hands — up to 4000$
Posted by _SYSTEM_ADMIN_MOD_@reddit | LocalLLaMA | View on Reddit | 290 comments
zelkovamoon@reddit
NVIDIA’s Highly Anticipated “Mini-Supercomputer,” the DGX Spark, Launches This Month — Bringing Immense AI Power to Your Hands — up to 4000$
Posted by _SYSTEM_ADMIN_MOD_@reddit | LocalLLaMA | View on Reddit | 290 comments
zelkovamoon@reddit
NVIDIA’s Highly Anticipated “Mini-Supercomputer,” the DGX Spark, Launches This Month — Bringing Immense AI Power to Your Hands — up to 4000$
Posted by _SYSTEM_ADMIN_MOD_@reddit | LocalLLaMA | View on Reddit | 290 comments
zelkovamoon@reddit
llama-4-scout-17B-16E GGUF running on Strix Halo (Ryzen AI MAX 395 + 128GB) (13s prompt processing edited out)
Posted by jfowers_amd@reddit | LocalLLaMA | View on Reddit | 47 comments
zelkovamoon@reddit
Current best options to convert to FP4
Posted by zelkovamoon@reddit | LocalLLaMA | View on Reddit | 9 comments
zelkovamoon@reddit (OP)
Current best options to convert to FP4
Posted by zelkovamoon@reddit | LocalLLaMA | View on Reddit | 9 comments
zelkovamoon@reddit (OP)
support for the upcoming ERNIE 4.5 0.3B model has been merged into llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 17 comments