Mguyen
What is the point of MoE models, beyond being faster?
Posted by ihatebeinganonymous@reddit | LocalLLaMA | View on Reddit | 135 comments
Used ray tracing cores on my RTX 5070 Ti for LLM routing — 218x speedup, runs entirely on 1 consumer GPU
Posted by Critical-Chef9211@reddit | LocalLLaMA | View on Reddit | 91 comments
Mguyen@reddit
American closed models vs Chinese open models is becoming a problem.
Posted by __JockY__@reddit | LocalLLaMA | View on Reddit | 622 comments
Mguyen@reddit
Full Claude Opus 4.6 System Prompt for your pleasure
Posted by frubberism@reddit | LocalLLaMA | View on Reddit | 56 comments
Mguyen@reddit
NVIDIA releases Nemotron 3 Nano, a new 30B hybrid reasoning model!
Posted by Difficult-Cap-7527@reddit | LocalLLaMA | View on Reddit | 188 comments
Mguyen@reddit
Glm 4.6 is out and it's going against claude 4.5
Posted by Independent-Wind4462@reddit | LocalLLaMA | View on Reddit | 47 comments
Mguyen@reddit
[Acc] Walker Ultimate Quad Connect w bluetooth $60 + tax (FDE, ODG, grey)
Posted by DonArgueWithMe@reddit | gundeals | View on Reddit | 53 comments