brahh85
Me visiting this sub
Posted by Scutoidzz@reddit | LocalLLaMA | View on Reddit | 79 comments
Moss tts 1.5 8b Examples. It is the currently best voice cloning model for English as of June 2026
Posted by 9r4n4y@reddit | LocalLLaMA | View on Reddit | 52 comments
brahh85@reddit
I ported NVIDIA Parakeet (speech-to-text) to ggml: same output as NeMo, faster, GGUF-quantized, no Python
Posted by mudler_it@reddit | LocalLLaMA | View on Reddit | 40 comments
brahh85@reddit
Don’t bite me for that question please…
Posted by Thin_Pollution8843@reddit | LocalLLaMA | View on Reddit | 79 comments
brahh85@reddit
One letter to appease them all
Posted by ivari@reddit | LocalLLaMA | View on Reddit | 70 comments
brahh85@reddit
It was fun while it lasted... They're advertising now.
Posted by Local-Cardiologist-5@reddit | LocalLLaMA | View on Reddit | 43 comments
brahh85@reddit
Next year we're getting 0.5T model from Grok
Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 200 comments
brahh85@reddit
TTS Benchmark Comparison (all known TTS up until May 2026)
Posted by UkieTechie@reddit | LocalLLaMA | View on Reddit | 60 comments
brahh85@reddit
Is there any reason for an uncensored model if you have no interest in roleplaying?
Posted by vick2djax@reddit | LocalLLaMA | View on Reddit | 271 comments
brahh85@reddit
GPT 5.5 "secret sauce" is just having the thinking be some stupid caveman mode?
Posted by JustFinishedBSG@reddit | LocalLLaMA | View on Reddit | 154 comments
brahh85@reddit
Rejoice, if Qwen doesn't release any new local model, it's a blessing in disguise
Posted by crowtain@reddit | LocalLLaMA | View on Reddit | 19 comments
brahh85@reddit
I fine-tuned Cohere Transcribe to support diarization and timestamps
Posted by iamMess@reddit | LocalLLaMA | View on Reddit | 25 comments
brahh85@reddit
Qwen has no incentive to release new open source models quickly because the glazing on this sub makes it unnecessary.
Posted by Porespellar@reddit | LocalLLaMA | View on Reddit | 41 comments
brahh85@reddit
Qwen3.7 Max Preview hitting Arena 13 is a bigger signal than the rank
Posted by Top-Cardiologist1011@reddit | LocalLLaMA | View on Reddit | 3 comments
brahh85@reddit
Re. what ever happened to Cohere’s Command-A series of models?
Posted by nick_frosst@reddit | LocalLLaMA | View on Reddit | 102 comments
brahh85@reddit
Translate long subtitle files
Posted by Synchronauto@reddit | LocalLLaMA | View on Reddit | 13 comments
brahh85@reddit
Why use Quants other than Unsloth
Posted by FeiX7@reddit | LocalLLaMA | View on Reddit | 41 comments
brahh85@reddit
Weird performance depending on quant
Posted by WhiskyAKM@reddit | LocalLLaMA | View on Reddit | 8 comments
brahh85@reddit
GitHub - pwilkin/openmoss: OpenMOSS pure C++ pipeline based on GGML
Posted by ilintar@reddit | LocalLLaMA | View on Reddit | 8 comments
brahh85@reddit
MI50s Qwen 3.6 27B @52.8 tps TG @1569 tps PP (no MTP, no Quant)
Posted by ai-infos@reddit | LocalLLaMA | View on Reddit | 80 comments
brahh85@reddit
Is SillyTavern the most underrated frontend? Could it be an interface with potential trapped in a silly name? Or is it just for a niche?
Posted by Spiderboyz1@reddit | LocalLLaMA | View on Reddit | 82 comments
brahh85@reddit
More Qwen3.6-27B MTP success but on dual Mi50s
Posted by legit_split_@reddit | LocalLLaMA | View on Reddit | 33 comments
brahh85@reddit
vLLM ROCm has been added to Lemonade as an experimental backend
Posted by jfowers_amd@reddit | LocalLLaMA | View on Reddit | 93 comments
brahh85@reddit
Unpopular Opinion: The DGX Spark Forum community of devs is talented AF and will make the crippled hardware a success through their sheer force of will.
Posted by Porespellar@reddit | LocalLLaMA | View on Reddit | 253 comments
brahh85@reddit
ZAYA1-74B-Preview: Scaling Pretraining on AMD
Posted by TKGaming_11@reddit | LocalLLaMA | View on Reddit | 34 comments
brahh85@reddit
DeepSeek V4 being 17x cheaper got me to actually measure what I send to cloud vs what I could run locally. the results are stupid.
Posted by spencer_kw@reddit | LocalLLaMA | View on Reddit | 166 comments
brahh85@reddit
DeepSeek V4 being 17x cheaper got me to actually measure what I send to cloud vs what I could run locally. the results are stupid.
Posted by spencer_kw@reddit | LocalLLaMA | View on Reddit | 166 comments
brahh85@reddit
Common and Obscure Models and Ways to Find Them [ Human Written ]
Posted by iMakeSense@reddit | LocalLLaMA | View on Reddit | 20 comments
brahh85@reddit
Common and Obscure Models and Ways to Find Them [ Human Written ]
Posted by iMakeSense@reddit | LocalLLaMA | View on Reddit | 20 comments
brahh85@reddit
Common and Obscure Models and Ways to Find Them [ Human Written ]
Posted by iMakeSense@reddit | LocalLLaMA | View on Reddit | 20 comments
brahh85@reddit
DeepSeek V4 being 17x cheaper got me to actually measure what I send to cloud vs what I could run locally. the results are stupid.
Posted by spencer_kw@reddit | LocalLLaMA | View on Reddit | 166 comments
brahh85@reddit
Open Weights Models Hall of Fame
Posted by Equivalent_Job_2257@reddit | LocalLLaMA | View on Reddit | 32 comments
brahh85@reddit
Kv cache quantization: ignorance, or malice?
Posted by wombweed@reddit | LocalLLaMA | View on Reddit | 94 comments
brahh85@reddit
GPT 5.5 just leaked its chain of thought to me in codex, and it looks like an idea from 5 months ago in this sub.
Posted by Homeschooled316@reddit | LocalLLaMA | View on Reddit | 61 comments
brahh85@reddit
Help with MI50 and llama.cpp/ROCm 7.2
Posted by WhatererBlah555@reddit | LocalLLaMA | View on Reddit | 7 comments
brahh85@reddit
GBNF grammar tweak for faster Qwen3.6 35B-A3B and Qwen3.6 27B
Posted by Holiday_Purpose_3166@reddit | LocalLLaMA | View on Reddit | 24 comments
brahh85@reddit
Deepseek V4 AGI comfirmed
Posted by Swimming-Sky-7025@reddit | LocalLLaMA | View on Reddit | 186 comments
brahh85@reddit
Forgive my ignorance but how is a 27B model better than 397B?
Posted by No_Conversation9561@reddit | LocalLLaMA | View on Reddit | 286 comments
brahh85@reddit
duda sobre descargarse IA de forma local
Posted by Individual-Party1661@reddit | LocalLLaMA | View on Reddit | 6 comments
brahh85@reddit
duda sobre descargarse IA de forma local
Posted by Individual-Party1661@reddit | LocalLLaMA | View on Reddit | 6 comments
brahh85@reddit
duda sobre descargarse IA de forma local
Posted by Individual-Party1661@reddit | LocalLLaMA | View on Reddit | 6 comments
brahh85@reddit
Speculative Decoding works great for Gemma 4 31B with E2B draft (+29% avg, +50% on code)
Posted by PerceptionGrouchy187@reddit | LocalLLaMA | View on Reddit | 117 comments
brahh85@reddit
32 gb or 64 gb of ddr5
Posted by Worried-Register4465@reddit | LocalLLaMA | View on Reddit | 10 comments
brahh85@reddit
32 gb or 64 gb of ddr5
Posted by Worried-Register4465@reddit | LocalLLaMA | View on Reddit | 10 comments
brahh85@reddit
FlashAttention (FA1–FA4) in PyTorch - educational implementations focused on algorithmic differences
Posted by shreyansh26@reddit | LocalLLaMA | View on Reddit | 1 comments
brahh85@reddit
32 gb or 64 gb of ddr5
Posted by Worried-Register4465@reddit | LocalLLaMA | View on Reddit | 10 comments
brahh85@reddit
FT - China’s Alibaba shifts towards revenue over open-source AI
Posted by LegacyRemaster@reddit | LocalLLaMA | View on Reddit | 132 comments
brahh85@reddit
Hugging Face launches a new repo type: Kernels
Posted by clem59480@reddit | LocalLLaMA | View on Reddit | 24 comments
brahh85@reddit
Hugging Face launches a new repo type: Kernels
Posted by clem59480@reddit | LocalLLaMA | View on Reddit | 24 comments
brahh85@reddit
EXAONE 4.5 released
Posted by Secure_Smoke_4280@reddit | LocalLLaMA | View on Reddit | 42 comments