iamMess
I fine-tuned Cohere Transcribe to support diarization and timestamps
Posted by iamMess@reddit | LocalLLaMA | View on Reddit | 25 comments
iamMess@reddit (OP)
Deepseek v4 pricing is genuinely silly, did the math and now i am questioning my entire stack
Posted by Skid_gates_99@reddit | LocalLLaMA | View on Reddit | 77 comments
iamMess@reddit
Fine-tuned Qwen3 SLMs (0.6-8B) beat frontier LLMs on narrow tasks
Posted by Jolly-Gazelle-6060@reddit | LocalLLaMA | View on Reddit | 82 comments
iamMess@reddit
Unsloth announces support for finetuning embedding models
Posted by -Cubie-@reddit | LocalLLaMA | View on Reddit | 18 comments
iamMess@reddit
Someone from NVIDIA made a big mistake and uploaded the parent folder of their upcoming model on Hugging Face
Posted by Nunki08@reddit | LocalLLaMA | View on Reddit | 165 comments
iamMess@reddit
ELI5: why does nvidia always sell their consumer gpus below market price?
Posted by GreenTreeAndBlueSky@reddit | LocalLLaMA | View on Reddit | 18 comments
iamMess@reddit
Speculative Decoding is AWESOME with Llama.cpp!
Posted by simracerman@reddit | LocalLLaMA | View on Reddit | 61 comments
iamMess@reddit
How to post-train LLM with tokenizer replacement?
Posted by Objective-Good310@reddit | LocalLLaMA | View on Reddit | 2 comments
iamMess@reddit
Deepinfra sudden 2.5x price hike for llama 3.3 70b instruction turbo. How are others coping with this?
Posted by parmarss@reddit | LocalLLaMA | View on Reddit | 25 comments
iamMess@reddit
[URGENT] Which is a reliable and affordable GPU cluster for hosting custom LLMs for business
Posted by Competitive-Wing1585@reddit | LocalLLaMA | View on Reddit | 36 comments
iamMess@reddit
Tilde AI Releases TildeOpen LLM: An Open-Source Large Language Model with Over 30 Billion Parameters and Support Most European Languages
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 42 comments
iamMess@reddit
Phantom-fragment
Posted by Ok_Horror_8567@reddit | LocalLLaMA | View on Reddit | 13 comments
iamMess@reddit
Local Meeting Notes with Whisper Transcription + Ollama Summaries (Gemma3n, LLaMA, Mistral) - Meetily
Posted by Sorry_Transition_599@reddit | LocalLLaMA | View on Reddit | 9 comments
iamMess@reddit
axolotl vs unsloth [performance and everything]
Posted by Shivacious@reddit | LocalLLaMA | View on Reddit | 26 comments
iamMess@reddit
🚀 OpenAI released their open-weight models!!!
Posted by ResearchCrafty1804@reddit | LocalLLaMA | View on Reddit | 571 comments
iamMess@reddit
The "Leaked" 120B OpenAI Model Is Trained In FP4
Posted by Few_Painter_5588@reddit | LocalLLaMA | View on Reddit | 132 comments
iamMess@reddit
100x faster and 100x cheaper transcription with open models vs proprietary
Posted by crookedstairs@reddit | LocalLLaMA | View on Reddit | 23 comments
iamMess@reddit
Drummer's Mixtral 4x3B v1 - A finetuned clown MoE experiment with Voxtral 3B!
Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 15 comments
iamMess@reddit
Voxtral WebGPU: State-of-the-art audio transcription directly in your browser!
Posted by xenovatech@reddit | LocalLLaMA | View on Reddit | 13 comments
iamMess@reddit
I made a 1000 hour NSFW TTS dataset
Posted by hotroaches4liferz@reddit | LocalLLaMA | View on Reddit | 152 comments
iamMess@reddit
mistralai/Voxtral-Mini-3B-2507 · Hugging Face
Posted by Dark_Fire_12@reddit | LocalLLaMA | View on Reddit | 94 comments
iamMess@reddit
Well, if anyone was waiting for Llama 4 Behemoth, it's gone
Posted by Ok-Elevator5091@reddit | LocalLLaMA | View on Reddit | 154 comments
iamMess@reddit
Here is how we beat ChatGPT at classification with 1 dollar in cloud compute
Posted by iamMess@reddit | LocalLLaMA | View on Reddit | 43 comments
iamMess@reddit (OP)
What finetuning library have you seen success with?
Posted by Responsible-Crew1801@reddit | LocalLLaMA | View on Reddit | 17 comments
iamMess@reddit
Is there appetite for hosting 3b/8b size models at an affordable rate?
Posted by No-Fig-8614@reddit | LocalLLaMA | View on Reddit | 25 comments
iamMess@reddit
Is this the largest "No synthetic data" open weight LLM? (142B)
Posted by AaronFeng47@reddit | LocalLLaMA | View on Reddit | 48 comments
iamMess@reddit
ResembleAI provides safetensors for Chatterbox TTS
Posted by WackyConundrum@reddit | LocalLLaMA | View on Reddit | 16 comments
iamMess@reddit
DeepSeek R1 05/28 performance on five independent benchmarks
Posted by zero0_one1@reddit | LocalLLaMA | View on Reddit | 8 comments
iamMess@reddit
How is Kokoro TTS so good with so few parameters?
Posted by JealousAmoeba@reddit | LocalLLaMA | View on Reddit | 80 comments
iamMess@reddit
B200 vs H100 Training Benchmark: Up to 57% Faster Throughput
Posted by igorsusmelj@reddit | LocalLLaMA | View on Reddit | 18 comments
iamMess@reddit
When you prompt a non-thinking model to think, does it actually improve output?
Posted by Kep0a@reddit | LocalLLaMA | View on Reddit | 42 comments
iamMess@reddit
Orpheus-FastAPI: Local TTS with 8 Voices & Emotion Tags (OpenAI Endpoint Compatible)
Posted by townofsalemfangay@reddit | LocalLLaMA | View on Reddit | 111 comments
iamMess@reddit
How I used entropy and varentropy to detect and remediate hallucinations in LLMs
Posted by AdditionalWeb107@reddit | LocalLLaMA | View on Reddit | 12 comments