pigeon57434

LM Studio finally added support for MTP Speculative Decoding

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 99 comments
People seem to already not care about heretic?

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 35 comments
What happened to Black Forest Labs?

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 53 comments
I know this community doesn't like AA-II much but this is a NUTTY score for Kimi-K2.6 wow

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 1 comments
Heretic has FINALLY defeated GPT-OSS with a new experimental decensoring method called ARA

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 152 comments
Qwen3.5-27B as good as DeepSeek-V3.2 on AA-II (plus some more data)

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 28 comments
Averaged over the 36 text benchmarks provided for Qwen3.5's new small models I have a question

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 8 comments
What's the best open source speech to text model

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 17 comments
why do uncensored and abliterated fine tunes still refuse my request

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 49 comments
Qwen/Qwen3-4B-Thinking-2507

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 6 comments
How do you change the system prompt in the oobabooga text gen webui?

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 5 comments
DeepSeek-R1-0528-Qwen3-8B optimal settings?

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 6 comments
How much does flash attention affect intelligence in reasoning models like QwQ

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 21 comments
QwQ-32B seems to get the same quality final answer as R1 while reasoning much more concisely and efficiently

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 129 comments
can someone explain all the different quant methods

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 30 comments
Qwen team seems sure that their model is better than LiveBench ranks it and demand a rerun with more optimal settings, which is crazy because it already performed really great

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 28 comments
fal announces Flux a new AI image model they claim its reminiscent of Midjourney and its 12B params open weights

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 120 comments
Elon says Grok 2 won't be open sourced until Grok 3 is stable which isn't for a couple MONTHS

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 11 comments
Which do you think will be better: Qwen-3 or Llama-4

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 33 comments
I calculated the effective cost of R1 Vs o1 and here's what I found

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 18 comments
open source model small enough to run on a single 3090 performing WAY better in most benchmarks than the ultra proprietary closed source state of the art model from only a couple months ago

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 22 comments
Whats the current State-of-The-Art for voice cloning?

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 4 comments
Now that Phi-4 has been out for a while what do you think?

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 70 comments
local solutions for government?

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 23 comments
I've been out of the local llm space for a while what do people use to run these models now?

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 21 comments
Has anyone tried Dolphin-Vision yet?

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 51 comments
When do you think 1-bit LLMs will actually kick off if ever?

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 73 comments
What's the best model I can run with my GPU?

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 22 comments
Gemma2-27B outperforms both Qwe2-72B and Llama3-70B on LMSYS. But is it actually better?

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 111 comments
SD3-Medium is on imgsys rankings now it shows 62 ELO better than SDXL base

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 46 comments
SD3 support in A1111?

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 7 comments