What am I missing about samplers?

Posted by TacticalRock@reddit | LocalLLaMA | View on Reddit | 10 comments

Hi all,

With the recent release of models that require temp = 1, top_k = N, and top_p = 0.95, I'm wondering why labs actually prefer those truncation samplers over just min_p?

As far as I understand, min_p isn't supported everywhere, and they're just following industry standards with top_k and top_p, but if one replaces those two truncation samplers with just min_p, is there a real reason not to? Let's say, for Qwen 3.6, instead of top_k of 20 and top_p of 0.95, I just do min_p of 0.05-0.10, is there a mechanical/structural or analytical reason not to?

I know I can just stick to the given samplers and call it a day, but I'm just curious, and I like the dynamic nature of min_p :)

Thanks!