DeepSeek-R1-0528-Qwen3-8B optimal settings?

Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 6 comments

Does anyone know the optimal settings for this model I'm not sure how sensitive it is I know Qwens last couple of reasoning models have been very sensitive to settings, and this is based on Qwen so