DeepSeek-R1-0528-Qwen3-8B optimal settings?
Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 6 comments
Does anyone know the optimal settings for this model I'm not sure how sensitive it is I know Qwens last couple of reasoning models have been very sensitive to settings, and this is based on Qwen so
6 Comments
Afraid-Employer-9331@reddit
Admirable_Bag8004@reddit
ab2377@reddit
Admirable_Bag8004@reddit
dreamai87@reddit
StrikeOner@reddit