QwQ-32B seems to get the same quality final answer as R1 while reasoning much more concisely and efficiently
Posted by pigeon57434@reddit | LocalLLaMA | View on Reddit | 129 comments
I think I will now switch over to using QwQ as my primary reasoning model instead of R1. In all my testing, it gets the same or superior quality answers as R1 does, while having its chain of thought be much more efficient, much more concise, and much more confident. In contrast, R1 feels like a bumbling idiot who happens to be really smart only because he tries every possible solution.
Adam was right when he said not all CoTs are equal, and in this case, I think Qwen trained their model to be more efficient without degrading quality at all.
But I'm curious to hear what everyone here thinks, because I'm sure others are more experienced than I am.
129 Comments
tengo_harambe@reddit
Nice_Grapefruit_7850@reddit
blacktothafuture@reddit
Pyros-SD-Models@reddit
VoidAlchemy@reddit
outworlder@reddit
Conscious_Cut_6144@reddit
Usual_Maximum7673@reddit
Devonance@reddit
Actual-Lecture-1556@reddit
expertsage@reddit
Caffeine_Monster@reddit
Pyros-SD-Models@reddit
power97992@reddit
Caffeine_Monster@reddit
waywardspooky@reddit
Foreign-Beginning-49@reddit
ViperAMD@reddit
AppearanceHeavy6724@reddit
Outrageous-Wait-8895@reddit
ThePixelHunter@reddit
KeyTruth5326@reddit
Hoodfu@reddit
markole@reddit
snowcountry556@reddit
markole@reddit
Dogeboja@reddit
Hoodfu@reddit
nuclearbananana@reddit
No-Mountain3817@reddit
No-Mountain3817@reddit
108er@reddit
pigeon57434@reddit (OP)
DrDisintegrator@reddit
pigeon57434@reddit (OP)
DrDisintegrator@reddit
xor_2@reddit
custodiam99@reddit
GaragePersonal5997@reddit
custodiam99@reddit
Cergorach@reddit
Dangerous_Fix_5526@reddit
Gyramuur@reddit
Dangerous_Fix_5526@reddit
Gyramuur@reddit
Dangerous_Fix_5526@reddit
Gyramuur@reddit
Dangerous_Fix_5526@reddit
Gyramuur@reddit
Gyramuur@reddit
martinerous@reddit
KBMR@reddit
Cergorach@reddit
Cergorach@reddit
AppearanceHeavy6724@reddit
waywardspooky@reddit
Lissanro@reddit
ASYMT0TIC@reddit
frivolousfidget@reddit
Sadman782@reddit
Interesting8547@reddit
frivolousfidget@reddit
knownboyofno@reddit
frivolousfidget@reddit
JohnnyLiverman@reddit
AdOdd4004@reddit
Ichihara02@reddit
pigeon57434@reddit (OP)
Ichihara02@reddit
Xandrmoro@reddit
xpnrt@reddit
Zayaraq@reddit
AnticitizenPrime@reddit
JacketHistorical2321@reddit
pigeon57434@reddit (OP)
Zayaraq@reddit
pigeon57434@reddit (OP)
Zayaraq@reddit
JacketHistorical2321@reddit
Evening_Ad6637@reddit
pigeon57434@reddit (OP)
ihaag@reddit
pigeon57434@reddit (OP)
ihaag@reddit
bitdotben@reddit
Evening_Ad6637@reddit
bitdotben@reddit
Evening_Ad6637@reddit
bitdotben@reddit
pigeon57434@reddit (OP)
bitdotben@reddit
pigeon57434@reddit (OP)
Glum-Atmosphere9248@reddit
BrilliantArmadillo64@reddit
Dogeboja@reddit
Karyo_Ten@reddit
xor_2@reddit
No-Mountain3817@reddit
Su1tz@reddit
Threatening-Silence-@reddit
ankitm1@reddit
XForceForbidden@reddit
NNN_Throwaway2@reddit
pigeon57434@reddit (OP)
NNN_Throwaway2@reddit
cantgetthistowork@reddit
pigeon57434@reddit (OP)
Emport1@reddit
pigeon57434@reddit (OP)
Emport1@reddit
sammcj@reddit
pigeon57434@reddit (OP)
Expensive-Apricot-25@reddit
pigeon57434@reddit (OP)
SuperChewbacca@reddit
pigeon57434@reddit (OP)
BackyardAnarchist@reddit
DrVonSinistro@reddit
ConnectionDry4268@reddit
DrVonSinistro@reddit
cosimoiaia@reddit
llamabott@reddit
Regrets_397@reddit
Cergorach@reddit
shyam667@reddit
M3GaPrincess@reddit
pigeon57434@reddit (OP)
M3GaPrincess@reddit
pigeon57434@reddit (OP)