One year’s benchmark progress: comparing Sonnet 3.5 with open weight 2025 non-thinking models
Posted by nomorebuttsplz@reddit | LocalLLaMA | View on Reddit | 36 comments
AI did not hit a plateau, at least in benchmarks. Pretty impressive with one year’s hindsight. Of course benchmarks aren’t everything. They aren’t nothing either.
36 Comments
AppearanceHeavy6724@reddit
nomorebuttsplz@reddit (OP)
AppearanceHeavy6724@reddit
nomorebuttsplz@reddit (OP)
AppearanceHeavy6724@reddit
nomorebuttsplz@reddit (OP)
AppearanceHeavy6724@reddit
perelmanych@reddit
AppearanceHeavy6724@reddit
perelmanych@reddit
UnionCounty22@reddit
nomorebuttsplz@reddit (OP)
Prestigious_Scene971@reddit
jovialfaction@reddit
Mkengine@reddit
jovialfaction@reddit
AppearanceHeavy6724@reddit
nuclearbananana@reddit
nomorebuttsplz@reddit (OP)
nuclearbananana@reddit
AppearanceHeavy6724@reddit
nuclearbananana@reddit
AppearanceHeavy6724@reddit
lly0571@reddit
a_beautiful_rhind@reddit
TheRealMasonMac@reddit
a_beautiful_rhind@reddit
Down_The_Rabbithole@reddit
TheRealMasonMac@reddit
nomorebuttsplz@reddit (OP)
mindful_maven_25@reddit
noage@reddit
nomorebuttsplz@reddit (OP)
Traditional-Gap-3313@reddit
nomorebuttsplz@reddit (OP)
nomorebuttsplz@reddit (OP)