baidu/ERNIE-4.5-VL-28B-A3B-Thinking released. Curious case..

Posted by PaceZealousideal6091@reddit | LocalLLaMA | View on Reddit | 19 comments

It seems Baidu has released the "thinking" variant if their vl model silently. The earlier model was supposedly hybrid, supporting both "thinking" and "non-thinking". The model card says that they have introduced something called "thinking with images" without explaining what it is. They have one put a small hardly visible graph comparing it with gemini 2.5 pro and gpt-5 high in various benchmarks . If you squint your eye enough, then you'll see they claim using the graph that this model keeps up or beat them good in many of the benchmarks. Surely benchmaxxed. Its too good to believe. Has anyone tried it? The previous ernie versions have been decent. It might be worth testing it. Does anyone have any idea how is this "thinking" variant different?