Qwen/WebWorld 32B/14B/8B (Qwen3 finetune)
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 10 comments
**WebWorld** is a large-scale **open-web world model** series for training and evaluating web agents. It is trained on **1M+ real-world web interaction trajectories** via a scalable hierarchical data pipeline, supporting:
* **Long-horizon simulation** (30+ steps)
* **Multi-format state representations**: A11y Tree, HTML, XML, Markdown, and natural language
* **CoT-activated reasoning** for transition prediction
* **Cross-domain generalization** to code, GUI, and game environments
Agents trained on WebWorld-synthesized trajectories achieve **+9.9% on MiniWob++** and **+10.9% on WebArena**. When used for inference-time lookahead search, WebWorld **outperforms GPT-5** as a world model.
[https://huggingface.co/Qwen/WebWorld-32B](https://huggingface.co/Qwen/WebWorld-32B)
[https://huggingface.co/Qwen/WebWorld-14B](https://huggingface.co/Qwen/WebWorld-14B)
[https://huggingface.co/Qwen/WebWorld-8B](https://huggingface.co/Qwen/WebWorld-8B)
10 Comments
Nepherpitu@reddit
SeyAssociation38@reddit
sudeposutemizligi@reddit
crantob@reddit
sudeposutemizligi@reddit
Foreign_Risk_2031@reddit
cmndr_spanky@reddit
Psyko38@reddit
Silver-Champion-4846@reddit
MadPelmewka@reddit