The Future of Free & Local Models: Training Co-Ops? Professional Orgs? Churches?

Posted by liftheavyscheisse@reddit | LocalLLaMA | View on Reddit | 18 comments

I'm relatively new to this forum, so forgive me if this discussion has been had ad nauseam already. In a hypothetical future where all the frontier labs stop releasing open-weight models, I don't think the open community would take it lying down. With the combined compute of the community, it seems like it should be possible to train frontier(ish) \~30B models (albeit with significantly less efficiency and speed than the labs). What shape could this take? It seems plausible to me that co-ops would form with people volunteering their compute, contractually bound to run a specific training algorithm on specific data, and then averaging their subresults to update the model. An inspector could occasionally spot-check volunteers' contributions to ensure they're following the recipe, perhaps running the same training regimen in parallel to compute the expected subresult for comparison. Trusted co-op leaders would decide the architecture, manage data sanitation, and so on. Frontier labs require massive bandwidth to synchronize epochs throughout the cluster, but I suspect the space of possibility hasn't been fully explored for training multiple epochs before synchronizing. Another possibility would be that people pool together money to train in the cloud. Maybe folks will run Kickstarters to train a model with an advertised recipe, and host the model exclusively for backers in the cloud for several months before releasing it openly. It also seems plausible that professional and ideological organizations would begin to train their own models. Custom models seem almost inevitable for religious denominations. One thing we could trust about models made by churches—they will always be multilingual and free, if not open, to spread the gospel. Models trained in Christian Scholasticism might be interesting starting points for tuning, as they should well-honed in the imprecise art of logical deduction in natural language. Predictions are hard, especially about the future, so I'm spitballing. What are your thoughts?