Upcoming Coding Models?
Posted by pmttyji@reddit | LocalLLaMA | 14 comments
Based on past threads in this sub, I see that the coding models below are coming.
- Qwen3 Coder - Recent thread
- Deep Cogito - Preview models there
- Polaris - Preview models there
- Granite - Are they releasing any new coding models? Preview (general) models are there for the upcoming version 4. How good are their existing models?
What other coding models are coming apart from the ones above?
Ordinary_Mud7430@reddit
CodeGemma?
ttkciar@reddit
Isn't that what the Bifrost fine-tune is supposed to be? I keep meaning to evaluate it, but can't seem to get around to doing it.
tempetemplar@reddit
Very excited about Qwen3 Coder!
jedisct1@reddit
We're all dreaming of an open model that could replace a Claude subscription.
Namra_7@reddit
😂
Dentuam@reddit
we all should have dreams. 😂
RobotRobotWhatDoUSee@reddit
I've been thinking a lot about this lately; it's maybe 1/3 of my motivation for my earlier post about DIY MoE models.
I've been doing a lot of reading since that post and, at least conceptually, feel like I've made a lot of progress.
Life has been extremely busy lately and implementation progress has been slow, but if there is enough interest I'll post an update on what I've learned in the meantime.
My first practical step will probably be to train a small 3B or 4B coding model, which, funnily enough, I see was also asked about on the front page (of /r/localllama) today.
One other model you might add to your list: NVIDIA's Llama 3.1 Nemotron Nano 4B
emprahsFury@reddit
JetBrains released their LLM, Mellum, on HF. It's a 4B FIM (fill-in-the-middle) model.
jupiterbjy@reddit
Didn't even know they made it, lemme leave a link and save others a search:
https://huggingface.co/JetBrains/Mellum-4b-base
cantgetthistowork@reddit
Devstral large R2
fancyrocket@reddit
I too want to know this
ProfessionUpbeat4500@reddit
Baidu just released something...
I just follow Hugging Face and the CEO on LinkedIn. Easy to keep track of all the big news.
pmttyji@reddit (OP)
So far no small-size models from them. 0.3B ... then 21B ... and so on.
Steuern_Runter@reddit
Those are not coding models.