Best small coding model for completion

Posted by QuinsZouls@reddit | LocalLLaMA | View on Reddit | 8 comments

Hi there, I'm looking to use a small model to generate code completions simular to cursor or copilot

There's a small model capable of generating good results for ti?

[-]

LaurentPayot@reddit

The Sweep one works great: https://blog.sweep.dev/posts/oss-next-edit

[-]

QuinsZouls@reddit (OP)

I tried the 0.5b and works incredibly well, thank you

[-]

wilo108@reddit

Sweep is a next-edit model, though, not a FIM model, right? I tried the the 1.5B version with Continue and I couldn't get it to do line completion. Do you have that working?

[-]

LaurentPayot@reddit

Working great for me in Zed as well as in VS Code. I use it like so:

llama-server -m \~/models/sweep-next-edit-1.5b.q8_0.v2.gguf --gpu-layers 29 --ctx-size 0 --temp 0.7 --top-p 0.8 --top-k 20 --repeat-penalty 1.05

[-]

wilo108@reddit

I guess it's template issue or some other Continue integration problem on my side, then. Thanks for the sanity check -- worth me continuing to work on it 👍️

[-]

Worth noting Sweep is a next-edit model (Cursor-style "predict the next change"), not an FIM model (Copilot-style fill-the-cursor). They look similar in a UI but the integration shape is totally different. For FIM specifically the qwen2.5-coder small variants u/One-Pain6799 mentioned are the usual stack — Continue + the 1.5B works decently for line completion.

[-]

DinoAmino@reddit

https://www.reddit.com/r/LocalLLaMA/s/mcbTyEPOaw

[-]

One-Pain6799@reddit

qwen2.5-coder:1.5b - 3b is good