Best small coding model for completion
Posted by QuinsZouls@reddit | LocalLLaMA | 8 comments
Hi there, I'm looking to use a small model to generate code completions similar to Cursor or Copilot.
Is there a small model capable of generating good results for it?
LaurentPayot@reddit
The Sweep one works great: https://blog.sweep.dev/posts/oss-next-edit
QuinsZouls@reddit (OP)
I tried the 0.5B and it works incredibly well, thank you.
wilo108@reddit
Sweep is a next-edit model, though, not a FIM model, right? I tried the 1.5B version with Continue and I couldn't get it to do line completion. Do you have that working?
LaurentPayot@reddit
Working great for me in Zed as well as in VS Code. I use it like so:
llama-server -m ~/models/sweep-next-edit-1.5b.q8_0.v2.gguf --gpu-layers 29 --ctx-size 0 --temp 0.7 --top-p 0.8 --top-k 20 --repeat-penalty 1.05
wilo108@reddit
I guess it's a template issue or some other Continue integration problem on my side, then. Thanks for the sanity check -- worth me continuing to work on it 👍️
thlandgraf@reddit
Worth noting Sweep is a next-edit model (Cursor-style "predict the next change"), not an FIM model (Copilot-style fill-the-cursor). They look similar in a UI but the integration shape is totally different. For FIM specifically the qwen2.5-coder small variants u/One-Pain6799 mentioned are the usual stack — Continue + the 1.5B works decently for line completion.
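To make the FIM side concrete, here is a rough sketch of what a fill-in-the-middle prompt looks like for the qwen2.5-coder models. The three special tokens are the ones Qwen documents for Qwen2.5-Coder. The helper names and the payload fields (`prompt`, `n_predict`, `temperature`, modeled on llama-server's `/completion` endpoint) are my own assumptions for illustration, not something Continue or any other plugin necessarily sends. A next-edit model like Sweep expects a different prompt shape, so this does not apply to it.

```python
import json

# FIM special tokens documented for Qwen2.5-Coder.
FIM_PREFIX = "<|fim_prefix|>"
FIM_SUFFIX = "<|fim_suffix|>"
FIM_MIDDLE = "<|fim_middle|>"


def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Wrap the text before and after the cursor in FIM tokens.

    The model is expected to generate the missing middle after FIM_MIDDLE.
    """
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"


def build_completion_payload(prefix: str, suffix: str, max_tokens: int = 64) -> str:
    """Build a JSON body for a raw completion request.

    Hypothetical helper: the field names assume a llama-server style
    /completion endpoint; adjust them for whatever server you run.
    """
    body = {
        "prompt": build_fim_prompt(prefix, suffix),
        "n_predict": max_tokens,  # cap the length of the completion
        "temperature": 0.2,       # illustrative value, not a recommendation
    }
    return json.dumps(body)


if __name__ == "__main__":
    before_cursor = "def add(a, b):\n    return "
    after_cursor = "\n\nprint(add(1, 2))\n"
    print(build_completion_payload(before_cursor, after_cursor))
```

The editor plugin's job is basically to split the buffer at the cursor into `prefix` and `suffix`, send something like this, and insert whatever comes back.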
DinoAmino@reddit
https://www.reddit.com/r/LocalLLaMA/s/mcbTyEPOaw
One-Pain6799@reddit
qwen2.5-coder at 1.5b to 3b is good