Qwen Code CLI has generous FREE Usage option
Posted by NoobMLDude@reddit | LocalLLaMA | View on Reddit | 104 comments
For those who didnt know, Qwen-Code which is a clone of Gemini CLI has a good Free usage plan: - 2,000 requests per day with no token limits - 60 requests per minute rate limit It allows us to use Qwen3Coder for FREE.
Made a small video to showcase how to setup and use here: https://youtu.be/M6ubLFqL-OA
YandiDesign@reddit
sad
Appropriate_Hand8945@reddit
sadly its over
NoobMLDude@reddit (OP)
:( Yes all good and free things must come to an end.
Unfortunately the Free Tier of QwenCode had been first reduced but now is discontinued. :-(
From Qwen-Code repo:
tidoo420@reddit
you jinxed it :(
NoobMLDude@reddit (OP)
:( Sorry about it. Unfortunately the Free Tier of QwenCode had been first reduced but now is discontinued. :-(
From Qwen-Code repo:
It had a good run. Used it a lot in the past year.
lezohar@reddit
Have they reduced the limit of 2000 daily requests? sometimes im getting exceeded quota when using qwen cli with KiloCode and i dont event use more than 50 daily requests
Commercial_Cup692@reddit
ứng dụng này như thế nào
poorfririgh@reddit
I think it's even more generous than they say, the daily limit seems to reset per session for me.
snaga2000@reddit
How can one know the limit left for the day?
Feeling-Tomato-6126@reddit
actually there is no limit, when I open new session for the same project it starts from zero
AfricanType@reddit
They switch you down to a dumber model as they throttle you.
Feeling-Tomato-6126@reddit
yes, you are right. After a while Qwen produces a bug and it kept trying without able to fix it, then I used co-pilot to solve it
NoobMLDude@reddit (OP)
nice I didnt know it resets per sesssion. even better then :D
zemaj-com@reddit
Qwen CLI’s free tier is great. If you want to go a step further, check out Code (github.com/just‑every/code). It wraps Qwen models but also adds multi‑agent orchestration (so you can have a planner and executor collaborate), a built‑in diff viewer and browser integration. Because it runs locally, there’s no API cost and you can still enjoy the generous Qwen token limits.
UseDirect5224@reddit
Doesn't seem to exist anymore?
plasticpippo@reddit
you can use it with kilocode inside vscode. very very powerful indeed!
deleteme123@reddit
URL seems wrong
pixel_creatrice@reddit
Works for me: https://github.com/just-every/code
bitmoji@reddit
Is this node based for some horrible reason
dizvyz@reddit
All of the CLIs are node based.
zemaj-com@reddit
You're right that Code (and KILO) currently rely on Node. The rationale is that Node provides a cross‑platform runtime and tools like `npx` and `npm` make it trivial to install and update (you can run it with `npx -y u/just-every/code` or install it globally with `npm install -g u/just-every/code`). Using Node means we can ship one package that works on most systems without compiling separate binaries.
That said, most of the logic is language‑agnostic and we're open to ports in other languages. If you prefer Python, Rust or another stack, feel free to open an issue or PR—we're actively exploring alternative front‑ends. And if you'd rather not install Node globally, using `npx` gives you a clean, throwaway environment each time you run the CLI.
amokerajvosa@reddit
For me also displayed 404. But after few refresh it worked.
camh-@reddit
The hyphen in that text is not an ASCII hyphen. It is some unicode hyphen. Replace it and copy and past will work.
Good_Tooth2514@reddit
Is the multi-agent capability for creating instances of the configured agent itself or for integrating more than one provider? Could you explain this to me? Thanks!
zemaj-com@reddit
Great question! In Code's multi‑agent mode you're not just spinning up clones of the same model; you're orchestrating separate agents that can use different providers or roles. For example, you might have a Qwen coding assistant, a planner agent based on GPT‑4, and a quality‑checker agent using another model. The CLI handles the plumbing so the agents can communicate and share context.
You can certainly run multiple instances of the same configured model if you just want concurrency, but the real power comes from mixing and matching different LLMs or functions into a cooperative workflow. Each agent runs in its own session/terminal and the built‑in multi‑agent commands coordinate them via a planner/executor architecture. That way you can integrate multiple providers in one cohesive project rather than being limited to clones of one agent.
Lower_Confidence8390@reddit
Is there any tutorial on how to set it uo with qwen ?
zemaj-com@reddit
Definitely! It's actually pretty simple to get Qwen's CLI and the Code CLI working together. For Qwen Code CLI you'll want to install the package (e.g. via `pip install qwen-code`). Then head over to qwen.ai or DashScope to create a free account and grab your DashScope API key. Run `qwen-code login` or export `DASHSCOPE_API_KEY=` so the CLI can authenticate.
If you want to use Qwen inside the Code CLI, install Code (either via `npx -y u/just-every/code` or `pip install code-cli`) and set the provider to Qwen using `--provider qwen`, for example:
```
code chat --provider qwen
```
The Code CLI has a `code providers` command that walks you through configuring keys so you don’t have to pass them each time. Since Code orchestrates multiple agents, you can also combine Qwen with OpenAI, Claude, local LLMs and the built-in browser integration. The README in the repository goes into more detail, but the basic steps are: install Qwen Code CLI, get your API key, export it (or run the login command) and call Code with the `--provider qwen` flag. Feel free to reach out if you hit any snags!
Lower_Confidence8390@reddit
Thank I will try this out tomorrow!
The api key also has the 20000 token limit etc ? I signed in to the cli with the qwen.ai (oauth) so I didn't need a Key
If the key limit is free and the same, it's possible to have a second account and a second key right? If I endup stuck by the limit someday
Lower_Confidence8390@reddit
The provider argument doesn't exist
coder chat --provider qwen
error: unexpected argument '--provider' found
tip: a similar argument exists: '--order'
Usage: code --order
For more information, try '--help'.
Lower_Confidence8390@reddit
I don't see how to use another model with my api key I got from here
https://openrouter.ai/qwen/qwen3-coder:free
nullnuller@reddit
How does it work with qwen-cli Is there any documentation?
Redox_ahmii@reddit
I don't think this does work with Qwen.
Finanzamt_Endgegner@reddit
this we need qwen oauth support!
metigue@reddit
I really wish there was a benchmark to compare all these frameworks. I've seen like 8 in the last week and they all look great.
NoobMLDude@reddit (OP)
this is great. thanks for sharing.
papaloukas@reddit
No Cline o Kilo Code are required for using in Visual Studio Code since there is an official extension: https://marketplace.visualstudio.com/items?itemName=qwenlm.qwen-code-vscode-ide-companion
NoobMLDude@reddit (OP)
Thanks for sharing. Right now it seems to be in Preview.
Have you tried it? What ca it do differently than KILO or Cline. kILO and Cline allow me to be model/provider agnostic and choose models based on the task.
EvinElias@reddit
I have managed to copy the source code from KiloCode and integrated it into Cline to add Qwen Code in API Provider settings, and it works well. I personally like Cline due to its current robust abilities compared to the rest. I completely vibe coded it using QODER. It's really good.
SerGo-emailFreela@reddit
как ты это сделал? можно подробнее для новичка?
SnooBreakthroughs537@reddit
Any ideas on how to integrate with crush (code mentioned above). I can try on my own, but would love to see pointers.
NoobMLDude@reddit (OP)
Cool stuff, I didnt get around to trying Cline with QwenCode.
Yes I like Cline too. I was using the newly released Free SONIC model in Cline.
That model is also Free now if you wish to try it out. https://youtu.be/D2GggzmAh-E
EvinElias@reddit
They accepted my pull request and merged. Even roo code did the same. So, it will soon reflect on both cline and room code in the next update.
CharacterBorn6421@reddit
qwen is not able to make any changes to the file and giving this error "Error
Roo tried to use apply_diff without value for required parameter 'path'. Retrying...
Roo is having trouble...
This may indicate a failure in the model's thought process or inability to use a tool properly, which can be mitigated with some user guidance (e.g. "Try breaking down the task into smaller steps")." do you also face this problem in roo code?
NoobMLDude@reddit (OP)
Awesome. Good work
EvinElias@reddit
Thanks 🙏
EvinElias@reddit
Yes, I have made a pull request there. I hope they accecpt and merge. Fingers crossed. 🤞
Bassiette03@reddit
how to install it?
NoobMLDude@reddit (OP)
The video links above walk you through the installation in detail.
Cultural-Arugula-894@reddit
It's taking too much time for me. I've been vibe coding for long hours now. This might the reason I guess. Can we use this in an IDE somehow with the 2000 per day free tier? I saw that we have to pay for the API if we want to use it in the chat interface?
Have you tried GLM 4.5, they also have $3 monthly plans and is affordable. Please let me know.
NoobMLDude@reddit (OP)
Wow that’s long. The time taken depends on the complexity of the task and also how many turns it needs to take to get the job done, fix mistakes, get it running etc.
Yes you can use Qwen Code free 2000 requests in VsCode using extensions like Cline / Kilo Code. Here’s a video showing how to set it up the Free QwenCode usage in Kilo Code: https://youtu.be/z_ks6Li1D5M
I have tried GLM and it’s also a good model. Just much bigger to run locally.
If it looks better for your tasks and the price seems reasonable, feel free to use it by all means.
but I try not to promote tools that are not Free just so that it is accessible for everyone.
Cultural-Arugula-894@reddit
Thanks for sharing the video. I was talking about the $3 and $15 monthly plans of Z.ai, which offer GLM 4.5 (not running it locally). Also, is there any way to use this model inside VS Code without the GLM API?
I also found out that there are ways to use Qwen code:
1. The video method you shared. (We get Qwen3 coder plus model with 1M context window)
2. Adding API key from Open Router and selecting the model Qwen3-Coder: Free, which is actually the 480B A35B. (with 262K context window).
ImFanOfRed@reddit
Using with Kilo often got :
Edit Unsuccessful
@ Kilo Code is having trouble...
This may indicate a failure in the model's thought process or inability to use a tool properly, which can be mitigated with some user guidance (e.g. "Try breaking down the task into smaller steps").
Anyone same here?
NoobMLDude@reddit (OP)
Haven’t seen this yet but your root cause analysis sounds correct. Some special tool tokens are not generated correctly for edit to be successful.
Previous_Foot_5328@reddit
Qoder is on r/artificial with an AMA! Come join the conversation and ask us anything!
https://www.reddit.com/r/artificial/comments/1n6lpl8/ama_with_qoder_team_an_agentic_coding_platform/
NoobMLDude@reddit (OP)
Thanks for the notification Ben.
Looking forward to the AMA.
I also did a review of Qoder Agent here if you are interested to check it out / bump:
https://youtu.be/4Zipfp4qdV4
SnooBreakthroughs537@reddit
any idea how to set it up in Crush?
NoobMLDude@reddit (OP)
Crush already has a FREE Qwen Code through the OpenRouter provider:
https://openrouter.ai/qwen/qwen3-coder:free
Look for Qwen free under model selection in Crush.
SnooBreakthroughs537@reddit
yeah. I am using that. BUt that's capped at a much lower level I think.
dirtychriz@reddit
qwen3-coder in qwen code is absolutely insane, and they giving it out for free. This won't last long, I believe.
NoobMLDude@reddit (OP)
Yes it IS surprising. It’s 2000 requests per day and 60 requests per minute. That’s enough to get work done, but not enough for people to abuse it by running 24/7 for random stuff. I’m trying to use it as long as the free tier lasts to get some stuff done using it .
1337vi@reddit
It’s pretty great but qwen code still has some issues. For certain projects that uses vite or other dev platform. It hangs when running ‘npm run dev’ which is such a let down for a big project like this.
NoobMLDude@reddit (OP)
Yes there definitely is issues I faced around running bash commands (also shown in the video) like starting a HTTP server.
Worried_Goat_8604@reddit
Guys dont forget someone started this idea of using qwen cli with ROO code first and it turned into a huge discussion with lots of support but it was kilo code that first introduced this is just 2 days so im pretty sure now they will regularly update making it work even more seamlessly. Remember the hard part of even seeing the feature of using qwen cli in kilo is dont now its just the task of making it work more seamlessly.
WorthDetective5912@reddit
Great alternative to cursor! Using it with KiloCode in VSC. But is there anyway to prevent that the changes are applied directly ? I only see the diff (red/green code colors) for a second and then the changes get applied instantly.
NoobMLDude@reddit (OP)
Yes you can. In the KILO CODE Settings you'll find a tab for "Auto-Approve". In there you can select which steps you want to auto-approve. All others should ask you for explicit approval. See screenshot below.
WorthDetective5912@reddit
but when i deactivate it i have to confirm every single action like reading files folders etc. i only want to see the difference in code side by side that the ai generates..
RageshAntony@reddit
Can I use this via Kilo Code without context length restrictions?
NoobMLDude@reddit (OP)
QwenCode already comes with a 1 million token context length.
If that is not sufficient for your use, KILO Code allows you to compress the Context anytime. So you can press that button to condense the context. See screenshot.
megadonkeyx@reddit
Doesn't it use the 30b moe model by default?
Equivalent_Cut_5845@reddit
And so does gemini-cli. For me the only appeal of qwen code is connection local/openai compatible models. If you're using qwen code just to use another api model then might as well use gemini cli.
Danmoreng@reddit
I did that. The Google free tier allows 100 requests/day. That allows for 1-2 hours of coding. Qwens 2.000 requests/day are more than enough for a full day of coding.
klam997@reddit
The google auth method where you use a free account is 1000 requests/day but I think they might limit you on total 2.5 pro use but you might be forced to switch to 2.5 flash.
Still.. free is free. I'm thankful for any company that still provides free options for people.
Danmoreng@reddit
Flash is unusable. It is 100 req/day of 2.5 pro with a free api key you can generate in google ai studio, the direct account login is much less.
lszb@reddit
Then I'd rather use cursor cli.
robberviet@reddit
Qwen Coder via qwen-code is great imo.
KoichiSP@reddit
Cool! Is it the 30B model, or the big one? By using Qwen's auth in Europe
NoobMLDude@reddit (OP)
They have both named as Plus and Flash.
korino11@reddit
Sry i cannot understand HOW to get API keys for Free useing?!?
dizvyz@reddit
My experience from a few second ago before I forget.
Go to chat.qwen.ai , click signup, i picked google login for simplicity. Then start the terminal app by typing qwen in the terminal, pick oauth from the given options and look at the web browser window that opens.
korino11@reddit
I have found instruction on a github. Than you and sry that i didnt get enough atention in details.
dizvyz@reddit
no worries. as long as it works.
NoobMLDude@reddit (OP)
The video shows you all the steps from start. The GitHub README mentions it as well.
dizvyz@reddit
You would create an account on qwen.ai then go from there I supposed. I am about to do that now so if you have follow up questions, let me know.
mohamed_stifi@reddit
Gpt oss for Arabic
lodott1@reddit
Great PSA - thanks for sharing! Say, are there any examples of how capable these types of tools are? Has anything substantial been built yet, without the need for heavy review/refactoring? Gpt5 left me partially impressed, partially wanting for more consistency and functionality.
badhiyahai@reddit
You still need to review for sure, sometimes they be deleting the databases.
NoobMLDude@reddit (OP)
Don't give full permissions to execute ALL commands.
Always read the commands before you allow it to execute.
badhiyahai@reddit
Yes, that's why I said you need to review for sure
NoobMLDude@reddit (OP)
Yes I was replying to the original comment, accidentally replied to yours.
NoobMLDude@reddit (OP)
you are welcome. happy to share.
Regarding quality (as I said above):
> "It is always a trade-off between can you get 90% work done with a Free service VS can you justify a $200 subscription of Claude or GPT5. For me the gain from even a $20 subscription when compared to these free alternatives is not much.
> So I stick to the free alternatives for now."
25th__Baam@reddit
I am seeing <th tokens when using in Cline or Kilo code. How can I resolve this.
NoobMLDude@reddit (OP)
Just got it working in KILO Code.
https://youtu.be/z_ks6Li1D5M
I faced similar issues when using the Flash model. However the Qwen_code_Plus works fine.
25th__Baam@reddit
Thanks Man! 🙏🙏
Fantastic_Spite_5570@reddit
Is it good though
NoobMLDude@reddit (OP)
how would you define good?
It is always a trade-off between can you get 90% work done with a Free service VS can you justify a $200 subscription of Claude or GPT5.
For me the gain from even a $20 subscription when compared to these free alternatives is not much.
So I stick to the free alternatives for now.
Fantastic_Spite_5570@reddit
Getting 90% for free is great. Didn’t know it was that good
NoobMLDude@reddit (OP)
Of course 90% is also depending on the task you wish to accomplish. If you wish to create apps it is decent, but if you wish to do complex data science, I'm not sure.
So you need to try it for your tasks and judge it.
"It's good for Task X" does not guarantee that "It will be good for Task Y"
jonasaba@reddit
Can I use it with vscode somehow?
NoobMLDude@reddit (OP)
Yes you can. You can use KILO Code inside VSCode.
Here's a video to set it up: https://youtu.be/z_ks6Li1D5MUse the qwen-coder-plus model . the Flash model has issues with tool calling in KILO Code.
lordpuddingcup@reddit
Really wish they’d incorporate the qwen cli auth into roocode
NoobMLDude@reddit (OP)
Just got it working in KILO Code.
https://youtu.be/z_ks6Li1D5M
I'm guessing RooCode could also have a similar model selection for Qwen-Code or using Oauth creds.
You can find the QwenCode oAuth creds in `\~/.qwem/oauth_creds.json`
lostnuclues@reddit
I run out of 60 request in a minute limit. Is there a way to slow that, by adding extra delay in each request?
Normal-Ad-7114@reddit
No LOcaL nO cARe incoming
JoMa4@reddit
Free in China…