Qwen Code CLI has generous FREE Usage option

[-]

NoobMLDude@reddit (OP)

:( Yes all good and free things must come to an end.
Unfortunately the Free Tier of QwenCode had been first reduced but now is discontinued. :-(

From Qwen-Code repo:

2026-04-13: Qwen OAuth free tier policy update: daily quota adjusted to 100 requests/day (from 1,000).

2026-04-15: Qwen OAuth free tier has been discontinued. To continue using Qwen Code, switch to Alibaba Cloud Coding Plan, OpenRouter, Fireworks AI, or bring your own API key. Run qwen auth to configure.

[-]

NoobMLDude@reddit (OP)

:( Sorry about it. Unfortunately the Free Tier of QwenCode had been first reduced but now is discontinued. :-(

From Qwen-Code repo:

2026-04-13: Qwen OAuth free tier policy update: daily quota adjusted to 100 requests/day (from 1,000).

2026-04-15: Qwen OAuth free tier has been discontinued. To continue using Qwen Code, switch to Alibaba Cloud Coding Plan, OpenRouter, Fireworks AI, or bring your own API key. Run qwen auth to configure.

It had a good run. Used it a lot in the past year.

[-]

lezohar@reddit

Have they reduced the limit of 2000 daily requests? sometimes im getting exceeded quota when using qwen cli with KiloCode and i dont event use more than 50 daily requests

[-]

poorfririgh@reddit

I think it's even more generous than they say, the daily limit seems to reset per session for me.

[-]

snaga2000@reddit

How can one know the limit left for the day?

[-]

Feeling-Tomato-6126@reddit

actually there is no limit, when I open new session for the same project it starts from zero

[-]

AfricanType@reddit

They switch you down to a dumber model as they throttle you.

[-]

Feeling-Tomato-6126@reddit

yes, you are right. After a while Qwen produces a bug and it kept trying without able to fix it, then I used co-pilot to solve it

[-]

NoobMLDude@reddit (OP)

nice I didnt know it resets per sesssion. even better then :D

[-]

Qwen CLI’s free tier is great. If you want to go a step further, check out Code (github.com/just‑every/code). It wraps Qwen models but also adds multi‑agent orchestration (so you can have a planner and executor collaborate), a built‑in diff viewer and browser integration. Because it runs locally, there’s no API cost and you can still enjoy the generous Qwen token limits.

[-]

UseDirect5224@reddit

Doesn't seem to exist anymore?

[-]

plasticpippo@reddit

you can use it with kilocode inside vscode. very very powerful indeed!

[-]

deleteme123@reddit

URL seems wrong

[-]

pixel_creatrice@reddit

Works for me: https://github.com/just-every/code

[-]

bitmoji@reddit

Is this node based for some horrible reason

[-]

dizvyz@reddit

All of the CLIs are node based.

[-]

zemaj-com@reddit

You're right that Code (and KILO) currently rely on Node. The rationale is that Node provides a cross‑platform runtime and tools like `npx` and `npm` make it trivial to install and update (you can run it with `npx -y u/just-every/code` or install it globally with `npm install -g u/just-every/code`). Using Node means we can ship one package that works on most systems without compiling separate binaries.

That said, most of the logic is language‑agnostic and we're open to ports in other languages. If you prefer Python, Rust or another stack, feel free to open an issue or PR—we're actively exploring alternative front‑ends. And if you'd rather not install Node globally, using `npx` gives you a clean, throwaway environment each time you run the CLI.

[-]

amokerajvosa@reddit

For me also displayed 404. But after few refresh it worked.

[-]

camh-@reddit

github.com/just‑every/code

The hyphen in that text is not an ASCII hyphen. It is some unicode hyphen. Replace it and copy and past will work.

[-]

Good_Tooth2514@reddit

Is the multi-agent capability for creating instances of the configured agent itself or for integrating more than one provider? Could you explain this to me? Thanks!

[-]

zemaj-com@reddit

Great question! In Code's multi‑agent mode you're not just spinning up clones of the same model; you're orchestrating separate agents that can use different providers or roles. For example, you might have a Qwen coding assistant, a planner agent based on GPT‑4, and a quality‑checker agent using another model. The CLI handles the plumbing so the agents can communicate and share context.

You can certainly run multiple instances of the same configured model if you just want concurrency, but the real power comes from mixing and matching different LLMs or functions into a cooperative workflow. Each agent runs in its own session/terminal and the built‑in multi‑agent commands coordinate them via a planner/executor architecture. That way you can integrate multiple providers in one cohesive project rather than being limited to clones of one agent.

[-]

Lower_Confidence8390@reddit

Is there any tutorial on how to set it uo with qwen ?

[-]

zemaj-com@reddit

Definitely! It's actually pretty simple to get Qwen's CLI and the Code CLI working together. For Qwen Code CLI you'll want to install the package (e.g. via `pip install qwen-code`). Then head over to qwen.ai or DashScope to create a free account and grab your DashScope API key. Run `qwen-code login` or export `DASHSCOPE_API_KEY=` so the CLI can authenticate.

If you want to use Qwen inside the Code CLI, install Code (either via `npx -y u/just-every/code` or `pip install code-cli`) and set the provider to Qwen using `--provider qwen`, for example:

```

code chat --provider qwen

```

The Code CLI has a `code providers` command that walks you through configuring keys so you don’t have to pass them each time. Since Code orchestrates multiple agents, you can also combine Qwen with OpenAI, Claude, local LLMs and the built-in browser integration. The README in the repository goes into more detail, but the basic steps are: install Qwen Code CLI, get your API key, export it (or run the login command) and call Code with the `--provider qwen` flag. Feel free to reach out if you hit any snags!

[-]

Lower_Confidence8390@reddit

Thank I will try this out tomorrow!

The api key also has the 20000 token limit etc ? I signed in to the cli with the qwen.ai (oauth) so I didn't need a Key

If the key limit is free and the same, it's possible to have a second account and a second key right? If I endup stuck by the limit someday

[-]

Lower_Confidence8390@reddit

The provider argument doesn't exist

coder chat --provider qwen

error: unexpected argument '--provider' found

tip: a similar argument exists: '--order'

Usage: code --order

For more information, try '--help'.

[-]

Lower_Confidence8390@reddit

I don't see how to use another model with my api key I got from here

https://openrouter.ai/qwen/qwen3-coder:free

[-]

nullnuller@reddit

How does it work with qwen-cli Is there any documentation?

[-]

Redox_ahmii@reddit

I don't think this does work with Qwen.

[-]

Finanzamt_Endgegner@reddit

this we need qwen oauth support!

[-]

metigue@reddit

I really wish there was a benchmark to compare all these frameworks. I've seen like 8 in the last week and they all look great.

[-]

NoobMLDude@reddit (OP)

this is great. thanks for sharing.

[-]

papaloukas@reddit

No Cline o Kilo Code are required for using in Visual Studio Code since there is an official extension: https://marketplace.visualstudio.com/items?itemName=qwenlm.qwen-code-vscode-ide-companion

[-]

NoobMLDude@reddit (OP)

Thanks for sharing. Right now it seems to be in Preview.

Have you tried it? What ca it do differently than KILO or Cline. kILO and Cline allow me to be model/provider agnostic and choose models based on the task.

[-]

EvinElias@reddit

I have managed to copy the source code from KiloCode and integrated it into Cline to add Qwen Code in API Provider settings, and it works well. I personally like Cline due to its current robust abilities compared to the rest. I completely vibe coded it using QODER. It's really good.

[-]

SerGo-emailFreela@reddit

как ты это сделал? можно подробнее для новичка?

[-]

SnooBreakthroughs537@reddit

Any ideas on how to integrate with crush (code mentioned above). I can try on my own, but would love to see pointers.

[-]

NoobMLDude@reddit (OP)

Cool stuff, I didnt get around to trying Cline with QwenCode.
Yes I like Cline too. I was using the newly released Free SONIC model in Cline.
That model is also Free now if you wish to try it out. https://youtu.be/D2GggzmAh-E

[-]

EvinElias@reddit

They accepted my pull request and merged. Even roo code did the same. So, it will soon reflect on both cline and room code in the next update.

[-]

CharacterBorn6421@reddit

qwen is not able to make any changes to the file and giving this error "Error

Roo tried to use apply_diff without value for required parameter 'path'. Retrying...

Roo is having trouble...

This may indicate a failure in the model's thought process or inability to use a tool properly, which can be mitigated with some user guidance (e.g. "Try breaking down the task into smaller steps")." do you also face this problem in roo code?

[-]

NoobMLDude@reddit (OP)

Awesome. Good work

[-]

EvinElias@reddit

Thanks 🙏

[-]

EvinElias@reddit

Yes, I have made a pull request there. I hope they accecpt and merge. Fingers crossed. 🤞

[-]

Bassiette03@reddit

how to install it?

[-]

NoobMLDude@reddit (OP)

The video links above walk you through the installation in detail.

[-]

Cultural-Arugula-894@reddit

It's taking too much time for me. I've been vibe coding for long hours now. This might the reason I guess. Can we use this in an IDE somehow with the 2000 per day free tier? I saw that we have to pay for the API if we want to use it in the chat interface?

Have you tried GLM 4.5, they also have $3 monthly plans and is affordable. Please let me know.

[-]

NoobMLDude@reddit (OP)

Wow that’s long. The time taken depends on the complexity of the task and also how many turns it needs to take to get the job done, fix mistakes, get it running etc.

Yes you can use Qwen Code free 2000 requests in VsCode using extensions like Cline / Kilo Code. Here’s a video showing how to set it up the Free QwenCode usage in Kilo Code: https://youtu.be/z_ks6Li1D5M

I have tried GLM and it’s also a good model. Just much bigger to run locally.

If it looks better for your tasks and the price seems reasonable, feel free to use it by all means.

but I try not to promote tools that are not Free just so that it is accessible for everyone.

[-]

Cultural-Arugula-894@reddit

Thanks for sharing the video. I was talking about the $3 and $15 monthly plans of Z.ai, which offer GLM 4.5 (not running it locally). Also, is there any way to use this model inside VS Code without the GLM API?

I also found out that there are ways to use Qwen code:
1. The video method you shared. (We get Qwen3 coder plus model with 1M context window)
2. Adding API key from Open Router and selecting the model Qwen3-Coder: Free, which is actually the 480B A35B. (with 262K context window).

[-]

ImFanOfRed@reddit

Using with Kilo often got :

Edit Unsuccessful

@ Kilo Code is having trouble...

This may indicate a failure in the model's thought process or inability to use a tool properly, which can be mitigated with some user guidance (e.g. "Try breaking down the task into smaller steps").

Anyone same here?

[-]

NoobMLDude@reddit (OP)

Haven’t seen this yet but your root cause analysis sounds correct. Some special tool tokens are not generated correctly for edit to be successful.

[-]

Previous_Foot_5328@reddit

Qoder is on r/artificial with an AMA! Come join the conversation and ask us anything!
https://www.reddit.com/r/artificial/comments/1n6lpl8/ama_with_qoder_team_an_agentic_coding_platform/

[-]

NoobMLDude@reddit (OP)

Thanks for the notification Ben.
Looking forward to the AMA.
I also did a review of Qoder Agent here if you are interested to check it out / bump:
https://youtu.be/4Zipfp4qdV4

[-]

SnooBreakthroughs537@reddit

any idea how to set it up in Crush?

[-]

NoobMLDude@reddit (OP)

Crush already has a FREE Qwen Code through the OpenRouter provider:

https://openrouter.ai/qwen/qwen3-coder:free

Look for Qwen free under model selection in Crush.

[-]

SnooBreakthroughs537@reddit

yeah. I am using that. BUt that's capped at a much lower level I think.

[-]

dirtychriz@reddit

qwen3-coder in qwen code is absolutely insane, and they giving it out for free. This won't last long, I believe.

[-]

NoobMLDude@reddit (OP)

Yes it IS surprising. It’s 2000 requests per day and 60 requests per minute. That’s enough to get work done, but not enough for people to abuse it by running 24/7 for random stuff. I’m trying to use it as long as the free tier lasts to get some stuff done using it .

[-]

1337vi@reddit

It’s pretty great but qwen code still has some issues. For certain projects that uses vite or other dev platform. It hangs when running ‘npm run dev’ which is such a let down for a big project like this.

[-]

NoobMLDude@reddit (OP)

Yes there definitely is issues I faced around running bash commands (also shown in the video) like starting a HTTP server.

[-]

Worried_Goat_8604@reddit

Guys dont forget someone started this idea of using qwen cli with ROO code first and it turned into a huge discussion with lots of support but it was kilo code that first introduced this is just 2 days so im pretty sure now they will regularly update making it work even more seamlessly. Remember the hard part of even seeing the feature of using qwen cli in kilo is dont now its just the task of making it work more seamlessly.

[-]

WorthDetective5912@reddit

Great alternative to cursor! Using it with KiloCode in VSC. But is there anyway to prevent that the changes are applied directly ? I only see the diff (red/green code colors) for a second and then the changes get applied instantly.

[-]

NoobMLDude@reddit (OP)

Yes you can. In the KILO CODE Settings you'll find a tab for "Auto-Approve". In there you can select which steps you want to auto-approve. All others should ask you for explicit approval. See screenshot below.

[-]

WorthDetective5912@reddit

but when i deactivate it i have to confirm every single action like reading files folders etc. i only want to see the difference in code side by side that the ai generates..

[-]

RageshAntony@reddit

Can I use this via Kilo Code without context length restrictions?

[-]

NoobMLDude@reddit (OP)

QwenCode already comes with a 1 million token context length.
If that is not sufficient for your use, KILO Code allows you to compress the Context anytime. So you can press that button to condense the context. See screenshot.

[-]

megadonkeyx@reddit

Doesn't it use the 30b moe model by default?

[-]

Equivalent_Cut_5845@reddit

And so does gemini-cli. For me the only appeal of qwen code is connection local/openai compatible models. If you're using qwen code just to use another api model then might as well use gemini cli.

[-]

Danmoreng@reddit

I did that. The Google free tier allows 100 requests/day. That allows for 1-2 hours of coding. Qwens 2.000 requests/day are more than enough for a full day of coding.

[-]

klam997@reddit

The google auth method where you use a free account is 1000 requests/day but I think they might limit you on total 2.5 pro use but you might be forced to switch to 2.5 flash.

Still.. free is free. I'm thankful for any company that still provides free options for people.

[-]

Danmoreng@reddit

Flash is unusable. It is 100 req/day of 2.5 pro with a free api key you can generate in google ai studio, the direct account login is much less.

[-]

lszb@reddit

Then I'd rather use cursor cli.

[-]

robberviet@reddit

Qwen Coder via qwen-code is great imo.

[-]

KoichiSP@reddit

Cool! Is it the 30B model, or the big one? By using Qwen's auth in Europe

[-]

NoobMLDude@reddit (OP)

They have both named as Plus and Flash.

[-]

korino11@reddit

Sry i cannot understand HOW to get API keys for Free useing?!?

[-]

dizvyz@reddit

My experience from a few second ago before I forget.

Go to chat.qwen.ai , click signup, i picked google login for simplicity. Then start the terminal app by typing qwen in the terminal, pick oauth from the given options and look at the web browser window that opens.

[-]

korino11@reddit

I have found instruction on a github. Than you and sry that i didnt get enough atention in details.

[-]

dizvyz@reddit

no worries. as long as it works.

[-]

NoobMLDude@reddit (OP)

The video shows you all the steps from start. The GitHub README mentions it as well.

[-]

dizvyz@reddit

You would create an account on qwen.ai then go from there I supposed. I am about to do that now so if you have follow up questions, let me know.

[-]

mohamed_stifi@reddit

Gpt oss for Arabic

[-]

lodott1@reddit

Great PSA - thanks for sharing! Say, are there any examples of how capable these types of tools are? Has anything substantial been built yet, without the need for heavy review/refactoring? Gpt5 left me partially impressed, partially wanting for more consistency and functionality.

[-]

badhiyahai@reddit

You still need to review for sure, sometimes they be deleting the databases.

[-]

NoobMLDude@reddit (OP)

Don't give full permissions to execute ALL commands.
Always read the commands before you allow it to execute.

[-]

badhiyahai@reddit

Yes, that's why I said you need to review for sure

[-]

NoobMLDude@reddit (OP)

Yes I was replying to the original comment, accidentally replied to yours.

[-]

NoobMLDude@reddit (OP)

you are welcome. happy to share.
Regarding quality (as I said above):
> "It is always a trade-off between can you get 90% work done with a Free service VS can you justify a $200 subscription of Claude or GPT5. For me the gain from even a $20 subscription when compared to these free alternatives is not much.
> So I stick to the free alternatives for now."

[-]

25th__Baam@reddit

I am seeing <th tokens when using in Cline or Kilo code. How can I resolve this.

[-]

NoobMLDude@reddit (OP)

Just got it working in KILO Code.
https://youtu.be/z_ks6Li1D5M
I faced similar issues when using the Flash model. However the Qwen_code_Plus works fine.

[-]

25th__Baam@reddit

Thanks Man! 🙏🙏

[-]

Fantastic_Spite_5570@reddit

Is it good though

[-]

NoobMLDude@reddit (OP)

how would you define good?
It is always a trade-off between can you get 90% work done with a Free service VS can you justify a $200 subscription of Claude or GPT5.

For me the gain from even a $20 subscription when compared to these free alternatives is not much.
So I stick to the free alternatives for now.

[-]

Fantastic_Spite_5570@reddit

Getting 90% for free is great. Didn’t know it was that good

[-]

NoobMLDude@reddit (OP)

Of course 90% is also depending on the task you wish to accomplish. If you wish to create apps it is decent, but if you wish to do complex data science, I'm not sure.
So you need to try it for your tasks and judge it.
"It's good for Task X" does not guarantee that "It will be good for Task Y"

[-]

jonasaba@reddit

Can I use it with vscode somehow?

[-]

NoobMLDude@reddit (OP)

Yes you can. You can use KILO Code inside VSCode.
Here's a video to set it up: https://youtu.be/z_ks6Li1D5MUse the qwen-coder-plus model . the Flash model has issues with tool calling in KILO Code.

[-]

lordpuddingcup@reddit

Really wish they’d incorporate the qwen cli auth into roocode

[-]

NoobMLDude@reddit (OP)

Just got it working in KILO Code.
https://youtu.be/z_ks6Li1D5M
I'm guessing RooCode could also have a similar model selection for Qwen-Code or using Oauth creds.
You can find the QwenCode oAuth creds in `\~/.qwem/oauth_creds.json`

[-]