Trying to understand how Claude Code token usage maps to quota consumption.

Posted by Eastern_Campaign5482@reddit | LocalLLaMA

I ran into something confusing with Claude Code and wanted to check if others have seen this.

My usage:

- 2 conversations total

- One conversation: \~70k–100k tokens (as reported by Claude Code)

- Another conversation: \~11k tokens

- Mostly just reviewing code, no heavy generation or large tasks

So in total, somewhere between 80k and 110k tokens reported (call it \~100k).

However, this already consumed over 90% of my 5-hour quota.

This seems highly disproportionate.

Possible explanations I’m considering:

- Hidden token usage (tool calls, file operations, etc.)

- Context being repeatedly reprocessed

- Reported token count not reflecting actual billed usage
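To make the "context being repeatedly reprocessed" idea concrete, here is a rough back-of-the-envelope sketch. It assumes every turn resends the full conversation history as input, and that all of those input tokens count toward quota. Neither assumption is confirmed for Claude Code (caching, for example, could change the picture), and the turn sizes and the `system_overhead` parameter are made-up illustration values, not measurements.

```python
def cumulative_input_tokens(turn_sizes, system_overhead=0):
    """Total input tokens processed if every turn resends the whole history.

    turn_sizes: new tokens added to the conversation on each turn (hypothetical).
    system_overhead: fixed tokens resent with every request, e.g. a system
        prompt or tool definitions (assumed, not a documented value).
    Returns (total tokens processed across all turns, final context size).
    """
    total_processed = 0
    context = 0
    for new_tokens in turn_sizes:
        context += new_tokens                        # history grows each turn
        total_processed += system_overhead + context  # whole history is sent again
    return total_processed, context


# Hypothetical conversation: 20 turns, each adding 5k tokens.
turns = [5_000] * 20
processed, final_context = cumulative_input_tokens(turns)
print(final_context)  # 100000
print(processed)      # 1050000
```

Under these assumptions, a conversation that ends at \~100k tokens of context would have pushed roughly 10x that many tokens through the model in total, which is the kind of gap that could explain a small reported count next to a large quota hit.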

I’ve already contacted support to clarify.

Also, is there any reliable way to track how token usage maps to quota consumption in Claude Code?

Right now, the reported token count (\~100k) doesn’t explain the actual quota usage (90%+), which makes it very hard to predict or control usage.
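Until there is an official answer, one low-tech option is to log the reported token count next to the quota percentage at a few points and look at the rate between readings. The sketch below is only that: `Sample` and `quota_pct_per_1k` are hypothetical helper names, readings have to be entered by hand, and the starting point of 0 tokens at 0% quota is an assumption rather than something I recorded.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    reported_tokens: int   # token count as displayed by the tool
    quota_used_pct: float  # quota percentage shown at the same moment


def quota_pct_per_1k(samples):
    """Quota percent consumed per 1k reported tokens between consecutive samples."""
    rates = []
    for prev, cur in zip(samples, samples[1:]):
        delta_tokens = cur.reported_tokens - prev.reported_tokens
        if delta_tokens <= 0:
            continue  # skip readings where the reported count did not grow
        delta_pct = cur.quota_used_pct - prev.quota_used_pct
        rates.append(delta_pct / (delta_tokens / 1000))
    return rates


# The numbers from this post, with an assumed 0 / 0% starting point.
samples = [Sample(0, 0.0), Sample(100_000, 90.0)]
print(quota_pct_per_1k(samples))  # [0.9]
```

If the rate jumps around a lot between readings, that would suggest the reported count is not the quantity the quota is actually based on.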

Curious if others have experienced something similar or can explain how this is supposed to work.

If support provides any explanation, I’ll update this thread.