Understanding OpenClaw Token Consumption | Generated by AI

Home 2026.05

Question: Why does OpenClaw consume ~3 million tokens per day even when I’m not actively using it (as seen in OpenRouter)?

Answer:

Short answer: it’s almost certainly still doing work in the background.

When a system like OpenClaw (or similar agent-style frameworks) is running, “doing nothing” from your perspective doesn’t mean idle. These systems are often built as continuous loops or autonomous agents, and they can burn tokens quietly through several mechanisms:


1. Continuous agent loop (most common cause)

Many agent frameworks run something like:

while True:
    observe()
    think()
    act()

Even if no meaningful task is happening, the model is still:

Each loop = tokens consumed.

If the loop runs fast (e.g., every 1–5 seconds), you can easily hit millions of tokens/day.


2. Hidden background tasks

OpenClaw-style systems often include:

These can trigger LLM calls like:

Even if nothing changes, the model is still queried.


3. Long context accumulation

If the system keeps appending history:

Example:


4. Multiple parallel agents or threads

Some frameworks silently run:

So one “step” might actually be 3–5 LLM calls.


5. Retry / failure loops

If something is slightly broken:

The system may retry endlessly:

try → fail → reflect → retry → fail → reflect ...

This can explode token usage very quickly.


6. OpenRouter accounting makes it visible

Platforms like OpenRouter show total token usage, so you suddenly notice:

Without that visibility, you’d never realize it’s happening.


Rough intuition

3 million tokens/day ≈

That’s totally consistent with a fast loop + moderate context.


What to check immediately

  1. Is the process still running?

    • Background script / server / Docker container?
  2. Loop frequency

    • Look for while True or polling intervals
  3. Context size

    • Are you appending full history every call?
  4. Number of LLM calls per step

    • Planner + executor + critic?
  5. Retry logic

    • Any infinite retry loops?
  6. Logs

    • You’ll likely see constant LLM requests even when idle

Practical fixes


Key insight

Agent systems are not like normal apps. If you don’t explicitly stop them, they behave like:

“Always thinking, even when there’s nothing to think about.”

And thinking = tokens = money.


If you want, you can paste your OpenClaw config or loop code, and I can pinpoint exactly where the tokens are being burned.


Back Donate