Skip to main content
All Hermes Stories
Cost Optimisation4 min readGreg Isenberg & Imran Muthuvappa

Token Spend Cut 90% — $130 to $10 Per 5 Days — by Switching to Hermes

Moving from an expensive agent setup to Hermes with OpenRouter on a cheap Android device via Termux reduced the cost per 5-day sprint from $130 to $10.

90%

reduction in token spend

$130 → $10

per 5-day period

Android

runs on phone via Termux

Key Takeaways

  • OpenRouter as a provider layer gives access to cheaper models without sacrificing capability, reducing token costs dramatically.
  • Hermes on Termux means a capable AI agent runs on a cheap Android device — no dedicated hardware required.
  • The combination of VPS hosting, Termux, and OpenRouter routing creates a cost-optimised stack that is competitive with or better than expensive managed solutions.

The Cost Problem

Running an AI agent at production intensity — multiple hours per day, across complex tasks — gets expensive quickly with direct API billing. Paying $130 per 5-day sprint is unsustainable for most individual users and small teams.

cut my token spend ~90%

The Stack That Changes the Economics

The cost reduction comes from three changes working together: switching to OpenRouter for model routing (which enables using cheaper models for simpler tasks without routing everything through the most expensive option), running on Android via Termux (cheap hardware, no dedicated server cost), and Hermes's efficient context management (which reduces unnecessary token consumption per session).

  • OpenRouter: intelligent routing to cheaper models for lower-complexity tasks
  • Termux on Android: agent runs on existing phone hardware, no dedicated server needed
  • Hermes context management: compression and summarisation reduce token waste

What the Phone Unlocks

Running Hermes on Android via Termux also enables mobile-native integrations: SMS, device sensors, social posting, and app interactions that aren't available from a desktop or VPS. The cost reduction comes with capability expansion, not a capability trade-off.

Story sourced from the official Nous Research Hermes user-stories page. Original author: Greg Isenberg & Imran Muthuvappa.

Give your Hermes agent a soul

A consistent identity makes Hermes more powerful

AgentSoul generates a four-file personality bundle — identity, soul, user context, and agent directives — tuned to your Enneagram type. Drop it into Hermes and your agent has a coherent voice from session one.