2026 OpenRouter CLI Tools Ranking: Hermes, Kilo Code, or Claude Code for Your Mac Workflow?

00Why App Rankings Outrank GitHub Stars for CLI Procurement

Model leaderboards answer which LLM scored highest on a benchmark. App leaderboards answer which client developers trust to hold long sessions, fire tool calls, and burn tokens at scale. OpenRouter routes more than 100 trillion tokens per month across 300+ models and 60+ providers for over 8 million users. When you filter that firehose by application identity at openrouter.ai/apps, you get a weekly census of real CLI and agent adoption — not survey responses or repo watchers.

The shift is structural. In early 2025, programming workloads were roughly 11% of platform traffic. By June 2026 they exceed 50%, and CLI plus agent clients now absorb more than 70% of the rolling weekly token pool. That concentration means your choice of terminal client is as consequential as your default model ID: the wrong gateway doubles latency on tool loops; the right one amortizes context across sub-agents and keeps MCP servers warm across overnight runs.

Hermes Agent topping the platform chart at 4.94T tokens for the week is not a vanity metric. It signals that persistent, gateway-backed agents — with memory layers, skill documents, and Telegram or Slack bridges — have moved from weekend experiments to default infrastructure. Kilo Code at 1.22T and Claude Code at 606B confirm that IDE-integrated and vendor-native CLIs remain massive, but the gap between first and third place shows where growth is fastest: autonomous agents that never close the terminal session.

Procurement teams should treat these figures the same way they treat cloud spend dashboards. Pair app rankings with model rankings from our May 18–24 billing breakdown: the model board tells you what to call; the app board tells you what to run the calls through. Both update weekly; lock your architecture reviews to the same rolling window.

Pain PointsFive Mistakes Teams Make When Picking a CLI From Hype Instead of Token Data

Equating GitHub stars with production share: A repo can add 20,000 stars from a single Hacker News spike while weekly token share stays flat. OpenRouter app rankings measure sustained routing — the difference between a demo install and a default client.
Ignoring the platform vs CLI split: Hermes Agent leads all apps at 4.94T, but Kilo Code leads the CLI-specific slice. Platform totals include gateways, web UIs, and mobile bridges. Procurement should compare within the same category before declaring a winner.
Assuming vendor lock-in equals throughput: Claude Code holds strong share at 606B tokens, yet trails Hermes by an order of magnitude. Anthropic-native tooling wins on enterprise compliance paths; open agent stacks win on volume and BYOK flexibility. Neither metric alone answers your compliance question.
Running heavy CLI loops on oversubscribed hosts: Kilo Code and Cline issue thousands of tool calls per session. If the host sleeps, throttles SSH, or resets long HTTP/2 streams, the client looks broken when the infrastructure is at fault. Agent CLIs need 24/7 macOS compute with auditable tenancy — a separate line item from API keys.
Decoupling client choice from model routing: A CLI that excels at sub-agent orchestration (Hermes, Goose) pairs naturally with low-cost model matrices like DeepSeek-V4-Flash. A CLI optimized for single-threaded pair programming (Aider, Claude Code) may not need the same gateway topology. Match client architecture to the routing strategy in our trends guide.

01Data Source: App Leaderboard and Weekly Window

All figures in this article come from the public application leaderboard at openrouter.ai/apps. The measurement window is rolling 7-day token throughput (input plus output), anchored to June 2–8, 2026. Dimensions include per-app token totals, week-over-week momentum, and category filters that isolate CLI clients from browser and mobile surfaces.

Metric	Value	Notes
Measurement window	June 2–8, 2026	Rolling 7-day slice on openrouter.ai/apps
CLI + agent share of weekly tokens	70%+	Terminal and agent clients dominate platform volume
Platform leader	Hermes Agent — 4.94T	Rank #1 among all applications
CLI rank #3 (all apps)	Kilo Code — 1.22T	Top IDE-integrated open-source CLI
CLI rank #4 (all apps)	Claude Code — 606B	Strong enterprise and Anthropic-ecosystem share

OpenRouter attributes tokens to the client application that initiated the request, not merely the User-Agent string on a raw curl call. That attribution is what makes the app board actionable: you can see whether your team’s spend clusters on Kilo Code routes versus a custom gateway proxy. When comparing to model-level data from openrouter.ai/rankings, keep the windows aligned — mixing a June app slice with a May model slice will produce false conclusions about client-model fit.

The 70%+ CLI and agent share figure is the headline for 2026 procurement. Chat-style web UIs still exist, but the economic center of gravity moved to tools that edit repositories, spawn sub-agents, and hold MCP sessions open for hours. Any stack review that treats the CLI as optional is reviewing 2024 assumptions.

02Platform Snapshot and CLI Top 10

Among all applications on OpenRouter for the week of June 2–8, 2026, three CLI-relevant entries sit in the global top five: Hermes Agent (#1, 4.94T), Kilo Code (#3, 1.22T), and Claude Code (#4, 606B). The #2 slot is occupied by a non-CLI surface (browser or bundled IDE traffic), which is why the CLI-specific filter remains essential for terminal procurement.

CLI Rank	Application	Profile	Typical Host
1	Kilo Code	VS Code fork; multi-mode agent; strong MCP and sub-agent story	Local Mac or cloud Mac with GUI/VNC
2	Claude Code	Anthropic-native terminal; enterprise policy hooks; deep Claude model integration	macOS primary; CI runners for batch
3	Hermes Agent	Gateway-backed persistent agent; three-layer memory; Telegram/Slack bridges	Dedicated 24/7 Mac or Linux VPS
4	Aider	Git-native pair programmer; minimal UI; scriptable in CI	Local shell or headless cloud Mac
5	Cline	VS Code extension with terminal agent; plan/act modes; MCP marketplace	Developer laptop or remote desktop Mac
6	Goose	Block open-source agent; recipe and sub-agent orchestration	macOS and Linux; Square ecosystem integrations
7	OpenCode	Terminal UI client; BYOK-first; fast iteration on OpenRouter routes	Any Unix host; popular on cloud Mac SSH sessions
8	Codex CLI	OpenAI Codex terminal agent; sandboxed command execution	macOS with Apple Silicon recommended
9	Roo Code	VS Code agent extension; mode switching; budget controls	Local or remote VS Code on Mac
10	Qwen Code	Alibaba Qwen-native CLI; strong on Chinese model routes	macOS and Linux; pairs with low-cost CN models

Read the ordering carefully. Kilo Code leads the CLI-specific board, while Hermes Agent leads the all-apps board because gateway traffic aggregates Telegram bots, scheduled jobs, and headless runners under one application ID. A team standardizing on a single terminal for interactive coding will weight Kilo and Claude Code heavily; a team deploying always-on operations agents will weight Hermes and Goose.

Data point 1: Hermes Agent processed 4.94T tokens in seven days — roughly 4x Kilo Code's 1.22T and 8x Claude Code's 606B.
Data point 2: The CLI Top 10 collectively accounts for the majority of the 70%+ CLI/agent weekly share; long-tail clients fragment the remainder.
Data point 3: VS Code-ecosystem tools (Kilo Code, Cline, Roo Code) occupy three of ten CLI slots — proof that IDE-adjacent agents remain the default entry point for application developers.
Data point 4: Vendor-native CLIs (Claude Code, Codex CLI, Qwen Code) hold three slots — enterprise buyers still route through first-party clients for policy and billing alignment.

03Feature Comparison: Hermes vs Kilo Code vs Claude Code

Dimension	Hermes Agent	Kilo Code	Claude Code
Weekly tokens (all apps)	4.94T (#1)	1.22T (#3)	606B (#4)
Primary surface	Gateway + CLI + chat bridges	VS Code fork IDE	Terminal-native Anthropic client
Persistence model	Three-layer memory; skill documents; SQLite FTS	Workspace state; task history in IDE	Session context; enterprise retention policies
Sub-agents / MCP	Native skill spawning; MCP server hosting	Strong MCP marketplace; mode switching	Tool use via Claude; MCP via plugins
BYOK / OpenRouter	First-class; any provider via env config	Native OpenRouter route support	Anthropic API primary; OpenRouter via proxy
24/7 unattended	Designed for launchd/systemd daemons	Possible; not the default posture	Batch and CI; less gateway-oriented
Best fit	Ops agents, Telegram bots, memory-heavy workflows	Full-stack IDE pair programming	Enterprise Claude shops, policy-bound coding

None of the three "wins" universally. Hermes wins on persistent throughput because it is architected as infrastructure, not a REPL you open after lunch. Kilo Code wins on developer ergonomics when the human stays in the loop across files, terminals, and diffs. Claude Code wins when procurement already standardized on Anthropic and needs a terminal client with predictable compliance artifacts.

Aider, Cline, and Goose deserve mention in the same breath. Aider remains the leanest git-centric option for engineers who live in vim and want deterministic commit boundaries. Cline and Roo Code split the VS Code extension market by UX philosophy — plan/act rigor versus mode-based budgeting. OpenCode attracts teams that want a fast TUI without IDE baggage. Codex CLI and Qwen Code anchor vendor-specific routes when model routing policy mandates a single provider family.

04Scenario Router: Which CLI for Which Job

Token share is a prior, not a prescription. Route clients by workload shape, then validate against your OpenRouter bill after two weeks.

Scenario	Recommended CLI	Default model route	Why
24/7 ops agent with Telegram alerts	Hermes Agent	DeepSeek-V4-Flash via OpenRouter	Gateway memory and daemon posture match unmanned runs
Full-repo feature work in VS Code	Kilo Code or Cline	Mixed: Flash for drafts, Sonnet for review	IDE integration reduces context-switch tax
Enterprise Claude-only policy	Claude Code	Claude Sonnet 4.6 / Opus on critical paths	Vendor client simplifies audit and DPA coverage
CI batch refactors on a schedule	Aider or Codex CLI	Low-cost MoE for bulk; flagship for gates	Scriptable, headless, minimal GUI dependency
Sub-agent recipes across microservices	Goose or Hermes	OpenRouter scenario routes per sub-task	Native orchestration without custom glue code
China-model-first cost optimization	Qwen Code or OpenCode	Qwen / DeepSeek matrix on OpenRouter	Client defaults align with CN provider routes

Teams often run two clients in parallel: Kilo Code or Claude Code for interactive daytime coding on a developer Mac, and Hermes on a dedicated cloud Mac for overnight scans, dependency updates, and incident playbooks. Capture recurring prompts as SKILL.md files per our Cursor Agent Skill guide so context does not drift when operators switch clients. Install and gateway steps for Hermes are documented in our Hermes install walkthrough.

05Mac Hardware and Cloud Rental Matrix for CLI Workloads

CLI token volume is only half the equation. The host must keep SSH sessions alive, survive Xcode indexing beside agent loops, and provide enough unified memory when you colocate local inference with gateway processes.

Workload	Minimum Mac tier	Memory	Rental vs local
Interactive Kilo / Claude Code only	M4 Mac Mini	16–24 GB	Local laptop OK; cloud Mac for distributed teams
Hermes gateway + MCP servers 24/7	M4 Pro Mac Mini or Mac Studio	32–48 GB	Cloud Mac rental avoids home-network outages
Hermes + local ds4 inference	M4 Max / Studio	96 GB+	Dedicated bare-metal Mac; see ds4 guide
CI runners with Aider batch jobs	M4 Mac Mini per runner	24 GB	One cloud Mac per tenant boundary

Shared minute-pool macOS VPS offerings often suffer bandwidth jitter, oversubscription, and long-connection resets — especially painful when Kilo Code or Hermes issues thousands of tool calls over a twelve-hour background session. When you need an auditable production plane, NUKCLOUD multi-region bare-metal Mac and cloud Mac nodes align more cleanly with procurement and compliance documentation than generic oversubscribed hosts.

Sizing follows uptime and memory headroom, not the weekly app board. Hermes at 4.94T tokens does not mean you need a new chip every Monday; it means your gateway host must not sleep. Review specs on the pricing page, provision through the console, and finalize tenant boundaries on the order page before pointing production API keys at a new CLI default.

06Six-Step Runbook: CLI Selection on Dedicated Cloud Mac Agents

App rankings answer what the world is running this week. Your runbook must also answer where the client and gateway live. We recommend layering OpenRouter for model breadth on a NUKCLOUD dedicated Apple Silicon instance: run persistent agents on the host, keep interactive IDEs on developer laptops, and align repeatable prompts with your skill library.

01
Subscribe to the app board, not just the model board: Every Monday, open openrouter.ai/apps and OpenRouter Rankings. Archive screenshots of CLI and model Top 10 entries. Add movers like Goose or Qwen Code to an observation list. Validate for two weeks before changing team defaults.
02
Map scenarios to clients before mandating one tool: Interactive IDE work → Kilo Code or Cline; enterprise Claude policy → Claude Code; 24/7 gateway → Hermes; CI batch → Aider. Document the matrix in your internal AGENTS.md so contractors do not cargo-cult a single CLI.
03
Reconcile OpenRouter bills by application ID: Each month, compare dollar spend per client with the app leaderboard token share. If Kilo Code traffic rose but spend stayed on Claude Code routes, your proxy configuration is stale — rebalance before the invoice normalizes the wrong habit.
04
Provision a dedicated cloud Mac for persistent agents: Use the console to finalize region, SSH access, and tenant boundaries. Gateway long connections and GitHub Runners need hosts that will not be evicted by oversubscription. See the order page for specs: standard tier for gateway-only Hermes; 96GB+ unified memory if you colocate local inference.
05
Deploy clients under launchd with explicit env files: Install Hermes or Goose per the install guide. Point OpenRouter base URLs at an internal proxy. Store API keys in tenant-scoped env files, not shell profiles. Capture recurring prompts as SKILL.md files to reduce instruction drift when switching model IDs.
06
Run a biweekly retrospective: Adjust default clients when the app board shifts. If monthly API spend exceeds high-memory Mac rental and your codebase is sensitive, evaluate self-hosted inference on a dedicated Mac. If you only need 24/7 uptime, prioritize network stability and memory headroom over chasing new silicon. Cost reviews live on the pricing page.

07Frequently Asked Questions

Hermes is #1 overall but Kilo Code leads the CLI board — which number should I cite?

Cite both with context. Hermes aggregates gateway, bot, and headless traffic under one app ID, so it wins the all-apps chart at 4.94T. Kilo Code leads when buyers filter to interactive CLI/IDE clients. Procurement decks should use the filter that matches the buying decision.

Does Claude Code's 606B share mean Anthropic is losing the CLI war?

No — it means volume leadership moved to open agent stacks and IDE forks. Claude Code remains the right default when contracts require Anthropic-native tooling. Many enterprises run Claude Code for compliance-sensitive repos and Hermes for high-frequency automation on the same OpenRouter account.

Can we standardize on one CLI for the whole company?

Possible for small teams; unrealistic above ~20 engineers. Standardize on routing policy, secrets handling, and host tenancy instead. Let squads pick Kilo, Claude Code, or Aider by scenario while the platform team owns the cloud Mac gateway and OpenRouter proxy.

Why does this article focus on Mac hosts instead of Linux VPS?

Many CLI tools — Claude Code, Codex CLI, Kilo Code with GUI — assume macOS paths, Xcode toolchains, or Apple Silicon performance. Linux VPS works for headless Hermes and Aider, but mixed teams shipping iOS or macOS targets need Mac hardware. NUKCLOUD bare-metal Mac nodes cover both SSH headless and GUI remote desktop patterns.

How often do these CLI rankings change?

Weekly, sometimes sharply after a major release. Treat rankings like model boards: review every Monday, change defaults only after two consecutive weeks of momentum. Host sizing changes far less often — upgrade memory when workloads grow, not when a new CLI enters the Top 10.