2026 OpenRouter CLI Tools Ranking: Hermes, Kilo Code, or Claude Code for Your Mac Workflow?

OpenRouter's rolling 7-day app leaderboard for June 2–8, 2026 shows Hermes Agent at 4.94 trillion tokens as the number-one platform, Kilo Code at 1.22T in third place, and Claude Code at 606B in fourth. CLI and agent clients together account for more than 70% of weekly throughput — this article ranks the Top 10 CLI tools, compares capabilities, and delivers a six-step runbook for dedicated cloud Mac hosts.

GitHub star counts and launch-day demo videos tell you which CLI tool had the loudest marketing week. OpenRouter's app-level token rankings tell you which tool developers actually routed production traffic through. This article is for engineers and tech leads choosing between Hermes Agent, Kilo Code, Claude Code, Aider, Cline, and the rest of the 2026 terminal stack: (1) why billing-grade app data beats social proof; (2) a platform and CLI Top 10 from openrouter.ai/apps for June 2–8, 2026; (3) a feature matrix and scenario router aligned with our weekly model rankings and June trends analysis; and (4) a Mac hardware decision guide plus a six-step checklist on NUKCLOUD dedicated cloud Mac nodes.

00Why App Rankings Outrank GitHub Stars for CLI Procurement

Model leaderboards answer which LLM scored highest on a benchmark. App leaderboards answer which client developers trust to hold long sessions, fire tool calls, and burn tokens at scale. OpenRouter routes more than 100 trillion tokens per month across 300+ models and 60+ providers for over 8 million users. When you filter that firehose by application identity at openrouter.ai/apps, you get a weekly census of real CLI and agent adoption — not survey responses or repo watchers.

The shift is structural. In early 2025, programming workloads were roughly 11% of platform traffic. By June 2026 they exceed 50%, and CLI plus agent clients now absorb more than 70% of the rolling weekly token pool. That concentration means your choice of terminal client is as consequential as your default model ID: the wrong gateway doubles latency on tool loops; the right one amortizes context across sub-agents and keeps MCP servers warm across overnight runs.

Hermes Agent topping the platform chart at 4.94T tokens for the week is not a vanity metric. It signals that persistent, gateway-backed agents — with memory layers, skill documents, and Telegram or Slack bridges — have moved from weekend experiments to default infrastructure. Kilo Code at 1.22T and Claude Code at 606B confirm that IDE-integrated and vendor-native CLIs remain massive, but the gap between first and third place shows where growth is fastest: autonomous agents that never close the terminal session.

Procurement teams should treat these figures the same way they treat cloud spend dashboards. Pair app rankings with model rankings from our May 18–24 billing breakdown: the model board tells you what to call; the app board tells you what to run the calls through. Both update weekly; lock your architecture reviews to the same rolling window.

Pain PointsFive Mistakes Teams Make When Picking a CLI From Hype Instead of Token Data

  • Equating GitHub stars with production share: A repo can add 20,000 stars from a single Hacker News spike while weekly token share stays flat. OpenRouter app rankings measure sustained routing — the difference between a demo install and a default client.
  • Ignoring the platform vs CLI split: Hermes Agent leads all apps at 4.94T, but Kilo Code leads the CLI-specific slice. Platform totals include gateways, web UIs, and mobile bridges. Procurement should compare within the same category before declaring a winner.
  • Assuming vendor lock-in equals throughput: Claude Code holds strong share at 606B tokens, yet trails Hermes by an order of magnitude. Anthropic-native tooling wins on enterprise compliance paths; open agent stacks win on volume and BYOK flexibility. Neither metric alone answers your compliance question.
  • Running heavy CLI loops on oversubscribed hosts: Kilo Code and Cline issue thousands of tool calls per session. If the host sleeps, throttles SSH, or resets long HTTP/2 streams, the client looks broken when the infrastructure is at fault. Agent CLIs need 24/7 macOS compute with auditable tenancy — a separate line item from API keys.
  • Decoupling client choice from model routing: A CLI that excels at sub-agent orchestration (Hermes, Goose) pairs naturally with low-cost model matrices like DeepSeek-V4-Flash. A CLI optimized for single-threaded pair programming (Aider, Claude Code) may not need the same gateway topology. Match client architecture to the routing strategy in our trends guide.

01Data Source: App Leaderboard and Weekly Window

All figures in this article come from the public application leaderboard at openrouter.ai/apps. The measurement window is rolling 7-day token throughput (input plus output), anchored to June 2–8, 2026. Dimensions include per-app token totals, week-over-week momentum, and category filters that isolate CLI clients from browser and mobile surfaces.

MetricValueNotes
Measurement windowJune 2–8, 2026Rolling 7-day slice on openrouter.ai/apps
CLI + agent share of weekly tokens70%+Terminal and agent clients dominate platform volume
Platform leaderHermes Agent — 4.94TRank #1 among all applications
CLI rank #3 (all apps)Kilo Code — 1.22TTop IDE-integrated open-source CLI
CLI rank #4 (all apps)Claude Code — 606BStrong enterprise and Anthropic-ecosystem share

OpenRouter attributes tokens to the client application that initiated the request, not merely the User-Agent string on a raw curl call. That attribution is what makes the app board actionable: you can see whether your team’s spend clusters on Kilo Code routes versus a custom gateway proxy. When comparing to model-level data from openrouter.ai/rankings, keep the windows aligned — mixing a June app slice with a May model slice will produce false conclusions about client-model fit.

The 70%+ CLI and agent share figure is the headline for 2026 procurement. Chat-style web UIs still exist, but the economic center of gravity moved to tools that edit repositories, spawn sub-agents, and hold MCP sessions open for hours. Any stack review that treats the CLI as optional is reviewing 2024 assumptions.

02Platform Snapshot and CLI Top 10

Among all applications on OpenRouter for the week of June 2–8, 2026, three CLI-relevant entries sit in the global top five: Hermes Agent (#1, 4.94T), Kilo Code (#3, 1.22T), and Claude Code (#4, 606B). The #2 slot is occupied by a non-CLI surface (browser or bundled IDE traffic), which is why the CLI-specific filter remains essential for terminal procurement.

CLI RankApplicationProfileTypical Host
1Kilo CodeVS Code fork; multi-mode agent; strong MCP and sub-agent storyLocal Mac or cloud Mac with GUI/VNC
2Claude CodeAnthropic-native terminal; enterprise policy hooks; deep Claude model integrationmacOS primary; CI runners for batch
3Hermes AgentGateway-backed persistent agent; three-layer memory; Telegram/Slack bridgesDedicated 24/7 Mac or Linux VPS
4AiderGit-native pair programmer; minimal UI; scriptable in CILocal shell or headless cloud Mac
5ClineVS Code extension with terminal agent; plan/act modes; MCP marketplaceDeveloper laptop or remote desktop Mac
6GooseBlock open-source agent; recipe and sub-agent orchestrationmacOS and Linux; Square ecosystem integrations
7OpenCodeTerminal UI client; BYOK-first; fast iteration on OpenRouter routesAny Unix host; popular on cloud Mac SSH sessions
8Codex CLIOpenAI Codex terminal agent; sandboxed command executionmacOS with Apple Silicon recommended
9Roo CodeVS Code agent extension; mode switching; budget controlsLocal or remote VS Code on Mac
10Qwen CodeAlibaba Qwen-native CLI; strong on Chinese model routesmacOS and Linux; pairs with low-cost CN models

Read the ordering carefully. Kilo Code leads the CLI-specific board, while Hermes Agent leads the all-apps board because gateway traffic aggregates Telegram bots, scheduled jobs, and headless runners under one application ID. A team standardizing on a single terminal for interactive coding will weight Kilo and Claude Code heavily; a team deploying always-on operations agents will weight Hermes and Goose.

  • Data point 1: Hermes Agent processed 4.94T tokens in seven days — roughly 4x Kilo Code's 1.22T and 8x Claude Code's 606B.
  • Data point 2: The CLI Top 10 collectively accounts for the majority of the 70%+ CLI/agent weekly share; long-tail clients fragment the remainder.
  • Data point 3: VS Code-ecosystem tools (Kilo Code, Cline, Roo Code) occupy three of ten CLI slots — proof that IDE-adjacent agents remain the default entry point for application developers.
  • Data point 4: Vendor-native CLIs (Claude Code, Codex CLI, Qwen Code) hold three slots — enterprise buyers still route through first-party clients for policy and billing alignment.

03Feature Comparison: Hermes vs Kilo Code vs Claude Code

DimensionHermes AgentKilo CodeClaude Code
Weekly tokens (all apps)4.94T (#1)1.22T (#3)606B (#4)
Primary surfaceGateway + CLI + chat bridgesVS Code fork IDETerminal-native Anthropic client
Persistence modelThree-layer memory; skill documents; SQLite FTSWorkspace state; task history in IDESession context; enterprise retention policies
Sub-agents / MCPNative skill spawning; MCP server hostingStrong MCP marketplace; mode switchingTool use via Claude; MCP via plugins
BYOK / OpenRouterFirst-class; any provider via env configNative OpenRouter route supportAnthropic API primary; OpenRouter via proxy
24/7 unattendedDesigned for launchd/systemd daemonsPossible; not the default postureBatch and CI; less gateway-oriented
Best fitOps agents, Telegram bots, memory-heavy workflowsFull-stack IDE pair programmingEnterprise Claude shops, policy-bound coding

None of the three "wins" universally. Hermes wins on persistent throughput because it is architected as infrastructure, not a REPL you open after lunch. Kilo Code wins on developer ergonomics when the human stays in the loop across files, terminals, and diffs. Claude Code wins when procurement already standardized on Anthropic and needs a terminal client with predictable compliance artifacts.

Aider, Cline, and Goose deserve mention in the same breath. Aider remains the leanest git-centric option for engineers who live in vim and want deterministic commit boundaries. Cline and Roo Code split the VS Code extension market by UX philosophy — plan/act rigor versus mode-based budgeting. OpenCode attracts teams that want a fast TUI without IDE baggage. Codex CLI and Qwen Code anchor vendor-specific routes when model routing policy mandates a single provider family.

04Scenario Router: Which CLI for Which Job

Token share is a prior, not a prescription. Route clients by workload shape, then validate against your OpenRouter bill after two weeks.

ScenarioRecommended CLIDefault model routeWhy
24/7 ops agent with Telegram alertsHermes AgentDeepSeek-V4-Flash via OpenRouterGateway memory and daemon posture match unmanned runs
Full-repo feature work in VS CodeKilo Code or ClineMixed: Flash for drafts, Sonnet for reviewIDE integration reduces context-switch tax
Enterprise Claude-only policyClaude CodeClaude Sonnet 4.6 / Opus on critical pathsVendor client simplifies audit and DPA coverage
CI batch refactors on a scheduleAider or Codex CLILow-cost MoE for bulk; flagship for gatesScriptable, headless, minimal GUI dependency
Sub-agent recipes across microservicesGoose or HermesOpenRouter scenario routes per sub-taskNative orchestration without custom glue code
China-model-first cost optimizationQwen Code or OpenCodeQwen / DeepSeek matrix on OpenRouterClient defaults align with CN provider routes

Teams often run two clients in parallel: Kilo Code or Claude Code for interactive daytime coding on a developer Mac, and Hermes on a dedicated cloud Mac for overnight scans, dependency updates, and incident playbooks. Capture recurring prompts as SKILL.md files per our Cursor Agent Skill guide so context does not drift when operators switch clients. Install and gateway steps for Hermes are documented in our Hermes install walkthrough.

05Mac Hardware and Cloud Rental Matrix for CLI Workloads

CLI token volume is only half the equation. The host must keep SSH sessions alive, survive Xcode indexing beside agent loops, and provide enough unified memory when you colocate local inference with gateway processes.

WorkloadMinimum Mac tierMemoryRental vs local
Interactive Kilo / Claude Code onlyM4 Mac Mini16–24 GBLocal laptop OK; cloud Mac for distributed teams
Hermes gateway + MCP servers 24/7M4 Pro Mac Mini or Mac Studio32–48 GBCloud Mac rental avoids home-network outages
Hermes + local ds4 inferenceM4 Max / Studio96 GB+Dedicated bare-metal Mac; see ds4 guide
CI runners with Aider batch jobsM4 Mac Mini per runner24 GBOne cloud Mac per tenant boundary

Shared minute-pool macOS VPS offerings often suffer bandwidth jitter, oversubscription, and long-connection resets — especially painful when Kilo Code or Hermes issues thousands of tool calls over a twelve-hour background session. When you need an auditable production plane, NUKCLOUD multi-region bare-metal Mac and cloud Mac nodes align more cleanly with procurement and compliance documentation than generic oversubscribed hosts.

Sizing follows uptime and memory headroom, not the weekly app board. Hermes at 4.94T tokens does not mean you need a new chip every Monday; it means your gateway host must not sleep. Review specs on the pricing page, provision through the console, and finalize tenant boundaries on the order page before pointing production API keys at a new CLI default.

06Six-Step Runbook: CLI Selection on Dedicated Cloud Mac Agents

App rankings answer what the world is running this week. Your runbook must also answer where the client and gateway live. We recommend layering OpenRouter for model breadth on a NUKCLOUD dedicated Apple Silicon instance: run persistent agents on the host, keep interactive IDEs on developer laptops, and align repeatable prompts with your skill library.

  1. 01
    Subscribe to the app board, not just the model board: Every Monday, open openrouter.ai/apps and OpenRouter Rankings. Archive screenshots of CLI and model Top 10 entries. Add movers like Goose or Qwen Code to an observation list. Validate for two weeks before changing team defaults.
  2. 02
    Map scenarios to clients before mandating one tool: Interactive IDE work → Kilo Code or Cline; enterprise Claude policy → Claude Code; 24/7 gateway → Hermes; CI batch → Aider. Document the matrix in your internal AGENTS.md so contractors do not cargo-cult a single CLI.
  3. 03
    Reconcile OpenRouter bills by application ID: Each month, compare dollar spend per client with the app leaderboard token share. If Kilo Code traffic rose but spend stayed on Claude Code routes, your proxy configuration is stale — rebalance before the invoice normalizes the wrong habit.
  4. 04
    Provision a dedicated cloud Mac for persistent agents: Use the console to finalize region, SSH access, and tenant boundaries. Gateway long connections and GitHub Runners need hosts that will not be evicted by oversubscription. See the order page for specs: standard tier for gateway-only Hermes; 96GB+ unified memory if you colocate local inference.
  5. 05
    Deploy clients under launchd with explicit env files: Install Hermes or Goose per the install guide. Point OpenRouter base URLs at an internal proxy. Store API keys in tenant-scoped env files, not shell profiles. Capture recurring prompts as SKILL.md files to reduce instruction drift when switching model IDs.
  6. 06
    Run a biweekly retrospective: Adjust default clients when the app board shifts. If monthly API spend exceeds high-memory Mac rental and your codebase is sensitive, evaluate self-hosted inference on a dedicated Mac. If you only need 24/7 uptime, prioritize network stability and memory headroom over chasing new silicon. Cost reviews live on the pricing page.

07Frequently Asked Questions

Hermes is #1 overall but Kilo Code leads the CLI board — which number should I cite?
Cite both with context. Hermes aggregates gateway, bot, and headless traffic under one app ID, so it wins the all-apps chart at 4.94T. Kilo Code leads when buyers filter to interactive CLI/IDE clients. Procurement decks should use the filter that matches the buying decision.
Does Claude Code's 606B share mean Anthropic is losing the CLI war?
No — it means volume leadership moved to open agent stacks and IDE forks. Claude Code remains the right default when contracts require Anthropic-native tooling. Many enterprises run Claude Code for compliance-sensitive repos and Hermes for high-frequency automation on the same OpenRouter account.
Can we standardize on one CLI for the whole company?
Possible for small teams; unrealistic above ~20 engineers. Standardize on routing policy, secrets handling, and host tenancy instead. Let squads pick Kilo, Claude Code, or Aider by scenario while the platform team owns the cloud Mac gateway and OpenRouter proxy.
Why does this article focus on Mac hosts instead of Linux VPS?
Many CLI tools — Claude Code, Codex CLI, Kilo Code with GUI — assume macOS paths, Xcode toolchains, or Apple Silicon performance. Linux VPS works for headless Hermes and Aider, but mixed teams shipping iOS or macOS targets need Mac hardware. NUKCLOUD bare-metal Mac nodes cover both SSH headless and GUI remote desktop patterns.
How often do these CLI rankings change?
Weekly, sometimes sharply after a major release. Treat rankings like model boards: review every Monday, change defaults only after two consecutive weeks of momentum. Host sizing changes far less often — upgrade memory when workloads grow, not when a new CLI enters the Top 10.