Hermes Agent

Hermes Agent is Nous Research's open-source, model-agnostic agent harness; distinct from the Hermes LLM line that shares the name. Where Hermes (the model) is a fine-tuned base, Hermes Agent is the surrounding scaffolding that turns any LLM into a long-running personal assistant with skills, memory,

Canonical version: Hermes Agent.

Hermes Agent is Nous Research's open-source, model-agnostic agent harness; distinct from the Hermes LLM line that shares the name. Where Hermes (the model) is a fine-tuned base, Hermes Agent is the surrounding scaffolding that turns any LLM into a long-running personal assistant with skills, memory, and cross-platform reach.

The differentiator versus Claude Code, Codex CLI, and Gemini CLI is the built-in learning loop. Hermes Agent does not stop at executing tasks; it autonomously creates skills from successful trajectories, refines them in use, persists facts about the user across sessions, and searches its own past conversations. The whole stack is positioned closer to a personal-assistant harness than to a coding-only CLI.

What it actually does

  • Skills system; autonomous skill creation after complex tasks; skills self-improve during use; compatible with the open agentskills.io standard, which means skills are portable across other harnesses.
  • Persistent memory; agent-curated facts with periodic "nudges" that prompt the agent to record what it learned. Backed by FTS5 full-text search over session history with LLM summarization for cross-session recall.
  • User modeling; uses Honcho (an external dialectic user-modeling library) to build an evolving model of who the user is and what they care about.
  • Multi-platform reach; ships a single gateway that bridges 15+ platforms — Telegram, Discord, Slack, WhatsApp, Signal, Email, and more; voice memos transcribed in. The agent is reachable from anywhere, not just the terminal.
  • Voice mode; real-time interaction across the CLI and chat surfaces; voice in, voice out, no separate app.
  • Cron scheduler; built-in unattended automations.
  • Subagents; isolated subagent spawning for parallel workstreams (the "Hermes Agent Kanban" pattern from Nous Research's announcements).
  • Tool ecosystem; 40+ built-in tools; Python script integration via RPC for custom tool calls.
  • Multi-backend execution; six terminal backends; local, Docker, SSH, Daytona, Singularity, Modal. Daytona and Modal support hibernation, keeping idle costs minimal on serverless.

Model agnosticism

Hermes Agent does not lock to Nous's own models. Switch via hermes model without code changes:

  • Nous Portal (first-party).
  • OpenRouter (200+ models).
  • NVIDIA NIM, Xiaomi MiMo, z.ai / GLM, Kimi / Moonshot, MiniMax.
  • Hugging Face, OpenAI, custom endpoints.
  • xAI Grok via OAuth (May 2026); sign in with a Grok / SuperGrok subscription, no API key. First subscription-OAuth provider on the list; exposes Grok 4.3 for text/reasoning, Grok TTS for voice, and Grok Imagine for image/video. Available on every Grok tier. Pick via hermes model → "xAI Grok OAuth (SuperGrok Subscription)". Docs; https://hermes-agent.nousresearch.com/docs/guides/xai-grok-oauth ; announcement; https://x.ai/news/grok-hermes

That makes it one of the most genuinely provider-agnostic harnesses on the map; closer to OpenCode in philosophy than to first-party CLIs. The Grok OAuth path is structurally novel; it is the first time a frontier closed-weight provider lets a consumer subscription act as an agent backend, bypassing the usual API-key-and-billing-account flow.

Installation

Linux, macOS, WSL2, Android via Termux. Single installer:

curl -fsSL https://raw.githubusercontent.com/NousResearch/hermes-agent/main/scripts/install.sh | bash

Common entry points:

  • hermes — interactive CLI.
  • hermes model — pick provider and model.
  • hermes tools — enable / disable tools.
  • hermes gateway — start the messaging gateway.
  • hermes setup — full configuration wizard.
  • hermes claw migrate — automatic import from OpenClaw (settings, memories, skills, API keys); supports dry-run and selective presets.

The OpenClaw migration path is notable; Nous explicitly absorbed the OpenClaw user base by making the upgrade frictionless.

Slash commands

Same commands work in the CLI and across messaging surfaces:

  • /new, /reset — fresh conversation.
  • /model [provider:model] — change LLM mid-session.
  • /personality [name] — switch persona.
  • /retry, /undo — reverse the last action.
  • /compress, /usage — context management.
  • /skills — browse procedural memory.

Self-evolution

The companion hermes-agent-self-evolution project applies DSPy + GEPA to automatically optimize skills, tool descriptions, system prompts, and code. It reads execution traces, proposes targeted variants via API calls (no GPU training required), evaluates them against held-out traces, and gates results behind tests, size limits, and human review before raising a PR. Synthetic data or real session history (including from Claude Code) can drive the evaluation set.

This is one of the few harnesses where the harness itself ships with a documented improvement loop, not just a tool registry.

Orchestrating other harnesses

Hermes ships first-party skills under official/autonomous-ai-agents/ that let it drive other coding harnesses as sub-agents — Claude Code, Codex CLI, OpenCode, Hermes itself, and (May 2026, announced by Teknium) OpenHands. Install with hermes update && hermes skills install official/autonomous-ai-agents/<name>. Once installed, Hermes auto-discovers them when asked to delegate, or force-load with /<agent-name> <prompt>. Cleanest path today to compose multiple agent harnesses without writing glue.

Where it sits among harnesses

  • vs Claude Code / Codex CLI; coding-focused; Hermes Agent is broader (personal assistant, messaging, automations).
  • vs OpenClaw; same niche; Hermes Agent is the more actively developed successor with first-class migration tooling.
  • vs Aider / OpenCode; similar provider-agnosticism; Hermes Agent adds learning loop, gateway, scheduler.
  • vs Claude Managed Agents; Hermes runs on your hardware, including a $5 VPS, with no managed-service dependency.

Trade-offs

  • Surface sprawl; messaging gateways, schedulers, subagents and learning loops are powerful but raise operational complexity. A coding-focused user is better served by a focused CLI.
  • Self-improvement risks; an agent that mutates its own skills is an agent whose behavior drifts. Treat the self-evolution loop as opt-in, not a default.
  • Discovery cost; 40+ tools, multiple backends, and a skills hub mean a real onboarding curve. Worth it for a daily-driver harness; expensive for a one-off.

License

MIT.

References


About Sébastien

I'm Sébastien Dubois, and I'm on a mission to help knowledge workers escape information overload. After 20+ years in IT and seeing too many brilliant minds drowning in digital chaos, I've decided to help people build systems that actually work. Through the Knowii Community, my courses, products & services and my Website/Newsletter, I share practical and battle-tested systems.

I write about Knowledge Work, Personal Knowledge Management, Note-taking, Lifelong Learning, Personal Organization, Productivity, and more. I also craft lovely digital products and tools.

If you want to follow my work, then become a member and join our community.

Ready to get to the next level?

If you're tired of information overwhelm and ready to build a reliable knowledge system:

Found this valuable? Share it with someone who needs it.

Join 6,000+ readers. Get practical systems for knowledge & AI. Free.

Subscribe ✨

Free: Knowledge System Checklist

A clear roadmap to building your own knowledge system. Subscribe and get it straight to your inbox.

6,000+ readers. No spam. Unsubscribe anytime.