A slightly sad realization hit us at VirtusLab recently - our "collection" of starred repositories has grown to ridiculous proportions. So we decided to change our approach: every other Wednesday, we pick one trending repo and take it apart piece by piece. This way, we'll understand it better and discover where we can genuinely help.
Today's subject has become one of the most talked-about open-source projects in the AI agent space, racking up over 11,000 GitHub stars in less than a month. We're looking at NanoClaw by Gavriel Cohen - a lightweight, container-isolated personal AI assistant that connects to WhatsApp (and Telegram, Discord, Slack, Signal) and runs on Anthropic's Agent SDK. Written in TypeScript, MIT-licensed, and built around one radical premise: the software that hosts powerful AI agents should be simple enough to read in eight minutes.

The OpenClaw Problem (or: Why 400,000 Lines of Code Should Scare You)
To understand NanoClaw, you first need to understand the phenomenon it's reacting to. OpenClaw (formerly Clawdbot, formerly Moltbot - the naming history alone deserves its own article) exploded in early 2026 as the viral open-source personal AI assistant. Over 300,000 users. Andrej Karpathy called its social network spinoff "the most incredible sci-fi take-off adjacent thing I have seen recently." Its creator, Peter Steinberger, was acqui-hired by OpenAI in February 2026 to build agents for a mainstream audience.

But there's a problem. OpenClaw weighs in at nearly half a million lines of code, 53 config files, 70+ dependencies, 52+ modules, and runs everything - including shell command execution and file access - in a single Node.js process with shared memory. Its security model is application-level: allowlists, pairing codes, and permission checks in code. As Cohen bluntly put it: "I can't sleep well running software I don't understand with access to my life."
And he's not alone. Security researchers flagged exposed instances on public IPs, plaintext credential leaks, and even a CVE (CVE-2026-25253) for one-click token exfiltration. When Cisco and CrowdStrike start warning about your personal assistant framework, you know something has gone structurally wrong.
NanoClaw is the response: same core functionality - messaging, memory, scheduled jobs, web access - but in a codebase you can actually audit. About 500 lines of core TypeScript. One process. A handful of files. And agents run in actual Linux containers, not behind application-level permission checks.
Architecture: Radical Simplicity as a Security Strategy
The architecture is almost comically simple compared to OpenClaw's gateway-router-adapter-queue model. A single Node.js process handles message ingestion via the Baileys library (an unofficial WhatsApp Web API client), persists everything to SQLite, and, when a message matches the trigger pattern (default: @Andy), spawns an isolated container with only the relevant group's directory mounted. Inside the container, Anthropic's Claude Agent SDK handles the actual reasoning.
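The trigger check is the gate that keeps the agent dormant until it's addressed. A minimal sketch of what such a check could look like (function and constant names are ours, not NanoClaw's actual internals):

```typescript
// Default trigger word, per the README; configurable by editing the source.
const TRIGGER = "@Andy";

function escapeRegExp(s: string): string {
  return s.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
}

// Only wake the agent when the trigger appears as a standalone mention:
// "@andy remind me" fires (case-insensitive), "me@Andy.example" does not.
function matchesTrigger(text: string, trigger: string = TRIGGER): boolean {
  const pattern = new RegExp(`(^|\\s)${escapeRegExp(trigger)}\\b`, "i");
  return pattern.test(text);
}
```

Everything that doesn't match simply gets persisted to SQLite and never touches a container, which keeps the expensive, risky path opt-in per message.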
Let's unpack the key architectural decisions:
Container isolation, not application sandboxing. On macOS, NanoClaw uses Apple Container - a lightweight, Apple Silicon-optimized container runtime that provides true VM-level isolation with its own kernel. On Linux, it uses Docker. The critical distinction: even if the agent gains root inside the container, it cannot reach the host filesystem. The isolation boundary is enforced by the operating system, not by if statements in JavaScript.
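The practical consequence of OS-enforced isolation is visible in how the container is launched: the host mounts exactly one directory and nothing else. A hedged sketch of what that launch could look like on the Docker path (the image name and mount point are our assumptions):

```typescript
// Illustrative, not NanoClaw's actual code: build the `docker run` argv so
// the agent container sees ONLY the one group directory it was given.
function buildRunArgs(groupDir: string): string[] {
  return [
    "run", "--rm",
    // Bind-mount a single directory; the rest of the host filesystem
    // simply does not exist inside the container's namespace.
    "-v", `${groupDir}:/workspace`,
    "nanoclaw-agent:latest", // hypothetical image name
  ];
}

// Usage would be something like:
//   spawn("docker", buildRunArgs("/data/groups/family-chat"))
// via node:child_process.
```

Even a fully compromised agent process can only damage what's inside that one mount.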
Per-group isolation. Each WhatsApp group (or Telegram channel, or Discord server) gets its own container, its own CLAUDE.md memory file, its own filesystem, and its own session state. The "Family Chat" group cannot see data from the "Sales Pipeline" group, and vice versa. This isn't just a nice-to-have — it's a fundamental security property. A prompt injection in one group cannot exfiltrate data from another.
Filesystem-based IPC. The host orchestrator and the containerized agent communicate through the filesystem, not through network sockets or shared memory. This is deliberately simple. The container writes responses to mounted files; the host reads them and routes them back to WhatsApp. No WebSocket control plane, no message broker, no Redis.
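Both halves of that IPC fit in a few lines. A minimal sketch, assuming a `response.json` file as the handoff point (the file name and payload shape are illustrative):

```typescript
import { existsSync, readFileSync, unlinkSync, writeFileSync } from "node:fs";
import { join } from "node:path";

interface AgentResponse { text: string; }

// Container side: the agent drops its reply into the mounted directory.
function writeResponse(groupDir: string, resp: AgentResponse): void {
  writeFileSync(join(groupDir, "response.json"), JSON.stringify(resp));
}

// Host side: poll for the reply, consume it, route it back to WhatsApp.
function tryReadResponse(groupDir: string): AgentResponse | null {
  const file = join(groupDir, "response.json");
  if (!existsSync(file)) return null; // agent hasn't answered yet
  const raw = readFileSync(file, "utf8");
  unlinkSync(file); // consume the file so the reply isn't delivered twice
  return JSON.parse(raw) as AgentResponse;
}
```

The whole "protocol" is two functions and a JSON file - which is exactly the point: there is nothing here to misconfigure or exploit beyond the filesystem permissions the OS already enforces.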
Single-file orchestrator. The main file (src/index.ts) handles state management, message loop, and agent invocation. A per-group message queue with concurrency control prevents race conditions. The entire codebase fits into roughly 35,000 tokens - about 17% of Claude Code's context window. This means a coding agent can ingest the full source, understand it completely, and modify it safely in a single session.
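A per-group serial queue is a few lines of promise chaining. Here's one plausible shape (an assumption of how the concurrency control might look, not a copy of src/index.ts):

```typescript
// Tail of the in-flight work for each group. Messages for the same group
// run strictly one at a time; different groups proceed independently.
const queues = new Map<string, Promise<void>>();

function enqueue(groupId: string, task: () => Promise<void>): Promise<void> {
  const tail = queues.get(groupId) ?? Promise.resolve();
  // Chain onto the group's tail; run the next task even if the
  // previous one rejected, so one failure can't wedge the queue.
  const next = tail.then(task, task);
  queues.set(groupId, next.catch(() => {}));
  return next;
}
```

Two messages arriving in the "Family Chat" group can never race each other's container, while "Sales Pipeline" work continues in parallel.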
The "AI-Native" Philosophy (and Why It's Fascinating)
NanoClaw's most provocative design decision isn't technical - it's philosophical. The project explicitly rejects the traditional software development model of "add features, ship updates, configure via YAML." Instead, it embraces what Cohen calls "AI-native" software:
No configuration files. Want to change the trigger word from @Andy to @Claude? Don't edit a config file. Tell Claude Code: "Change the trigger word to @Claude." It modifies the source directly.
No installation wizard. You clone the repo, run Claude, then run /setup. Claude Code handles dependencies, WhatsApp authentication (QR code scanning), container runtime selection, and service configuration - all through conversation.
Skills over features. This is the genuinely radical part. NanoClaw explicitly discourages pull requests that add features. Want Telegram support? Don't create a PR that adds Telegram alongside WhatsApp. Instead, contribute a skill - a Markdown file in .claude/skills/add-telegram/SKILL.md that teaches Claude Code how to transform a NanoClaw installation. Users then run /add-telegram, and Claude rewrites their local fork.
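Claude Code skills are Markdown files with a small frontmatter header. A hypothetical add-telegram skill might look roughly like this (the contents below are invented for illustration, not the actual skill):

```markdown
---
name: add-telegram
description: Transform this NanoClaw installation to use Telegram as a channel
---

# Add Telegram support

1. Install a Telegram bot library and obtain a bot token from @BotFather.
2. Mirror the existing WhatsApp message loop in `src/index.ts` for Telegram updates.
3. Route matched messages through the same per-group container flow.
```

The skill doesn't ship code; it ships instructions precise enough for Claude Code to write the code into the user's fork.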
The implications are worth thinking about. In this model, the codebase stays minimal forever. Every user's fork contains exactly the code they need and nothing more. There's no bloat from supporting 15 channel providers you'll never use. The "documentation" is Claude Code itself - you ask it questions about the code, and it reads the source and answers.
Is this the future of software? Maybe not for everything. But for personal tooling that sits at the intersection of "powerful enough to be dangerous" and "simple enough to audit," it's a compelling experiment.
Agent Swarms: The First Personal AI Assistant to Support Multi-Agent Collaboration
NanoClaw ships with support for Agent Swarms via the Anthropic Agent SDK - and claims to be the first personal AI assistant to offer this. In practice, this means you can ask your assistant to spawn a team of specialized sub-agents that collaborate on a single task.
The isolation model extends naturally to swarms: each sub-agent can run in its own container with its own memory context. This prevents cross-contamination between different functions - a sales analysis agent can't accidentally access healthcare data, even if both are operating in the same swarm.
This feature is still early, but the architecture is sound. The container-per-agent model that NanoClaw uses for group isolation maps cleanly onto the swarm use case. It's the kind of thing that's hard to retrofit onto a monolithic codebase but falls out naturally from NanoClaw's design.
How People Actually Use It
NanoClaw isn't just a demo - the Cohen brothers (Gavriel and Lazer, co-founders of AI agency Qwibit) use it to run their actual business operations. Their instance, named Andy, manages the sales pipeline:
- Sunday-through-Friday briefings at 9:00 AM with lead statuses and task assignments
- Messy WhatsApp notes and email threads forwarded into an admin group → Andy parses them, updates an Obsidian vault and SQLite database, sets follow-up reminders
- Weekly git history reviews for "documentation drift"
- The agent can even refactor its own functions to improve ergonomics for future sessions
This is the real test of any personal agent framework: does the creator actually trust it enough to run their business on it? In NanoClaw's case, the answer is yes — and the container isolation is a big part of why.
The Competitive Landscape: OpenClaw, NanoClaw, and the Personal Agent Wars
The personal AI assistant space has exploded in early 2026, and it's worth mapping the landscape:
- OpenClaw - The original viral phenomenon. Maximum capability surface, 50+ integrations, now transitioning to an independent foundation with OpenAI as sponsor. The "Linux" of personal agents: powerful, extensible, but you'd better know what you're doing.
- NanoClaw - The minimalist counterpoint. Same core features, 1/800th of the code. Container isolation. AI-native extensibility. The "Alpine Linux" — stripped down to essentials, security-first.
- Moltbot - OpenClaw's social network spinoff with over a million agents. Different use case entirely, but part of the same ecosystem.
- ZeroClaw, PicoClaw, memU — Various other alternatives that have emerged, each with different tradeoffs around complexity, security, and features.
The more interesting pattern is the gravitational pull toward competing labs. OpenClaw's creator went to OpenAI. NanoClaw runs exclusively on Anthropic's Agent SDK. The two leading open-source personal agents are now aligned with competing model providers — which tells you something about how strategically important the orchestration layer has become.
A Few Things That Made Us Think
On markdown as the native language of AI. Cohen argues that markdown files are becoming the artifacts for persisting and managing agentic processes. NanoClaw's CLAUDE.md per-group memory files, its skill system based on Markdown instructions, its Obsidian vault integration - all of it treats markdown as the universal interface between humans and agents. This resonates with what we've been seeing in the broader ecosystem with AGENTS.md, CLAUDE.md, and similar patterns.
On DRY being potentially harmful in the age of agents. Cohen makes a counterintuitive argument: the DRY principle (Don't Repeat Yourself) might be counterproductive when AI agents are modifying your code. When agents edit a shared function, they tend to make the change and move on without considering downstream effects. Duplicated code eliminates that class of side effects. "The overhead of maintaining duplicates doesn't cost that much anymore," Cohen told The New Stack. "You can run Claude Code on it, and it will apply the same changes throughout."
On the "Skills over Features" model. Docker popularized the idea that infrastructure should be composable. NanoClaw extends this to features: instead of everyone getting every feature, you compose your installation from skills. The trade-off is discoverability - you need to know that /add-telegram exists. But the benefit is that your codebase stays minimal and your security surface stays small.
Things We'd Like to See
The project is young (launched January 31, 2026) and growing fast. A few areas where there's room to mature:
Network isolation. NanoClaw's containers provide filesystem isolation, but the README doesn't detail network egress controls. Can the agent inside the container make arbitrary outbound HTTP requests? If yes, that's a data exfiltration vector. Matchlock (which we covered in a recent edition) solves this with deny-all networking and domain allowlisting - NanoClaw could benefit from similar controls.
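Deny-all egress would be a small change on the Docker path. A sketch of what it could look like (this is our assumption of a possible fix; the project does not document such a flag today, and the image name is hypothetical):

```typescript
// Build container launch args with optional deny-all networking.
function buildLockedDownArgs(groupDir: string, allowNetwork: boolean): string[] {
  const args = ["run", "--rm", "-v", `${groupDir}:/workspace`];
  if (!allowNetwork) {
    // Docker's `--network none` gives the container no network
    // namespace at all - the agent cannot phone home.
    args.push("--network", "none");
  }
  args.push("nanoclaw-agent:latest"); // hypothetical image name
  return args;
}
```

Domain-level allowlisting (rather than all-or-nothing) would need a proxy or firewall layer in front of the container, which is where a Matchlock-style approach comes in.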
Audit trail. For anyone using this for business operations (as the Cohens do), a comprehensive log of every agent action - commands executed, files modified, API calls made - would be valuable. Right now, debugging means asking Claude "what happened?"
Multi-model support. NanoClaw is tightly coupled to Anthropic's Agent SDK, which means Claude is the only model option. OpenClaw supports multiple LLM backends. For some users, model flexibility matters.
Skill discoverability. The skills system is brilliant in concept, but without a registry or marketplace (OpenClaw has ClawHub), discovering available skills relies on browsing GitHub issues and the contributing guide.
Conclusions: Small Software for Big Agents
NanoClaw is a statement about what software should look like in the age of AI agents. When the thing running your code has near-human reasoning capabilities and can execute arbitrary commands, the traditional response has been to add more guardrails in application code - more permission checks, more allowlists, more configuration. NanoClaw argues the opposite: make the code so small that you can read all of it, and let the operating system enforce the boundaries.
Is it perfect? No. The network isolation story needs work, the mono-model dependency is limiting, and the "just ask Claude" debugging model won't satisfy everyone. But as a proof of concept for what personal AI assistants could look like — simple, auditable, secure by architecture rather than by policy — it's the most interesting project in the space right now.
The fact that swyx (Shawn Wang, one of the most respected voices in the AI developer community) and Andrej Karpathy recommended NanoClaw and that Docker published an official guide for running it in their shell sandboxes suggests the thesis is resonating.
When AI agents become powerful enough to manage your business, the code that orchestrates them should be simple enough to understand. NanoClaw takes that idea seriously.

NanoClaw is available at github.com/qwibitai/nanoclaw. Setup: git clone, cd nanoclaw, claude, then /setup. Website: nanoclaw.dev. Discord: discord.gg/VGWXrf8x.




