╔═══════════════════════════════════════════════════════════════╗
║ ║
║ █████╗ ██████╗ ███████╗███╗ ██╗████████╗███████╗ ║
║ ██╔══██╗██╔════╝ ██╔════╝████╗ ██║╚══██╔══╝██╔════╝ ║
║ ███████║██║ ███╗█████╗ ██╔██╗ ██║ ██║ ███████╗ ║
║ ██╔══██║██║ ██║██╔══╝ ██║╚██╗██║ ██║ ╚════██║ ║
║ ██║ ██║╚██████╔╝███████╗██║ ╚████║ ██║ ███████║ ║
║ ╚═╝ ╚═╝ ╚═════╝ ╚══════╝╚═╝ ╚═══╝ ╚═╝ ╚══════╝ ║
║ ██╗███╗ ██╗ █████╗ ║
║ ██║████╗ ██║ ██╔══██╗ ║
║ ██║██╔██╗ ██║ ███████║ ║
║ ██║██║╚██╗██║ ██╔══██║ ║
║ ██║██║ ╚████║ ██║ ██║ ║
║ ╚═╝╚═╝ ╚═══╝ ╚═╝ ╚═╝ ║
║ ██████╗ ██████╗ ██╗ ██╗ ║
║ ██╔══██╗██╔═══██╗╚██╗██╔╝ ║
║ ██████╔╝██║ ██║ ╚███╔╝ ║
║ ██╔══██╗██║ ██║ ██╔██╗ ║
║ ██████╔╝╚██████╔╝██╔╝ ██╗ ║
║ ╚═════╝ ╚═════╝ ╚═╝ ╚═╝ ║
║ ║
╚═══════════════════════════════════════════════════════════════╝
A complete ecosystem for AI-assisted development
115 Rust Modules · 71 Skills · 37 Agents · 9 AI Tools · Knowledge Graph
A terminal-native ecosystem for managing AI coding agents. Built around a Rust TUI that orchestrates Claude Code, Codex, Gemini, and Copilot sessions with git worktree isolation, and a portable toolkit of skills, agents, and workflows that plug into 9 different AI coding tools.
Live dashboard: multi-workspace sidebar, session preview pane, and tmux-backed persistent sessions
Built-in usage analytics: 11.9B tokens tracked across 45 projects and 487 sessions, by provider and by day
| Component | What it does | Scale |
|---|---|---|
| ainb TUI | Rust terminal app for managing Claude Code sessions | 115 modules |
| Toolkit | Portable skills, agents, and workflows for AI coding tools | 71 skills, 37 agents |
| Knowledge System | GraphRAG + QMD learning capture and retrieval | Architecture docs |
Most AI coding setups are a loose collection of dotfiles. This project treats the problem as an engineering system:
- One toolkit, many tools — Write a skill once, deploy it to Claude Code, Codex, Gemini, Cursor, Copilot, Amazon Q, Cline, Roo, or Clawdhub
- Session isolation — Each coding session gets its own git worktree and tmux session. No cross-contamination
- Agents that compose — 37 specialized agents (backend-developer, security-agent, architecture-reviewer, etc.) that can be orchestrated into swarms
- Memory that persists — A two-tier knowledge system (GraphRAG + QMD) that captures learnings and retrieves them across sessions and projects
- Production Rust — The TUI isn't a shell script. It's 115 modules of typed, tested, async Rust with clippy pedantic/nursery lints
# Install the TUI
brew tap stevengonsalvez/ainb && brew install ainb
# Install the toolkit for your AI tool
cd toolkit && npm install && node create-rule.js --tool=claude-code-4.5
# Launch
ainbA Rust-based terminal application for managing AI coding sessions with git worktree isolation, model selection, and persistent tmux sessions. Every operation is available as both an interactive TUI view and a scriptable CLI subcommand with JSON output — so humans drive it from a dashboard and agents drive it from shell scripts.
- Multi-provider — Run Claude Code, Codex CLI, Gemini CLI, or GitHub Copilot in the same workflow, with Sonnet / Opus / Haiku selection per session
- Git worktree isolation — Each session runs in its own branch and working directory. No cross-contamination, no stash dance
- tmux persistence — Sessions survive terminal disconnects, SSH drops, and laptop sleep. Reattach any time
- Usage analytics — Built-in token + session tracking by day, week, provider, and project. Know where your budget went
- Easy onboarding — First-run setup wizard checks dependencies, configures auth, and gets you creating sessions in minutes
- Live log streaming — Real-time viewer with level filtering and search across all running sessions
- Scriptable CLI — 15 commands with
--format jsonoutput for every piece of state. 📘 Full CLI reference →
For agents, automation, and scripts, ainb ships a full CLI. Every command supports --format json for piping to jq.
ainb --help # Top-level overview
ainb run --repo . --worktree --tool claude --model sonnet
ainb list --format json | jq .
ainb logs my-session --follow
ainb recover list # Find orphaned sessions
ainb config set authentication.default_model opus
ainb completion zsh > ~/.zsh/completions/_ainb15 top-level commands — run, list, logs, attach, status, kill, auth, recover, config, git, favorites, init, presets, completion, tui — with nested subcommands for recover / config / git / favorites / presets.
📘 Full CLI reference → ainb-tui/docs/CLI.md
Homebrew (macOS / Linux)
brew tap stevengonsalvez/ainb
brew install ainbOne-liner install
curl -fsSL https://raw.githubusercontent.com/stevengonsalvez/agents-in-a-box/v2/ainb-tui/install.sh | bashCargo (any platform)
cargo install --git https://github.com/stevengonsalvez/agents-in-a-box --branch v2 agents-box
# Optionally alias: alias ainb="agents-box"Windows (WSL)
# 1. Install WSL2
wsl --install
# 2. Inside Ubuntu/Debian
curl -fsSL https://raw.githubusercontent.com/stevengonsalvez/agents-in-a-box/v2/ainb-tui/install.sh | bash
sudo apt update && sudo apt install -y tmux
ainbainb requires tmux for persistent sessions, which is Unix-only. WSL provides the best Windows experience.
| Key | Action |
|---|---|
j/k or ↑/↓ |
Navigate sessions |
Enter |
Attach to session |
n |
New session |
d |
Delete session |
r |
Restart Claude in session |
l |
View logs |
q |
Quit |
| Platform | Status | Method |
|---|---|---|
| macOS Apple Silicon | ✅ | Pre-built binary |
| macOS Intel | ✅ | Build from source |
| Linux x86_64 | ✅ | Pre-built binary |
| Linux ARM64 | ✅ | Build from source |
| Windows (WSL2) | ✅ | Install script |
| Windows (Native) | ❌ | Use WSL |
- tmux — persistent session management
- git — worktree operations
- Claude Code CLI — the
claudecommand
A portable AI coding agent toolkit: skills, agents, workflows, and configurations that deploy to 9 different AI coding tools from a single source.
| Tool | Deploy target | Method |
|---|---|---|
| Claude Code | ~/.claude/ |
Home directory |
| Codex | ~/.codex/ |
Home directory |
| GitHub Copilot | ~/.copilot/ |
Home directory |
| Gemini CLI | .gemini/ |
Project directory |
| Amazon Q | .amazonq/rules/ |
Project directory |
| Cursor | Project root | Project directory |
| Cline | Project root | Project directory |
| Roo | Project root | Project directory |
| Clawdhub | Project root | Project directory |
Skills are reusable capabilities that any supported AI tool can invoke.
Workflow & Planning
plan · plan-tdd · plan-gh · implement · validate · workflow · brainstorm · critique · discuss · expose · interview
Code Quality & Testing
commit · find-missing-tests · webapp-testing · security-audit · security-scan · simplify
DevOps & Infrastructure
start-local · start-ios · start-android · spawn-agent · tmux-monitor · tmux-status · expose · debug-bridge
Knowledge & Learning
reflect · global-learnings · research · research-cache · instincts · compound-docs · prime
Session Management
health-check · session-info · session-metrics · session-summary · handover · recover-sessions · plugins
Swarm Orchestration
swarm-create · swarm-join · swarm-inbox · swarm-status · swarm-shutdown · swarm-orchestration · swarm-agent-troubleshooting
GitHub & Issues
gh-issue · make-github-issues · do-issues · merge-agent-work · list-agent-worktrees · attach-agent-worktree · cleanup-agent-worktree
Design & Frontend
ui-ux-pro-max · frontend-design · frontend-slides · tui-style-guide · tui-screen · liquid-glass · remotion-best-practices
Research & Analysis
crypto-research · oracle · notebooklm · sentry-cli · ats-resume-matcher · resume-formatter · retro-pdf
Agent Architecture
skill-creator · agent-ops · autonomous-loops · cost-aware-pipeline · media-processing · nano-banana-pro · sync-learnings · claude-developer-platform
Specialized AI agents organized by domain. Each agent has a defined persona, tool access, and area of expertise.
| Category | Agents |
|---|---|
| Universal | backend-developer · frontend-developer · superstar-engineer |
| Orchestrators | tech-lead-orchestrator · project-analyst · team-configurator |
| Engineering | api-architect · architecture-reviewer · code-archaeologist · code-reviewer · dev-cleanup-wizard · devops-automator · documentation-specialist · gatekeeper · integration-tests · lead-orchestrator · migration · performance-optimizer · planner · playwright-test-validator · property-mutation · release-manager · security-agent · service-codegen · solution-architect · tailwind-css-expert · test-writer-fixer |
| Design | ui-designer |
| Swarm | swarm-leader · swarm-worker |
| Meta | agentmaker · reflect |
| Root | distinguished-engineer · web-search-researcher |
A two-tier learning system that captures insights during development and retrieves them across sessions and projects.
| Layer | Technology | Purpose |
|---|---|---|
| Fast local | QMD (Quick Markdown Documents) | Semantic search over structured learning notes |
| Deep graph | GraphRAG (nano-graphrag) | Entity-relationship graph with community detection for cross-project knowledge retrieval |
The /reflect skill captures learnings. The /research and /prime skills retrieve them. The /global-learnings skill manages the knowledge base directly.
How the knowledge system works →
agents-in-a-box/
│
├── ainb-tui/ # Rust TUI application
│ ├── src/ # 115 modules
│ │ ├── app/ # State machine & event handling
│ │ ├── components/ # TUI screen components
│ │ ├── widgets/ # Reusable UI widgets
│ │ ├── docker/ # Container management
│ │ ├── tmux/ # Session & PTY integration
│ │ ├── git/ # Worktree operations
│ │ ├── claude/ # Claude API client
│ │ ├── models/ # Data models
│ │ └── config/ # Configuration handling
│ ├── deny.toml # License & security policy
│ ├── Formula/ # Homebrew formula
│ └── install.sh # One-liner installer
│
├── toolkit/ # Portable AI agent toolkit
│ ├── packages/
│ │ ├── skills/ # 71 reusable skills
│ │ ├── agents/ # 37 agent definitions
│ │ │ ├── universal/ # Cross-stack specialists
│ │ │ ├── engineering/ # Backend & infra agents
│ │ │ ├── orchestrators/ # Team coordination
│ │ │ ├── design/ # UI/UX specialists
│ │ │ ├── swarm/ # Multi-agent coordination
│ │ │ └── meta/ # Agent creation & reflection
│ │ ├── workflows/ # Structured delivery workflows
│ │ └── utilities/ # Shared utilities
│ ├── bootstrap.js # Multi-tool deployment engine
│ └── create-rule.js # CLI installer
│
├── docs/ # Documentation
│ └── how-reflection-works.md # Knowledge system architecture
│
└── .github/workflows/
├── ci.yml # Rust CI (fmt, clippy, test, deny, machete)
├── toolkit-validation.yml # Toolkit structure & install validation
└── release.yml # Cross-platform binary releases
| Check | Tool | What it catches |
|---|---|---|
| Format | rustfmt |
Style inconsistencies |
| Lint | clippy (pedantic + nursery) |
Logic errors, anti-patterns, code smells |
| Test | cargo-nextest (Ubuntu + macOS) |
Regressions across platforms |
| Security | cargo-deny (RustSec) |
Known vulnerabilities in dependencies |
| Licenses | cargo-deny |
Non-compliant dependency licenses |
| Dead deps | cargo-machete |
Unused crate declarations |
| Toolkit structure | Custom validation | Package counts, template substitution, install verification |
The Rust codebase enforces unsafe_code = "forbid" and runs clippy with pedantic, nursery, and cargo lint groups enabled.
cd ainb-tui
cargo build --release
./target/release/agents-boxcd ainb-tui
cargo test # Unit tests
cargo test --features visual-debug # With terminal output
cargo test --features vt100-tests # VT100 screen verification
cargo nextest run # With nextest (parallel)cd ainb-tui
cargo fmt --check # Format check
cargo clippy --all-targets # Lint
cargo deny check # Security + licensescd toolkit
npm install
node create-rule.js --tool=claude-code-4.5 # Deploy to ~/.claude/
node create-rule.js --tool=gemini # Deploy to .gemini/
node create-rule.js --tool=codex # Deploy to ~/.codex/- Fork the repository
- Create a feature branch (
git checkout -b feature/amazing-feature) - Commit your changes (
git commit -m 'feat: add amazing feature') - Push to the branch (
git push origin feature/amazing-feature) - Open a Pull Request
MIT — see LICENSE for details.




