Future roadmap: what's next

The long-term vision for this setup is a self-healing, autonomous development environment:

A workstation that reproduces itself from source (git clone + nixos-rebuild switch)
Autonomous agents that handle ops triage, code review, and routine maintenance
Agent runners isolated in Firecracker microVMs with zero secrets
A home server that converges storage, backups, CI, and agent orchestration
All of it declared in Nix, with every change audited and version-controlled

Here's the concrete roadmap.

Phase 1: complete the autonomous agent runner

The design is validated (autonomous-agents-design.md) and the implementation plan is written. Execution is structured as milestones:

Milestone 0 — scaffolding and gVisor runtime

[x] Design document written and reviewed
[x] Implementation plan with all 6 milestones
[ ] Install gVisor (runsc) and register as Docker runtime
[ ] First tracer bullet: docker run --runtime=runsc hello-world

Milestone 1 — isolation boundary

[ ] Minimal sandbox image with scoped filesystem
[ ] Verify ~/.aws, ~/.ssh are unreachable from inside sandbox
[ ] Verify egress to arbitrary hosts fails

Milestone 2 — MCP broker

[ ] TypeScript MCP server with the 7 audited tools
[ ] Read-only DB tool with statement allowlist
[ ] Jira read + create-ticket (scoped project allowlist)
[ ] GitHub open-PR (no merge, no push-to-main)
[ ] Structured audit log (every call, every parameter)

Milestone 3 — egress allowlist

[ ] Host-side proxy (squid or nftables)
[ ] Allowlist: bedrock-runtime + package registries + git hosts
[ ] Default-deny for everything else

Milestone 4 — Bedrock token minter

[ ] Scoped AssumeRole session with bedrock-only policy
[ ] Token injection into sandbox at job start
[ ] Token expiry of 12 hours maximum

Milestone 5 — supervisor and job lifecycle

[ ] systemd --user supervisor that spawns/tears down per-job sandboxes
[ ] Manual trigger interface (no auto-triggering from external sources)
[ ] Audit log aggregation

Milestone 6 — security audit

[ ] External security review
[ ] Penetration testing against the isolation boundary
[ ] Go-live with daily ops agent

Phase 2: Firecracker microVMs

gVisor is the starting point because it's simpler to integrate (Docker runtime, no kernel management). The real target is Firecracker microVMs, which provide hardware-level isolation.

The repo already has a working Firecracker setup (nix/vms/hermes/) with TAP device + NAT networking, port forwarding (VM:3000 to host:3000 for Hermes Studio), systemd lifecycle management, and krops-based deploy (NixOS config push via SSH).

nixosConfigurations.hermes-vm  →  builds the VM rootfs
packages.krops-hermes-deploy   →  deploys via SSH
packages.hermes-vm-image       →  builds the raw ext4 image
services.firecracker-hermes    →  systemd unit for VM lifecycle

The transition from gVisor to Firecracker means each agent gets a dedicated kernel (not just a user-space syscall interceptor), stronger resource isolation (dedicated vCPU, pinned memory), and hardware-assisted security via KVM.

Phase 3: Heracles — the Hermes agent evolution

The Hermes system (already running in the Firecracker VM) will evolve into a full agent platform called Heracles:

Studio: Web UI for agent task management, audit log browsing, and manual review of agent outputs
Cron: Scheduled agent runs (daily ops triage, weekly code health checks)
MCP catalog: Agent-discoverable MCP tool registry
Insights: Token usage tracking and cost attribution

The Hermes VM currently runs with 2 vCPUs and 1 GB RAM. That's enough for the ops agent (I/O-bound, reading Jira/CloudWatch/DBs) and the code agent (CPU-bound but limited by LLM latency, not sandbox compute).

Phase 4: full NixOS switch

The NixOS migration still has items on the list:

CURRENT                → TARGET
─────────────────────────────────────────────────────
Fedora + GNOME         → NixOS + Hyprland
dnf + topgrade         → nixos-rebuild switch
mise for languages     → nix develop devshells
GNOME Shell            → Hyprland + waybar + walker
creatlinks.sh          → home-manager (already done)

The gap is NVMe-backed full disk encryption. The current machine has a hardware limitation that prevents a seamless reinstall. The workaround:

Continue running home-manager switch --flake .#usman on Fedora (already works)
When a new machine arrives, NixOS goes on it first
The old machine becomes the homeserver or a build worker

Phase 5: skill architecture improvements

The current skill deployment uses three mechanisms (Nix store, live git clone, npm skills CLI). This works but is awkward. The target:

One mechanism. Either Nix-managed (read-only, reproducible) or a lightweight skills CLI that syncs from a manifest.
Dependency graph. Skills that reference other skills should declare that dependency.
Version pinning. Every skill should be pinned to a commit or semver range, with automatic hash verification.

The target state:

dotfilesctl skills add <org>/<repo>@<version>
dotfilesctl skills sync    # installs missing, updates pinned
dotfilesctl skills status  # shows drift from pinned versions

Phase 6: home server convergence

A dedicated home server (the old laptop, currently running basic services) should converge:

Backup target: restic repository, rclone cache, nightly rsync
Build worker: offload Nix builds (faster on always-on server)
Agent runner: Firecracker microVMs for long-running agents
Ollama server: local LLM inference for privacy-sensitive agent tasks
Tailscale exit node: route all mobile traffic through home for ad-blocking

The server runs the same flake (different hostname, different hardware-configuration.nix), sharing 90% of the config.

Phase 7: devshell expansion

Current devshells cover the main languages, but some are undocumented in use:

DevShell	Status	Notes
`java`	Active	Maven, JDK 21, daily use
`node`	Active	Frontend work
`rust`	Active	CLI tools, custom MCPs
`go`	Active	Tooling, daemons
`python`	Active	uv2nix-backed, AI tools
`cuda`	Occasional	ML model development
`android`	Occasional	Mobile builds
`ai`	Active	ML Python stack
`pageindex`	Active	Document indexing

The goal: each devshell should have a README.md (or Nix doc comment) explaining what it's for and how to use it.

Phase 8: dependency audit and reduction

The current home.packages list has about 70 packages. Some are redundant and should be reviewed quarterly. Remove anything not used in 90 days.

The ten-year arc

Beyond the concrete phases:

Reproducible everything. Every tool, config, service, and development environment is a Nix expression. The "works on my machine" problem is eliminated.

Default-deny agent execution. AI agents can only do what an audited, version-controlled policy allows. The Pattern 2 architecture (zero-secret, MCP-mediated) becomes the standard for agent deployment.

Self-healing infrastructure. The system detects drift, anomalies, and failures and fixes them via agent-driven remediation within the zero-secret constraint. A backup that failed? The agent retries with a different strategy. A Tailscale node dropped? The agent re-authenticates.

Converged home infrastructure. Laptop + home server + cloud VMs all share one config. The flake is the single source of truth for every machine.

Skills as code. AI agent skills are version-controlled, dependency-managed, and tested, just like software libraries. The line between code and prompt blurs, but reliability improves.

When?

This is a hobby project, so there are no deadlines. The roadmap is ordered by priority:

Q3 2026: Milestones 0-3 of the autonomous agent runner (gVisor + broker + egress)
Q4 2026: Milestones 4-6 (Bedrock token minter + supervisor + audit) and Phase 2 Firecracker transition
2027: Full NixOS switch, Heracles Studio, skill architecture overhaul
Longer term: Home server convergence, devshell expansion, dependency reduction

The repo is at ~/dotfiles/. Every step of this roadmap is in the git log. If you're building something similar, the nix/ directory, autonomous-agents-design.md, and this series are the canonical references.