arch-agent

Structured architecture design for AI-assisted development.

arch-agent turns Claude Code into an opinionated architecture reviewer that guides your team through a rigorous, phase-gated design process. It challenges assumptions, enforces decision gates, and produces a documented architecture — not just a generated wall of text.

The Problem

Architecture decisions are the most expensive to reverse. Yet most teams either:

Skip formal design and pay for it later in rework
Generate AI docs that say "yes" to everything and reflect no real constraints
Write documents once that become outdated and disconnected from reality

arch-agent solves this by combining AI reasoning with human-controlled gates. The AI proposes and challenges. You decide. Every decision is logged with rationale, alternatives, and trade-offs.

Where arch-agent Fits

flowchart LR
    subgraph Input
        PRD[".arch/prd.md\n(requirements)"]
        CTX[".arch/org-context.md\n(team, stack, constraints)"]
    end

    subgraph arch-agent["arch-agent + Claude Code"]
        direction TB
        P1["Phase 1\nEvaluate PRD"]
        P2["Phase 2\nDecide Architecture"]
        P3["Phase 3\nDesign Components"]
        P4["Phase 4\nValidate & Document"]
        P1 --> P2 --> P3 --> P4
    end

    subgraph Artifacts["Generated Artifacts"]
        DL[".arch/decisions.md"]
        CS[".arch/components/*.md"]
        RV[".arch/reviews/*.md"]
        AD["output/\narchitecture-document.md"]
    end

    PRD --> P1
    CTX --> P1
    P4 --> DL
    P4 --> CS
    P4 --> RV
    P4 --> AD

Input: Your PRD and organizational context (team size, tech stack, constraints), placed in the .arch/ directory.

Output: A decision log, detailed component specs, adversarial review findings, and a comprehensive architecture document — every choice backed by rationale and alternatives. Artifacts are produced throughout the phases, not only at the end.

End-to-End Workflow

Here is the complete journey from requirements to architecture output:

1. Scaffold the project

npx arch-agent init --name "My Project"

This creates .arch/ (state machine, validation scripts, working files), .claude/ (slash commands, skills, hook configuration), and CLAUDE.md (agent identity and rules).

2. Provide your requirements

Write or paste your Product Requirements Document into .arch/prd.md. Optionally describe your team size, tech stack, and constraints in .arch/org-context.md. If you skip org-context, the agent interviews you during Phase 1 (15 questions across scale, team, and constraints).

3. Start Claude Code and run the architecture process

claude

Then type /analyze-prd to begin. The agent walks you through four phases:

Phase	What Happens	Command	Artifact Produced
1. Evaluate	Analyzes your PRD, extracts requirements, finds gaps, assesses risks	`/analyze-prd`	`.arch/phase1-evaluation.md`
2A. Pattern	Proposes architecture pattern with rationale and alternatives	`/propose-methodology`	`.arch/phase2-methodology.md`
2B. Components	Maps all system components, dependencies, and integration points	`/propose-methodology`	`.arch/phase2-components-overview.md`
2C. Cross-Cutting	Locks auth, observability, deployment, error handling, data management	`/propose-methodology`	`.arch/phase2-cross-cutting.md`
3. Design	Details each component: technology + version, API contracts, failure modes, traceability to requirements	`/design-component [name]`	`.arch/components/[name].md`

Note: The [name] argument is optional. If omitted, the agent picks the next component in logical dependency order (foundational first). After each /accept, the agent auto-advances to the next pending component — you only need to call /design-component once. | 4. Document | Validates end-to-end consistency, builds risk register, generates final doc | /generate-docs | output/architecture-document.md |

4. Review and accept at each gate

At every phase boundary, the agent presents its proposal. You review it and:

/accept — Confirm the proposal and advance to the next phase
/refine [feedback] — Request specific changes while staying in the current phase
/alternative [request] — Request a completely different approach

The agent cannot skip phases or auto-accept. This is enforced by Python validation hooks at the file system level, not by prompt instructions.

5. Collect your architecture

When complete, your project contains a full set of architecture artifacts: evaluated requirements, accepted decisions with rationale, detailed component designs, and a comprehensive architecture document in output/architecture-document.md.

How Artifacts Evolve Across Phases

Architecture knowledge builds incrementally. Each phase produces artifacts that feed into the next:

flowchart TD
    subgraph Phase1["Phase 1: Evaluate"]
        E1["Extracted requirements (FR-001, FR-002...)"]
        E2["Identified gaps & risks"]
        E3["phase1-evaluation.md"]
    end

    subgraph Phase2["Phase 2: Decide"]
        D1["Architecture pattern + rationale"]
        D2["Component map with dependencies"]
        D3["Cross-cutting constraints\n(auth, observability, deployment,\nerror handling, data management)"]
    end

    subgraph Phase3["Phase 3: Design"]
        C1["Component spec: technology + version"]
        C2["API contracts & integration points"]
        C3["Failure modes & scaling strategy"]
        C4["Cross-cutting compliance check"]
    end

    subgraph Phase4["Phase 4: Document"]
        F1["End-to-end validation"]
        F2["Risk register"]
        F3["architecture-document.md"]
    end

    Phase1 -->|"requirements inform\npattern selection"| Phase2
    Phase2 -->|"cross-cutting decisions\nconstrain every component"| Phase3
    Phase3 -->|"all components accepted\ntriggers validation"| Phase4

    DL["decisions.md\n(updated every phase)"] -.-> Phase1
    DL -.-> Phase2
    DL -.-> Phase3
    DL -.-> Phase4

Every decision is logged to .arch/decisions.md as it happens — not retroactively. By Phase 4, the decision log contains the complete rationale trail for every architectural choice.

Example: Designing an E-Commerce Platform

A team of 6 engineers is building an e-commerce platform. Here's the full journey:

Input: The team writes a PRD describing product catalog, shopping cart, checkout, payments, and order management. They note their constraints in org-context: 6 engineers, existing PostgreSQL expertise, 12-month runway, no Kubernetes experience.

npx arch-agent init --name "ShopFlow"
# Team writes PRD in .arch/prd.md and org-context in .arch/org-context.md
claude

Phase 1 — Evaluate:

/analyze-prd

The agent extracts 14 functional requirements and flags 4 critical gaps: no mention of inventory consistency model, missing payment failure recovery strategy, undefined latency SLA, and no data retention policy. The team answers the agent's questions, refines the analysis, and accepts.

Artifact produced: .arch/phase1-evaluation.md

Phase 2 — Decide:

/propose-methodology

2A: Agent proposes modular monolith — team of 6 with no Kubernetes experience shouldn't operate microservices. Compares against microservices (rejected: operational burden) and serverless (rejected: cold start on checkout). Team accepts.
2B: Agent maps 6 components: API Gateway, Product Catalog, Shopping Cart, Checkout & Payments, Order Management, Notification Service. Shows dependency graph. Team accepts.
2C: Agent locks cross-cutting: JWT auth with RS256, structured JSON logging via Pino, Docker Compose deployment (no K8s), exponential backoff retries, PostgreSQL per-module schemas with eventual consistency for cart-to-order transition. Team accepts.

Artifacts produced: .arch/phase2-methodology.md, .arch/phase2-components-overview.md, .arch/phase2-cross-cutting.md Decisions logged: DEC-001 through DEC-009 in .arch/decisions.md

Phase 3 — Design:

/design-component

The agent picks the first component in dependency order (e.g., Product Catalog) and designs it: PostgreSQL 16 with full-text search, REST API with pagination, Redis 7 cache for hot products, failure mode if Redis is down (fall through to DB). Each component includes a traceability section mapping to Phase 1 requirements (FR-NNN) and Phase 2C cross-cutting decisions (DEC-NNN). Team accepts — agent auto-advances to Shopping Cart, then Checkout & Payments, and so on. No need to call /design-component again.

Artifacts produced: .arch/components/product-catalog.md, .arch/components/shopping-cart.md, etc. Decisions logged: DEC-010 through DEC-021

Phase 4 — Validate & Document:

/generate-docs

The agent traces 3 critical user journeys (browse-to-purchase, payment failure recovery, order cancellation) through all components, verifying integration points. Builds a risk register scoring 8 identified risks by probability and impact. Generates the final architecture document.

Artifact produced: output/architecture-document.md

Result: A complete architecture with 21 logged decisions, 6 detailed component specs, and a comprehensive document — ready for implementation, onboarding, or audit.

Generated Artifacts

arch-agent produces architecture artifacts throughout the process, not just at the end:

Artifact	Location	When Produced	Contents
Decision log	`.arch/decisions.md`	Every phase	Each decision with ID, rationale, alternatives, trade-offs, risk, and supersession/traceability links
PRD evaluation	`.arch/phase1-evaluation.md`	Phase 1	Extracted requirements, gaps rated by severity, risk assessment
Architecture pattern	`.arch/phase2-methodology.md`	Phase 2A	Chosen pattern, rationale, alternatives compared
Component map	`.arch/phase2-components-overview.md`	Phase 2B	All components, dependencies, integration points
Cross-cutting decisions	`.arch/phase2-cross-cutting.md`	Phase 2C	Auth, observability, deployment, error handling, data strategies
Component designs	`.arch/components/[name].md`	Phase 3	Technology + version, API contracts, failure modes, scaling, traceability to requirements
Review findings	`.arch/reviews/[name].md`	Phase 3 (optional)	Adversarial review results from `/review-component`
Architecture document	`output/architecture-document.md`	Phase 4	Comprehensive deliverable consolidating all decisions

Example decision log entries:

### [DEC-003] Phase 2A | Architecture Pattern
- Decision: Modular monolith with event-driven boundaries
- Rationale: 6-person team, single deployment unit reduces operational burden
- Alternatives: Microservices (rejected: team too small), Serverless (rejected: cold start latency)
- Trade-offs: Sacrifices independent deployment for operational simplicity
- Risk: Module boundaries may need extraction to services at 10x scale
- References: FR-001, FR-003, FR-008
- Date: 2026-03-10T14:00:00Z

### [DEC-012] Phase 2A | Reopen
- Decision: Reopened architecture pattern — compliance requires data residency
- Rationale: Legal review found GDPR data residency requirements incompatible with single-region serverless
- Trade-offs: Progress reset on 5 items
- Risk: Reopens remaining: 1 of 2
- Supersedes: DEC-003
- Date: 2026-03-11T09:00:00Z

Example component design (abbreviated):

## Auth Service — Component Design

Technology: Keycloak 24.0 (self-hosted) + PostgreSQL 16
Pattern: OAuth 2.0 + OIDC, RS256 JWT tokens

Integration Points:
  IN:  POST /auth/token  <- API Gateway (client credentials)
  IN:  POST /auth/login  <- Web Client (authorization code flow)
  OUT: JWT validation    -> All services (public key endpoint)

Failure Modes:
  - Keycloak down -> Circuit breaker, return cached JWKS for 15min
  - Database failover -> Read replica promotion, ~30s token delay
  - Token leak -> Revocation endpoint + short-lived tokens (5min)

Traceability:
  Requirements: FR-002 (user authentication), FR-009 (session management)
  Cross-Cutting: DEC-007 (RS256 JWT), DEC-008 (structured JSON logging)
  Depends On: API Gateway

When to Use arch-agent

Scenario	How arch-agent Helps
Starting a new project from a PRD	Evaluates your requirements, finds gaps, proposes an architecture, and designs every component — with rationale for each choice
Structured architecture governance	Enforces phase gates, decision logging, and explicit acceptance — every decision has a paper trail
Documented architecture decisions	Produces a complete decision log for compliance, onboarding, or audit — not tribal knowledge in someone's head
Reviewing an existing architecture	Import your document with `arch-agent import`, and the agent walks through each phase challenging what exists
AI-assisted design with human control	The AI proposes and challenges; you decide at every gate. No auto-acceptance, no skipped phases
Onboarding engineers to a system	The generated architecture document and decision log explain not just what was decided, but why — with alternatives considered and risks documented

Quick Start

New project

npx arch-agent init --name "My Project"

Then:

Write your requirements in .arch/prd.md
Optionally describe your team and constraints in .arch/org-context.md
Start Claude Code and type /analyze-prd

Existing architecture

npx arch-agent import existing-architecture.md --name "My Project"

The agent parses your document and walks through each phase, challenging your existing decisions. Accept what's solid, refine what's outdated. Imported projects get 5 reopens (vs 2) for more iteration room.

Requirements: Node.js 18+, Claude Code, Python 3, git. Tessl is optional — required only if you want to run skill evaluations or publish tiles.

Commands

Workflow commands (phase sequence)

Phase	Command	Description
1. Evaluation	`/analyze-prd`	Evaluate PRD, find gaps, assess risks
2A. Pattern	`/propose-methodology`	Propose architecture pattern with rationale
2B. Components	`/propose-methodology`	Map all system components and dependencies
2C. Cross-Cutting	`/propose-methodology`	Lock auth, observability, deployment, error handling
3. Design	`/design-component [name]`	Detail one component (name optional — auto-picks next in dependency order; auto-advances after each `/accept`)
4. Finalization	`/generate-docs`	Validate consistency and generate final document
Import	`/import-architecture`	Import an existing architecture document (must be at `not_started`)

Decision commands

Command	Description
`/accept`	Accept current proposal and advance
`/refine [feedback]`	Request specific changes
`/alternative [request]`	Request a different approach
`/reopen [target] [reason]`	Reopen an accepted decision (max 2 per project)

Utility commands

Command	Description
`/review-component [name]`	Launch adversarial review of a component
`/status`	Show current progress
`/decision-log`	Show all recorded decisions
`/help`	Show available commands for current phase

Decision Governance

Every architectural decision flows through a controlled lifecycle:

flowchart LR
    PROPOSE["Agent Proposes"] --> CHALLENGE["Agent Challenges\n(asks hard questions)"]
    CHALLENGE --> DECIDE{"Architect Decides"}
    DECIDE -->|"/accept"| LOG["Decision Logged\n(rationale + alternatives\n+ trade-offs + risk)"]
    DECIDE -->|"/refine"| PROPOSE
    DECIDE -->|"/alternative"| PROPOSE

Decisions are immutable once accepted — unless you spend one of your limited reopens (max 2 per project). Reopening cascades: changing an early decision un-accepts everything downstream. New decisions that replace old ones carry a Supersedes: DEC-NNN link, creating an auditable chain of evolution without editing previous entries.

Key Design Decisions

Hard enforcement, not prompt instructions. Python validation hooks block illegal state transitions at the file system level. Phase skipping, backward transitions, and component injection are blocked regardless of what the AI is asked to do.

Cross-cutting decisions before component design. Auth, observability, deployment, and error handling strategies are locked in Phase 2C. Every component in Phase 3 must comply with these constraints. This prevents inconsistency across components.

One component at a time. Phase 3 designs components sequentially in dependency order. Each component is reviewed against previously accepted components for integration consistency. No parallel design that leads to interface mismatches.

Controlled iteration. The /reopen command allows going back to fix decisions with cascading invalidation. Reopening Phase 2A un-accepts 2B, 2C, and marks all components as "needs-review". Limited to 2 reopens per project to prevent design thrashing.

Project Structure

your-project/
├── CLAUDE.md                         # Agent identity and phase rules
├── .arch/
│   ├── state.json                    # Phase state machine
│   ├── prd.md                        # Your requirements (you edit this)
│   ├── org-context.md                # Team, stack, constraints (you edit this)
│   ├── decisions.md                  # Auto-generated decision log
│   ├── existing-architecture.md      # Imported document (arch-agent import only)
│   ├── scripts/
│   │   ├── validate-transition.py    # Enforcement hook
│   │   └── log-decision.py           # Auto-logging hook
│   ├── components/                   # Component design outputs
│   └── reviews/                      # Adversarial review findings
├── .claude/
│   ├── settings.json                 # Hook configuration
│   ├── commands/                     # Slash commands
│   └── skills/                       # Auto-activating skills
│       ├── architecture-methodology/ # Core 4-phase workflow skill
│       ├── architecture-patterns/    # Pattern knowledge base skill
│       ├── challenge-assumptions/    # Adversarial reviewer skill
│       └── state-manager/            # State machine skill
└── output/
    └── architecture-document.md      # Final deliverable

Tessl Integration

The skills bundled with arch-agent are packaged as Tessl tiles for versioning, distribution, and evaluation. Tessl is optional — required only if you want to run skill evaluations or publish tiles.

For tile descriptions, setup instructions, eval commands, and publishing guidance, see docs/CONTRIBUTING.md — Tessl Integration.

CLI Reference

npx arch-agent init [--name <name>] [--force]     # Scaffold a new project
npx arch-agent import <source> [--name <name>]     # Import existing architecture
npx arch-agent verify                               # Check prerequisites
npx arch-agent reset [--yes]                        # Reset state to initial

Documentation

User Guide — Step-by-step walkthrough
Architecture — System design and enforcement layers
Methodology — Four-phase design methodology
Contributing — How to contribute
Changelog — Version history

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
.arch		.arch
.claude		.claude
bin		bin
docs		docs
output		output
src		src
templates		templates
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
tessl.json		tessl.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

arch-agent

The Problem

Where arch-agent Fits

End-to-End Workflow

How Artifacts Evolve Across Phases

Example: Designing an E-Commerce Platform

Generated Artifacts

When to Use arch-agent

Quick Start

New project

Existing architecture

Commands

Workflow commands (phase sequence)

Decision commands

Utility commands

Decision Governance

Key Design Decisions

Project Structure

Tessl Integration

CLI Reference

Documentation

License

About

Uh oh!

Releases 5

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

arch-agent

The Problem

Where arch-agent Fits

End-to-End Workflow

How Artifacts Evolve Across Phases

Example: Designing an E-Commerce Platform

Generated Artifacts

When to Use arch-agent

Quick Start

New project

Existing architecture

Commands

Workflow commands (phase sequence)

Decision commands

Utility commands

Decision Governance

Key Design Decisions

Project Structure

Tessl Integration

CLI Reference

Documentation

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 5

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages