Visual Optimization Diagrams

Mermaid flowcharts for key cost optimization decisions. These render natively on GitHub.

Claude Model Family (April 2026)
Model Selection Decision Tree
Session Cost Optimization Flowchart
Cost Tier Strategy Map
Pricing Modifier Stack

Claude Model Family (April 2026)

The current Claude model lineup, their positioning, and cost tiers. Mythos Preview is an invitation-only research model for defensive cybersecurity under Project Glasswing — not for general development.

flowchart TB
    subgraph GA["Generally Available"]
        direction TB
        opus7["Opus 4.7 (flagship)<br/>$5 / $25 per 1M<br/>1M context · 128K output<br/>Adaptive thinking"]
        opus6["Opus 4.6 (legacy)<br/>$5 / $25 per 1M<br/>1M context · 128K output<br/>Only model with Fast Mode"]
        sonnet["Sonnet 4.6 (safe default)<br/>$3 / $15 per 1M<br/>1M context · 64K output<br/>Extended + adaptive thinking"]
        haiku["Haiku 4.5 (budget)<br/>$1 / $5 per 1M<br/>200K context · 64K output<br/>Extended thinking"]
    end

    subgraph RP["Research Preview (invite-only)"]
        direction TB
        mythos["Mythos Preview<br/>$25 / $125 per 1M<br/>Project Glasswing<br/>Defensive security only"]
    end

    classDef flagship fill:#f4d0e0,stroke:#c94a7a,stroke-width:2px,color:#222
    classDef legacy fill:#f4e0d0,stroke:#c97a4a,color:#222
    classDef default fill:#d0e8f4,stroke:#4a8ac9,stroke-width:2px,color:#222
    classDef budget fill:#d0f4d5,stroke:#4ac96a,color:#222
    classDef preview fill:#e8d0f4,stroke:#7a4ac9,color:#222

    class opus7 flagship
    class opus6 legacy
    class sonnet default
    class haiku budget
    class mythos preview

Model Positioning

Model	Access	Best For	Why Not
Opus 4.7	GA (Bedrock research preview)	Complex agentic coding, multi-file refactors, long autonomous runs	Overkill for simple edits; new tokenizer uses ~20-35% more tokens
Opus 4.6	GA	Workloads tuned to old tokenizer, Fast Mode latency-critical paths	Marked legacy; migrate to 4.7 for coding quality
Sonnet 4.6	GA	Everyday development (the safe default)	Stretched on complex architecture + long agentic runs
Haiku 4.5	GA	Formatting, renaming, simple edits, file lookups	Lacks reasoning depth for multi-file work; 200K context (not 1M)
Mythos Preview	Invite only	Vulnerability discovery, security research (Glasswing partners)	5x output pricing, not for general use, no self-serve access

Model Selection Decision Tree

Use this to pick the right model before starting a task. Starting with Sonnet is always a safe default.

flowchart TD
    A[Start: evaluate task] --> B{"Complex architecture,<br/>long agentic run, or<br/>multi-file refactor?"}
    B -- Yes --> C["Use Opus 4.7<br/>$5 / $25 per 1M"]
    B -- No --> D{"Standard feature work,<br/>code review, or<br/>writing tests?"}
    D -- Yes --> E["Use Sonnet 4.6<br/>$3 / $15 per 1M"]
    D -- No --> F{"Simple fix, formatting,<br/>boilerplate, or<br/>file lookup?"}
    F -- Yes --> G["Use Haiku 4.5<br/>$1 / $5 per 1M"]
    F -- No --> H["Not sure?<br/>Start with Sonnet 4.6"]

    C -. "latency-critical?" .-> C2["Fall back to Opus 4.6<br/>(only model with Fast Mode)"]

    classDef flagship fill:#f4d0e0,stroke:#c94a7a,stroke-width:2px,color:#222
    classDef legacy fill:#f4e0d0,stroke:#c97a4a,color:#222
    classDef mid fill:#d0e8f4,stroke:#4a8ac9,stroke-width:2px,color:#222
    classDef low fill:#d0f4d5,stroke:#4ac96a,color:#222

    class C flagship
    class C2 legacy
    class E,H mid
    class G low

Quick Reference

Complexity	Model	Cost (Input/Output per 1M)	Examples
High	Opus 4.7	$5 / $25	Architecture design, complex debugging, large refactors, long agentic runs
High (Fast Mode)	Opus 4.6	$30 / $150	Latency-critical urgent work (6x premium)
Medium	Sonnet 4.6	$3 / $15	Feature implementation, code review, test writing
Low	Haiku 4.5	$1 / $5	Formatting, renaming, boilerplate, lookups

Session Cost Optimization Flowchart

Follow this checklist at the start of every Claude Code session to minimize waste.

flowchart TD
    A[Start session] --> B{"CLAUDE.md<br/>over 150 lines?"}
    B -- Yes --> C["Trim CLAUDE.md<br/>under 150 lines"]
    B -- No --> D
    C --> D{".claudeignore<br/>exists?"}
    D -- No --> E["Create .claudeignore<br/>(exclude node_modules,<br/>dist, lock files)"]
    D -- Yes --> F
    E --> F["Choose model<br/>by task complexity"]
    F --> G{"Task is complex?"}
    G -- Yes --> H["Use Plan Mode<br/>before coding"]
    G -- No --> I
    H --> I["Work on task"]
    I --> J["Monitor with /usage"]
    J --> K{"Context<br/>getting large?"}
    K -- Yes --> L["Run /compact"]
    K -- No --> M
    L --> M{"Starting a<br/>new task?"}
    M -- Yes --> N["Start fresh session<br/>to reset context"]
    M -- No --> J

    classDef action fill:#d0f4d5,stroke:#4ac96a,color:#222
    classDef warn fill:#f4ead0,stroke:#c9a84a,color:#222
    classDef check fill:#d0e8f4,stroke:#4a8ac9,color:#222

    class A,F,I,J check
    class C,E,H,L,N action

Key Checkpoints

CLAUDE.md size -- Every line loads on every turn. Keep it under 150 lines to avoid recurring token waste.
.claudeignore -- Prevents Claude from reading large generated or vendored files.
Model selection -- Match model to task complexity (see decision tree above).
Plan Mode -- For complex tasks, plan first to avoid expensive iterative dead ends.
/compact -- Summarizes conversation history to reduce context size mid-session.
Fresh sessions -- New tasks should get new sessions. Stale context from prior tasks is pure waste.

Cost Tier Strategy Map

Which strategies matter most depends on your monthly spend. Focus on high-impact changes first.

flowchart TD
    A[Monthly Claude Code spend] --> B{"Less than<br/>$50 / month?"}
    B -- Yes --> T1
    B -- No --> D{"$50 to<br/>$200 / month?"}
    D -- Yes --> T2
    D -- No --> T3

    subgraph T1["Tier 1: Basics"]
        direction TB
        C1["Select models by task complexity"]
        C2["Trim CLAUDE.md under 150 lines"]
        C3["Use /compact when context grows"]
    end

    subgraph T2["Tier 2: Intermediate"]
        direction TB
        E1["Everything in Tier 1"]
        E2["Add .claudeignore to all projects"]
        E3["Delegate searches to subagents"]
        E4["Build /compact + Plan Mode habits"]
    end

    subgraph T3["Tier 3: Full Optimization"]
        direction TB
        F1["Everything in Tiers 1 and 2"]
        F2["Set per-developer budgets"]
        F3["Run usage analyzer weekly"]
        F4["Use token estimator on large prompts"]
        F5["Evaluate Batch API for bulk work"]
    end

    classDef tier1 fill:#d0f4d5,stroke:#4ac96a,color:#222
    classDef tier2 fill:#f4ead0,stroke:#c9a84a,color:#222
    classDef tier3 fill:#f4d0d0,stroke:#c94a4a,color:#222

    class T1 tier1
    class T2 tier2
    class T3 tier3

Strategy Summary by Tier

Tier	Monthly Spend	Focus Areas	Expected Savings
1 - Basics	< $50	Model selection, CLAUDE.md trimming, /compact	15-30%
2 - Intermediate	$50-200	Add .claudeignore, subagents, Plan Mode habits	30-45%
3 - Full Optimization	> $200	Team budgets, usage analyzer, token estimator, Batch API	40-60%

Pricing Modifier Stack

How the various multipliers combine on top of the base $/MTok rate. Each modifier stacks multiplicatively.

flowchart LR
    base["Base rate<br/>Opus 4.7 $5 / $25"] --> cache{"Cache hit<br/>or write?"}

    cache -- "Cache read hit" --> cacheRead["× 0.1<br/>(90% off input)"]
    cache -- "5-min write" --> write5m["× 1.25"]
    cache -- "1-hour write" --> write1h["× 2.0"]
    cache -- "No cache" --> normal["× 1.0"]

    cacheRead --> batch{"Batch API?"}
    write5m --> batch
    write1h --> batch
    normal --> batch

    batch -- "Yes" --> batchYes["× 0.5<br/>(50% off)"]
    batch -- "No" --> batchNo["× 1.0"]

    batchYes --> region{"Platform /<br/>region?"}
    batchNo --> region

    region -- "Global / API" --> regGlobal["× 1.0"]
    region -- "Regional endpoint" --> regRegional["× 1.1<br/>(+10%)"]
    region -- "US-only data<br/>residency" --> regData["× 1.1<br/>(+10%)"]

    regGlobal --> fast{"Fast Mode?<br/>(Opus 4.6 only)"}
    regRegional --> fast
    regData --> fast

    fast -- "Yes" --> fastYes["× 6<br/>(research preview)"]
    fast -- "No" --> fastNo["× 1.0"]

    fastYes --> final["Final $/MTok"]
    fastNo --> final

    classDef discount fill:#d0f4d5,stroke:#4ac96a,color:#222
    classDef premium fill:#f4ead0,stroke:#c9a84a,color:#222
    classDef expensive fill:#f4d0d0,stroke:#c94a4a,color:#222
    classDef result fill:#d0e8f4,stroke:#4a8ac9,stroke-width:2px,color:#222

    class cacheRead,batchYes discount
    class write5m,write1h,regRegional,regData premium
    class fastYes expensive
    class final result

Stacking Examples (Opus 4.7 input at $5/MTok base)

Scenario	Calculation	Effective rate
Standard API call	$5 × 1	$5.00
Cache read hit	$5 × 0.1	$0.50
Batch API	$5 × 0.5	$2.50
Batch + cache read	$5 × 0.5 × 0.1	$0.25
Regional endpoint on Bedrock	$5 × 1.1	$5.50
Regional + data residency	$5 × 1.1 × 1.1	$6.05
Fast Mode (Opus 4.6 only)	$5 × 6	$30.00
Fast Mode + cache read	$5 × 6 × 0.1	$3.00

Note: Fast Mode cannot combine with Batch API. All other multipliers can stack.

Related Guides

Model Selection -- Detailed model comparison with cost-per-task data
Context Optimization -- CLAUDE.md trimming and .claudeignore setup
Workflow Patterns -- Plan Mode, subagents, and /compact usage
Team Budgeting -- Per-developer budgets and ROI tracking
Access Methods and Pricing -- Platform comparison, endpoint premiums, Fast Mode

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Visual Optimization Diagrams

Table of Contents

Claude Model Family (April 2026)

Model Positioning

Model Selection Decision Tree

Quick Reference

Session Cost Optimization Flowchart

Key Checkpoints

Cost Tier Strategy Map

Strategy Summary by Tier

Pricing Modifier Stack

Stacking Examples (Opus 4.7 input at $5/MTok base)

Related Guides

FilesExpand file tree

diagrams.md

Latest commit

History

diagrams.md

File metadata and controls

Visual Optimization Diagrams

Table of Contents

Claude Model Family (April 2026)

Model Positioning

Model Selection Decision Tree

Quick Reference

Session Cost Optimization Flowchart

Key Checkpoints

Cost Tier Strategy Map

Strategy Summary by Tier

Pricing Modifier Stack

Stacking Examples (Opus 4.7 input at $5/MTok base)

Related Guides