Skip to content

docs: Plan native Kubernetes Cleanroom#507

Draft
lox wants to merge 1 commit into
mainfrom
lox/native-kubernetes-cleanroom-plan
Draft

docs: Plan native Kubernetes Cleanroom#507
lox wants to merge 1 commit into
mainfrom
lox/native-kubernetes-cleanroom-plan

Conversation

@lox

@lox lox commented May 31, 2026

Copy link
Copy Markdown
Contributor

Cleanroom's current server-oriented control model is awkward for Kubernetes-scale deployment because a small pod or central service can hide VM CPU, memory, disk, KVM, and storage pressure from the scheduler. That makes bin packing, quotas, autoscaling, and failure recovery harder to reason about.

This draft plan proposes a Kubernetes-native shape where each Cleanroom sandbox is represented as one schedulable unit, with Kubernetes owning desired state, placement, lifecycle visibility, coarse tenancy, and routing. Cleanroom stays responsible for the runtime semantics that matter to repository sandboxes: policy compilation, microVM isolation, deny-by-default egress, gateway and cache mediation, exact-principal ownership, execution/file APIs, snapshots, and backend-specific setup.

The plan prefers integrating under Kubernetes SIG Agent Sandbox for the user-facing API, starts with a Firecracker-backed adapter pod and node-local runtime, and treats KubeVirt as a later backend option rather than the product API. It also calls out the first delivery slices, validation strategy, open questions, and pressure-test findings around hidden capacity, warm pools, NetworkPolicy boundaries, and CRD status scope.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant