docs: Plan native Kubernetes Cleanroom by lox · Pull Request #507 · buildkite/cleanroom

lox · 2026-05-31T21:28:25Z

Cleanroom's current server-oriented control model is awkward for Kubernetes-scale deployment because a small pod or central service can hide VM CPU, memory, disk, KVM, and storage pressure from the scheduler. That makes bin packing, quotas, autoscaling, and failure recovery harder to reason about.

This draft plan proposes a Kubernetes-native shape where each Cleanroom sandbox is represented as one schedulable unit, with Kubernetes owning desired state, placement, lifecycle visibility, coarse tenancy, and routing. Cleanroom stays responsible for the runtime semantics that matter to repository sandboxes: policy compilation, microVM isolation, deny-by-default egress, gateway and cache mediation, exact-principal ownership, execution/file APIs, snapshots, and backend-specific setup.

The plan prefers integrating under Kubernetes SIG Agent Sandbox for the user-facing API, starts with a Firecracker-backed adapter pod and node-local runtime, and treats KubeVirt as a later backend option rather than the product API. It also calls out the first delivery slices, validation strategy, open questions, and pressure-test findings around hidden capacity, warm pools, NetworkPolicy boundaries, and CRD status scope.

docs: Plan native Kubernetes Cleanroom

306a919

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: Plan native Kubernetes Cleanroom#507

docs: Plan native Kubernetes Cleanroom#507
lox wants to merge 1 commit into
mainfrom
lox/native-kubernetes-cleanroom-plan

lox commented May 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

lox commented May 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant