Cluster Backup Strategy

Cluster state needs backup. The strategy.

etcd

Cluster backup strategy is the discipline that prepares the team for cluster recovery. Without backups, a corrupted etcd or destroyed control plane is unrecoverable; with backups, the team can recover. The discipline is tested backups; untested backups are aspirational.

What etcd backup provides:

etcd backup is the foundation. Without it, the cluster cannot be reconstructed from scratch.

Velero

Velero handles the application-level backup. While etcd captures the API state, Velero captures the workload-level state including persistent volumes.

Velero complements etcd backup. Together they provide complete cluster recovery capability.

Test

The backup is tested. An untested backup is unproven; the test produces confidence that recovery actually works when needed.

Cluster backup strategy is one of those operational disciplines that pays off in the rare cases where it matters. Nova AI Ops integrates with backup and recovery tools, surfaces backup health, and produces the per-cluster recovery readiness view that the platform team uses to verify the discipline is working.