Multi-Cluster Egress Security

Multi-cluster setups need consistent egress policy. The patterns and the enforcement.

The egress security model

The egress security model rests on default-deny, FQDN-based allowlists where possible, and logging on every egress flow. Default-deny catches lateral movement after compromise and bounds blast radius; FQDN allowlists outlive IP-based ones because cloud services change IPs; logs make compromise detectable rather than newsworthy.

Centralised egress patterns

Centralised egress routes all clusters’ outbound through one path, giving a single point to enforce policy and inspect traffic. Per-cluster egress is operationally simpler but loses the uniform policy enforcement; the trade-off depends on the team’s tolerance for centralised cost and the value of inspection.

Policy enforcement

Policy enforcement spans layers. NetworkPolicy resources plus a supporting CNI for in-cluster egress; cloud security groups, route tables, and transit gateway rules beyond the cluster; per-namespace allowlists for granular team-level control. Defence in depth means each layer is enforced independently.

Auditing egress

Auditing closes the loop. VPC flow logs at the cloud layer, pod egress logs at the cluster layer, both feeding an SIEM for correlation and detection. Anomaly detection on new external IPs, sudden volume changes, and unusual hours surfaces compromise; quarterly audit catches forgotten test integrations and abandoned phone-home services.

Operating multi-cluster egress security

Operating multi-cluster egress security needs a standard policy template, non-prod testing before promotion, and a documented allowlist with owners and justifications. Drift across clusters defeats the security posture; egress changes can break legitimate traffic, so test runs are the safety net.