Network Egress Cost Controls That Pay
Egress fees can be 30-50% of cloud bill. The four controls that cut egress materially.
In-region traffic
Network egress cost controls are the disciplines that prevent egress costs from becoming a major line item. The strategies are well-known; the discipline is in applying them consistently and revisiting as traffic patterns evolve.
What in-region traffic discipline provides:
- Place services that talk in the same region.: Services with significant traffic between them should live in the same region. Cross-region traffic incurs cross-region egress costs; same-region traffic is much cheaper.
- Cross-region egress is 10x intra-region.: The price difference is significant. Architecture decisions that produce cross-region traffic patterns produce ongoing costs that intra-region patterns do not.
- Verify with VPC flow logs.: The team's actual traffic patterns are visible in flow logs. Verifying that services that should be co-located actually are produces real data; assumptions sometimes do not match reality.
- Surprises hide in legacy routing.: Old configurations sometimes route traffic through unexpected paths. The flow log audit reveals these; remediation produces direct savings.
- Plan deployments accordingly.: When deploying new services, the team chooses regions deliberately. Co-location with related services is part of the deployment plan; the cost is bounded from the start.
In-region discipline is the highest-leverage control. The savings compound across all cross-region paths the team avoids.
Compress
Compression reduces bytes on the wire. The CPU cost of compression is much lower than the network cost of uncompressed transfer; the math favors compression for almost all egress traffic.
- gzip on HTTP responses.: HTTP responses should be compressed by default. gzip is universally supported; the compression ratio is good for typical content.
- Brotli where supported.: Modern browsers support Brotli compression. The ratio is better than gzip; the team configures Brotli for clients that support it; the savings are larger.
- Compression at the application reduces bytes.: The compression is at the application layer (or the proxy in front of it). The bytes that leave the network are compressed; the egress cost is on the compressed bytes.
- Egress drops proportionally.: A 5x compression ratio produces 80% reduction in egress bytes for that traffic. The savings are direct; the cost reduction is proportional.
- Watch for already-compressed content.: Images, videos, and similar content are typically already compressed. Compressing again wastes CPU without producing savings; the team's compression configuration excludes these.
Compression is mechanical and high-leverage. The discipline is enabling it broadly and configuring it correctly.
CDN for public traffic
Public-facing traffic should go through a CDN. The CDN's egress is cheaper than direct cloud egress; caching at the CDN reduces origin egress further.
- Public-facing traffic via CDN.: Traffic to public users should not flow directly from the cloud's origin. The CDN sits in the middle; users connect to the CDN; the CDN connects to origin.
- CDN egress is cheaper than direct cloud egress.: CDN providers charge for egress at lower rates than the cloud's native egress. The price difference is real; the CDN is economically sensible even before considering caching.
- Caching helps too.: The CDN caches responses. Cache hits do not reach origin; origin egress is reduced. The cache hit rate is the key multiplier; high hit rate produces large savings.
- Served bytes count, but only on first miss.: The CDN charges for served bytes (cache hits and misses both). Origin charges only for cache misses. The total cost is CDN-served bytes plus origin-served bytes; well-cached content optimizes both.
- Configure cache TTLs deliberately.: Long TTLs maximize cache hit rate but produce stale content; short TTLs minimize staleness but produce more origin traffic. The team tunes per content type.
CDN-fronted public traffic is the standard pattern. The egress cost benefits compound with the latency benefits.
Audit
The audit is what keeps the controls effective. Without periodic audit, traffic patterns drift; new services produce new egress; the team's costs grow unobserved.
- Per-service egress dashboard.: Each service's egress is tracked. The dashboard shows trends; per-service growth is visible; investigation has data.
- Sort by cost.: The dashboard sorts services by egress cost. The top contributors are the optimization candidates; the team's effort focuses where the savings are largest.
- Investigate top contributors.: The team investigates the top-spending services. Why is this service's egress so high? Is the architecture appropriate? Are the controls applied? The investigation produces specific remediation.
- Track over time.: The trend matters. Improving trends mean controls are working; degrading trends mean attention is needed. The dashboard supports both reactive and proactive management.
- Quarterly review.: Once per quarter, the team reviews the egress cost in detail. New patterns surface; controls are updated; the discipline is sustained.
Network egress cost controls are one of those FinOps disciplines that pays off proportionally to traffic volume. Nova AI Ops integrates with cloud cost data and traffic telemetry, surfaces per-service egress patterns, and produces the optimization queue that the team uses to drive savings.
Most teams find one or two unexpected high-egress services.