Availability vs Correctness Trade-off

Sometimes correctness matters more than availability.

Correctness

One of the deepest reliability decisions a service makes is the trade-off between availability and correctness. Sometimes serving a wrong answer is worse than serving no answer; sometimes the reverse. Most teams handle this trade-off implicitly; mature teams handle it explicitly, with documented decisions per service and clear behaviors when the trade-off is forced.

What "correctness over availability" means:

Correctness-over-availability is the right choice for services where wrong answers compound into worse outcomes than brief unavailability.

Availability

The reverse trade-off is also valid for the right workloads. Some services prefer to serve approximate or slightly-stale data rather than fail. The user who sees yesterday's recommendations is better off than the user who sees an error page; the user who gets a cached search result is better off than the user who gets nothing.

Availability-over-correctness is the right choice for services where partial functionality is more useful than full failure.

Decide

The decision between availability and correctness is per service, not per company. Different services have different trade-offs because their workloads have different characteristics. The discipline is making the choice deliberately rather than letting it emerge from how the code happens to be written.

Availability versus correctness is one of those design decisions that compounds across years of operation. Nova AI Ops surfaces both metrics per service, supports SLOs that combine both dimensions, and tracks the architectural patterns that match each service's chosen trade-off.