The Golden Signals Dashboard tracks the four metrics that matter most for service reliability: latency, traffic, errors, and saturation. Every signal is baselined against 24 hours of historical data so deviations from normal are flagged automatically. P50, P95, and P99 latency percentiles, throughput rates, error breakdowns by type, and resource saturation across CPU, memory, disk I/O, and network bandwidth are all visible from a single view.
Latency is the most user-visible signal. The Golden Signals Dashboard breaks latency into P50 (median), P95, and P99 percentiles so you can see both typical and tail performance. Each percentile is charted over time with a 24-hour baseline overlay, making it obvious when performance deviates from normal. Drill into any spike to see which endpoints, services, or database queries are contributing to the latency increase.
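To make the percentile breakdown concrete, here is a minimal Python sketch of how P50, P95, and P99 could be derived from raw latency samples and compared against a baseline. It is not the dashboard's implementation: the nearest-rank method, the function names, and the 50% tolerance are all assumptions chosen for illustration.

```python
import math


def percentile(samples, p):
    """Nearest-rank percentile: the smallest sample with at least p% of
    all samples at or below it. (Assumed method; the dashboard may
    interpolate or use a streaming estimator instead.)"""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]


def latency_summary(samples_ms):
    """Return the three percentiles named in the text for one window."""
    return {f"p{p}": percentile(samples_ms, p) for p in (50, 95, 99)}


def deviates(current, baseline, tolerance=0.5):
    """True when the current value exceeds the baseline by more than
    `tolerance` (a fraction). The 0.5 default is a hypothetical value."""
    return current > baseline * (1 + tolerance)
```

For example, `latency_summary(list(range(1, 101)))` summarizes one hundred samples of 1 ms to 100 ms, and `deviates(current_p99, baseline_p99)` would be evaluated per percentile against the value from the 24-hour baseline.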
Traffic throughput is displayed as requests per second with breakdowns by service, endpoint, and HTTP method. Error rates are shown as both absolute counts and percentages, categorized by error type (4xx client, 5xx server, timeout, connection refused). The dashboard correlates traffic spikes with error rate changes so you can distinguish between load-induced errors and functional bugs.
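The error categorization and the traffic correlation described above can be sketched in Python as follows. This is an illustration under stated assumptions rather than the product's logic: the outcome encoding (an integer HTTP status, or the strings `"timeout"` and `"connection_refused"`), the function names, and the use of a Pearson coefficient are all hypothetical choices.

```python
import math
from collections import Counter


def classify(outcome):
    """Map one request outcome to an error category.
    Assumed encoding: int HTTP status, or 'timeout' / 'connection_refused'."""
    if outcome in ("timeout", "connection_refused"):
        return outcome
    if 400 <= outcome < 500:
        return "4xx_client"
    if 500 <= outcome < 600:
        return "5xx_server"
    return "ok"


def error_breakdown(outcomes):
    """Return absolute error counts and the error percentage for a window."""
    counts = Counter(classify(o) for o in outcomes)
    total = len(outcomes)
    errors = total - counts.get("ok", 0)
    by_type = {k: v for k, v in counts.items() if k != "ok"}
    rate = 100 * errors / total if total else 0.0
    return {"total": total, "errors": errors,
            "error_rate_pct": rate, "by_type": by_type}


def pearson(xs, ys):
    """Pearson correlation between two equal-length series, or None if
    either series has zero variance."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("series must have equal length >= 2")
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return None
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy / math.sqrt(sxx * syy)
```

A coefficient near 1 between requests per second and error rate over the same windows would suggest load-induced errors, while a rising error rate with little correlation to traffic points more toward a functional bug. Treat this as a heuristic reading, not a diagnosis.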
Saturation measures how close your resources are to their limits. The dashboard tracks CPU utilization, memory pressure, disk I/O, and network bandwidth across all services and hosts. Each metric is displayed with threshold lines for warning (70%) and critical (90%) levels. Predictive trendlines estimate when resources will hit capacity so you can scale proactively rather than reactively.
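As a rough illustration of the threshold levels and the predictive trendline, the Python sketch below classifies a utilization reading against the 70% and 90% lines and fits a least-squares line to estimate time until capacity. The function names, the inclusive threshold comparison, the 100% capacity default, and the choice of a simple linear fit are assumptions; the dashboard's actual forecasting model is not described in this document.

```python
def level(utilization_pct, warning=70.0, critical=90.0):
    """Classify a reading against the warning and critical thresholds.
    Inclusive comparison (>=) is an assumption."""
    if utilization_pct >= critical:
        return "critical"
    if utilization_pct >= warning:
        return "warning"
    return "ok"


def hours_until_capacity(samples, capacity=100.0):
    """Estimate hours from the last sample until `capacity` is reached.

    `samples` is a list of (hour, utilization_pct) pairs in time order.
    Fits an ordinary least-squares line; returns None when there are too
    few points or utilization is flat or falling."""
    n = len(samples)
    if n < 2:
        return None
    mean_x = sum(x for x, _ in samples) / n
    mean_y = sum(y for _, y in samples) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in samples)
    if sxx == 0:
        return None
    slope = sum((x - mean_x) * (y - mean_y) for x, y in samples) / sxx
    if slope <= 0:
        return None
    intercept = mean_y - slope * mean_x
    hit_hour = (capacity - intercept) / slope
    return max(0.0, hit_hour - samples[-1][0])
```

For instance, readings that climb steadily by two percentage points per hour from 50% would be classified as `"ok"` now while still yielding a finite estimate of the hours remaining, which is the situation where scaling ahead of time pays off.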
Golden Signals Dashboard is part of the Nova AI Ops platform. Start a free trial to see it with your own data.