Reliability Engineering

Resource saturation, the simple way,
live gauges, threshold lines, no PromQL needed

System Gauge is the resource-utilization page. CPU, memory, disk, network, connection counts, queue depths, per host and per service, with threshold lines from your alert rules drawn directly on the gauges. Use it as the first stop when an SLO is burning hot and you need to know which resource is the bottleneck.

Get Started Talk to Sales
app.novaaiops.com / system-gauge
● LIVE
6
Resource types tracked
Per-host
or per-service
Live
thresholds drawn
< 10s
data freshness
What's Tracked

Six resource types, every host

CPU, memory, disk, network, connection counts, and queue depth, captured every 10 seconds per host and rolled up per service. The metrics come from the agents already deployed for log/metric collection so there is no new sidecar. Bring-your-own-Prometheus also works: the page reads from your existing Prometheus if you do not want Nova's collectors.

  • Six resource types: CPU, memory, disk, network, connection count, queue depth, the saturation big-six
  • Per-host and per-service: drill into one host's saturation or roll up to service-level percentiles
  • BYO Prometheus: use Nova's collectors or read from your existing Prometheus, same UI, same thresholds
app.novaaiops.com / system-gauge · resources
Threshold Lines

Your alerting rules, on the gauge

Every gauge shows the warn and page thresholds from your alert rules as horizontal lines. Crossing a threshold is visible at a glance, no need to know "is 78% bad?" The thresholds come from your existing alert rules so the page agrees with whatever fires your pager.

  • From alert rules: thresholds derived from existing alert rules; no separate config to maintain
  • Warn and page lines: two lines per gauge so you see headroom at both severity levels
  • Visual breach: over-threshold values are red on the gauge; sustained breach gets a small badge
app.novaaiops.com / system-gauge · thresholds
Cross-Resource Correlation

When two gauges move together

Saturation rarely lives alone. The page highlights cross-resource correlations: CPU spike on the API host correlated with connection-count climb on the database host. The correlations are the same engine that drives Cross-Signal Correlation; the gauge view is just the resource-only slice of it.

  • Live correlation: gauges that move together get a visual link line; clicking opens the cross-signal graph
  • Service-graph aware: correlations only fire across services connected in the service map; reduces noise
  • Click to investigate: every correlated pair links to the cross-signal correlation graph for the time window
app.novaaiops.com / system-gauge · correlation
Capacity Forecast

How long until I run out

Each gauge has a small forecast line drawn from the recent slope of the metric. "At current rate, this fills in 14 days." Use it for capacity planning: when does this disk need to be bigger, when does this pool need to be wider. The forecast updates every hour so it tracks reality, not last quarter.

  • Slope-based forecast: simple linear projection from the recent slope; no ML needed for a usable estimate
  • 14-day default horizon: longer than maintenance windows, shorter than quarterly reviews
  • Hourly refresh: forecasts update every hour so you do not act on a stale projection
app.novaaiops.com / system-gauge · forecast
Video walkthrough coming soon

Subscribe to Nova AI Ops on YouTube for demos, tutorials, and feature deep-dives.

A first-stop for "what is saturated"

Most incidents are a saturated resource somewhere. System Gauge tells you which one in seconds.

Get Started Request a Demo