Set Up VictoriaMetrics

High-scale TSDB.

Overview

VictoriaMetrics is a high-scale, low-resource time-series database that speaks PromQL. Compared to Prometheus, it ingests tens of millions of samples per second per node, uses meaningfully less memory, supports years of retention on commodity disk, and scales horizontally via cluster mode (vminsert, vmstorage, vmselect). For deployments that hit Prometheus’ scaling limits or that need years of metrics history, VictoriaMetrics is the operational upgrade.

The approach

The practical approach is single-node first for small deployments (simpler operations), cluster mode when ingestion or retention exceeds single-node capacity, vmagent for collection (lightweight collector replaces Prometheus scrape pods), long retention by default (the storage is cheap, the historical analysis is valuable), and per-cluster topology documented in the infrastructure repo so the configuration is reviewable.

Why this compounds

VictoriaMetrics mastery compounds across services. Each year of retained metrics produces historical investigation capability that no shorter retention can match; the team builds operational muscle for high-cardinality TSDB at lower cost than Prometheus would charge in memory; new services inherit a high-scale metrics surface as a default.

VictoriaMetrics is an infrastructure investment that pays off across years. Nova AI Ops integrates with metrics telemetry, surfaces TSDB patterns, and supports the team’s monitoring discipline.