Vector Databases

pgvector, Pinecone, Weaviate.

Overview

Vector databases store and query high-dimensional embeddings for RAG, semantic search, and recommendation. The choice is between pgvector (Postgres extension), Pinecone (managed service), Weaviate (open-source self-hosted), and a growing list of others. The decision criteria are operational profile (managed vs self-hosted), scale ceiling (millions vs billions of vectors), and whether the team already operates Postgres. Most teams should start with pgvector and only migrate when they hit a real ceiling.

The approach

The practical approach is pgvector by default for teams already running Postgres (the vector workload reuses existing infrastructure), managed (Pinecone) for very large workloads or small teams without database operational capacity, self-hosted Weaviate when control or feature breadth matters, evaluation with real production-like workloads (synthetic benchmarks lie), and per-workload rationale documented committed to the infrastructure repo.

Why this compounds

Vector database choice compounds across the AI surface. Each correct selection avoids the painful migration when the workload outgrows the wrong choice; each documented rationale survives team turnover; the team builds vocabulary for AI-infrastructure tradeoffs that pays off on every new feature.

Vector database choice is an AI-infrastructure discipline that pays off across years. Nova AI Ops integrates with vector telemetry, surfaces vector patterns, and supports the team’s AI infrastructure discipline.