Chaos Engineering with LitmusChaos: 30 Minutes
Chaos engineering in 30 minutes: enough to see your first experiment, not yet enough to design a program.
Step 1: Install Litmus
helm install litmus litmuschaos/litmus -n litmus --create-namespace
Wait for pods; access UI via port-forward.
Step 2: Pick experiment
- ChaosHub: pod-delete, network-loss, cpu-hog, etc.
- Pick pod-delete on a stateless deployment first.
Step 3: Run experiment
Configure: target deployment; scale; chaos duration.
Run; Litmus deletes pods on schedule.
Step 4: Read the result
Probes verify recovery: was the service still serving traffic?
Pass: system handles the chaos.
Fail: investigate; fix; re-run.
Antipatterns
- Production chaos on day one. Start in staging.
- No probes. Cannot verify outcome.
- One-off chaos. Pattern is repeated experiments.
What to do this week
Three moves. (1) Run the tutorial end-to-end on your own laptop / sandbox. (2) Apply the pattern to one production workload. (3) Document the variations you needed; share with the team.