Monitor the key metrics with Prometheus + Grafana:
- Queue depth (steady growth signals consumer lag);
- Consumer count (zero means nothing is draining the queue);
- Unacked messages (a rising count points to stuck messages);
- Publish/consume rates (throughput);
- Memory usage (backpressure starts at the memory high watermark, 40% by default);
- Connection/channel counts (steady growth reveals leaks);
- Node health (disk space, GC pauses).
Alert thresholds: queue depth growing by more than 1,000 messages/min, zero consumers for more than 5 min, memory above 70%, connection churn above 100/s.
Use the built-in rabbitmq_prometheus plugin to expose these metrics.
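The alert thresholds above can be sketched as a small check over metric samples. This is a minimal illustration, not how Prometheus or the plugin evaluates rules: the `Sample` type, its field names, and `evaluate_alerts` are hypothetical, and it assumes memory is already expressed as a fraction of whichever limit you measure against. In practice the same conditions would normally be written as Prometheus alerting rules.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Sample:
    """One snapshot of broker metrics (hypothetical field names)."""
    timestamp: float         # seconds
    queue_depth: int         # messages currently in the queue
    consumers: int           # consumers attached to the queue
    memory_fraction: float   # memory used as a fraction, 0.0 to 1.0 (assumed)
    connections_opened: int  # cumulative count of opened connections


def evaluate_alerts(samples):
    """Return the alert names that fire for a time-ordered list of samples."""
    if len(samples) < 2:
        return []
    first, last = samples[0], samples[-1]
    elapsed = last.timestamp - first.timestamp
    if elapsed <= 0:
        return []

    alerts = []

    # Queue depth growing by more than 1,000 messages per minute.
    growth_per_min = (last.queue_depth - first.queue_depth) / elapsed * 60
    if growth_per_min > 1000:
        alerts.append("queue_depth_growing")

    # Zero consumers for more than 5 minutes: walk back through the
    # trailing run of zero-consumer samples to find where it started.
    zero_since = None
    for sample in reversed(samples):
        if sample.consumers != 0:
            break
        zero_since = sample.timestamp
    if zero_since is not None and last.timestamp - zero_since > 300:
        alerts.append("no_consumers")

    # Memory above 70%.
    if last.memory_fraction > 0.70:
        alerts.append("memory_high")

    # Connection churn above 100 new connections per second.
    churn_per_sec = (last.connections_opened - first.connections_opened) / elapsed
    if churn_per_sec > 100:
        alerts.append("connection_churn")

    return alerts
```

Feeding it two samples taken six minutes apart, with no consumers and a queue that grew from 100 to 10,000 messages, would fire the queue depth and consumer alerts; the thresholds are the ones listed above and should be tuned to your own traffic.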