Feedback Requested: Production Monitoring with Prometheus

jotak · March 30, 2020, 1:25pm

Just to follow up, as I’ve tested the suggested settings, and quite clearly illustrated the “rate then sum” issue:

This is the volumetry observed on one of my service, from the istio-prometheus instance (the one that keeps temporary metrics):

At about 13:04, I restarted the pods, which shows up on that graph. There’s a constant ~28 rps that drops to 0 at that point.

And now, the same observed from my master prometheus:

The “drop-to-zero” shows up there as a spike to 1390 rps, which isn’t connected to anything real, it’s just a fake spike due to summing before rating.

Topic		Replies	Views
Feedback Requested: Prometheus Operator support in Istio Policies and Telemetry	0	734	May 3, 2019
BYO Prometheus with mTLS Policies and Telemetry	18	5840	December 21, 2020
Issue with federated scraping Policies and Telemetry	2	537	May 20, 2020
Issue with Prometheus and Kiali Policies and Telemetry	0	198	December 1, 2023
Prometheus alerting on Istio Components	30	9143	October 8, 2021

Feedback Requested: Production Monitoring with Prometheus

Related Topics