Managing load concurrency and queueing across multiple pods with Istio CircuitBreaker

kira · March 8, 2023, 8:53pm

Hi, I have a http2 service which is called from outside the k8 cluster.

How do I ensure each instance of my service serves max ‘x’ requests parallel and queue upto ‘y’ requests. As requests come in, they should get distributed to the instance with lowest load (considering the queue size too). I am looking at using the istio circuit breaker http connection pool settings.

http2MaxRequests=x
http1MaxPendingRequests=y

Is this correct approach to achieve what I want?

Thanks!

Topic		Replies	Views
How should one fine-tune circuit breaker values in the context of dynamic service scalability? Networking	1	849	April 25, 2022
Clarification on Circuit Breaking Networking	0	333	August 27, 2019
Limit Connection on Pod	2	1128	January 2, 2023
Controlling concurrent requests to a pod Networking	0	547	April 29, 2019
CircuitBreaker- max connection in connectionPool limit servicelevel or podlevel Config	1	1175	January 12, 2023