[Seeking general guidance] Ingress gateway connection resets under heavy traffic load

coding_tempura · November 17, 2022, 2:47am

Our team runs a large Kubernetes cluster on AWS EKS with Istio serving traffic, and an AWS load balancer (NLB) routes traffic into the cluster. Recently we have noticed that spikes in the NLB's target reset count consistently coincide with CPU and I/O spikes in istiod and the ingress gateway, which suggests that connections are being reset at the Istio level (the ingress gateway).
We are not very familiar with how all of this works internally, so any guidance or ideas would be appreciated:

In what kinds of situations does the ingress gateway typically reset connections?
And are there any suggestions for how to debug this further?

Refs:
[1] AWS Load Balancer Controller: kubernetes-sigs/aws-load-balancer-controller on GitHub (a Kubernetes controller for Elastic Load Balancers)
[2] Target reset count metric reported by NLB: "CloudWatch metrics for your Network Load Balancer" (Elastic Load Balancing documentation)
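
In case it helps anyone line up the timestamps, here is a rough sketch of how the reset spikes from [2] could be pulled out of CloudWatch and compared against the istiod and gateway CPU graphs. It assumes boto3 is installed with working AWS credentials. The function names, the 5x-median threshold, and the load balancer identifier are placeholders for illustration, not something taken from our actual setup.

```python
from datetime import datetime, timedelta, timezone
from statistics import median


def reset_metric_query(load_balancer, hours=3, period=60):
    """Build keyword arguments for CloudWatch get_metric_statistics.

    load_balancer is the NLB's CloudWatch dimension value (placeholder here).
    """
    end = datetime.now(timezone.utc)
    return {
        "Namespace": "AWS/NetworkELB",
        "MetricName": "TCP_Target_Reset_Count",
        "Dimensions": [{"Name": "LoadBalancer", "Value": load_balancer}],
        "StartTime": end - timedelta(hours=hours),
        "EndTime": end,
        "Period": period,          # seconds per datapoint
        "Statistics": ["Sum"],
    }


def find_reset_spikes(datapoints, factor=5.0):
    """Return datapoints whose Sum exceeds factor x the median Sum.

    The threshold rule is an arbitrary assumption; tune it to your traffic.
    """
    if not datapoints:
        return []
    baseline = median(p["Sum"] for p in datapoints)
    threshold = max(baseline * factor, 1.0)  # avoid a zero threshold on quiet links
    spikes = [p for p in datapoints if p["Sum"] > threshold]
    return sorted(spikes, key=lambda p: p["Timestamp"])


def fetch_reset_spikes(load_balancer):
    """Query CloudWatch and return the spike datapoints (needs AWS access)."""
    import boto3  # assumption: boto3 installed and credentials configured

    cloudwatch = boto3.client("cloudwatch")
    response = cloudwatch.get_metric_statistics(**reset_metric_query(load_balancer))
    return find_reset_spikes(response["Datapoints"])
```

Calling `fetch_reset_spikes("net/<name>/<id>")` with the NLB's own dimension value would return the minutes where resets jumped, which can then be matched against gateway pod restarts, scaling events, or config pushes from istiod around the same time.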
