Istio Give 503 error with no healthy upstream when pods get evicted

Chandra · April 9, 2020, 7:26am

Istio Give 503 error with no healthy upstream when pods get evicted. Ideally pods goes down and come up when there are lack of resources on individual nodes to other nodes. During this time when it happens across multiple times and incase if it is taking a while for pod to come up, istio is still giving “no healthy upstream” error on browser when the external url for the pod is hit. Can anyone help me with this. Do I need to add any external flag or do I need to do any application level change. One solution I found is virtualService have retry strategies like attempts and perTryTimeout. But I don’t feel this is the right way to be implemented in production, either it should be something can be implemented in istio-proxy or istio-ingressgateway to get ride of this.

framled · July 24, 2020, 8:16pm

Did you fix this issue?
I have the exactly the same issue.

Chandra · July 28, 2020, 2:11am

Hi Framled,

I didn’t have the fix from istio but I can help you with the work around I did.

When issue occurs, recreate virtualservice for the application. Recreation of virtualservice will help istio to drop the current state (issue state where the calls are not forwarded to application from istioingress gateway) and creates a fresh connection.
Then we have stabilised our application our to not use more resources and the pods are not getting evicted now.
Please feel free to connect me via linked https://www.linkedin.com/in/chandra-sekhar-palakuri-3274a7175/

~
Thanks
Chandra

Hany_Mhajna1 · July 28, 2020, 10:05am

We face the same issue, after new deployment and new pods comes up, they look healthy , but when call them we got error 503
Pilot restart resolved the issue, but still dont know what the main reason for this behavior
we are running with istio 1.4.7v

Chandra · July 28, 2020, 11:59am

Hi, I guess my scenario and your’s is little different. In my case I am facing this when pods get evicted because lack of resources and they come up automatically. At this point thought pods were running the istio gives an 503 error. I have solved this by recreating virtualservice for the specific application.

Note : Please feel free to connect me via linked https://www.linkedin.com/in/chandra-sekhar-palakuri-3274a7175/

Nilamadhaba_Rath · August 3, 2020, 7:57am

I am facing the same issue.
Did you find a fix?

Topic		Replies	Views
503 errors even when pod is healthy	0	1082	May 5, 2020
Isto 503 no healthy upstream after pod killed Networking	1	1203	May 11, 2023
Intermittently "no healthy upstream" being returned when service is available - istiod restarts fixing the issue Networking	1	1773	April 12, 2023
Pod that return 503 are not called Networking	1	831	September 18, 2019
503s to one of two subsets despite all pods being healthy Networking	2	647	January 21, 2020

Istio Give 503 error with no healthy upstream when pods get evicted

Related topics