Retries for mixer not work with UNAVAILABLE status?

vladK · March 26, 2020, 11:34am

Hi,

I am trying to keep cluster high availability. According to the scenario, one of the nodes with the policy pod may fall, but the cluster cannot lose the request.

I testing the scenario, the user sends a request for a service for which JWT authorization is configured using Istio. When a node crashes, I often get error 503. I found on the official site of Istio that something is connected with an error connecting to the mixer:

UNAVAILABLE : Envoy cannot connect to Mixer and the policy is configured to fail close.

I tried to set up a retry policy for this case, but it looks like Istio is not handling this case. I try:

*retries:*

```
 attempts: 30*
```
```
 perTryTimeout: 1s*
```

 retryOn: 500,503,504,retriable-status-codes,unavailable*

timeout: 30s*

But I still get the error. I also tried to modify the FAIL_CLOSE policy by adding additional retries for HTTP and TCP config:

“typed_config”: {
“@type”: “type.googleapis.com/istio.mixer.v1.config.client.HttpClientConfig”,
“transport”: {
“network_fail_policy”: {
“policy”: “FAIL_CLOSE”,
“max_retry”: 10,
“base_retry_wait”: “0.100s”,
“max_retry_wait”: “1.300s”
}

“name”: “mixer”,
“typed_config”: {
“@type”: “type.googleapis.com/istio.mixer.v1.config.client.TcpClientConfig”,
“transport”: {
“network_fail_policy”: {
“policy”: “FAIL_CLOSE”,
“max_retry”: 10,
“base_retry_wait”: “0.100s”,
“max_retry_wait”: “1.300s”
}

I did not find any changes from this configuration at all. I set up outlier detection and it looks like it really works and throws unheatly endpoint out of the connection pool, however, will I still get a 503 error on the first request.

I am new to Istio and would like to ask the community if there are ways to solve my problem and why the retries for this error are ignored, while I set up the rule in retry-on?

Topic		Replies	Views
Retry option in Istio 503: Istio 1.3.4 Config	0	918	November 23, 2020
Pod that return 503 are not called Networking	1	830	September 18, 2019
Help regarding "unresolved" requests	1	724	February 6, 2019
istio-Ingressgateway pod is failing with 503 error Networking	1	1968	June 25, 2019
503 upstream connect error or disconnect/reset before headers. reset reason: connection failure	3	3389	August 17, 2020

Retries for mixer not work with UNAVAILABLE status?

Related topics