Running into issues getting the Kiali operator v1.16.0 reinstalled. We recently upgraded our EKS cluster to 1.15.x, so not sure if that is the issue. Our script calls the deploy-kiali-operator.sh script with the following params:
deploy-kiali-operator.sh -kcr kiali-cr.yaml -oiv v1.16.0
If I check in the kiali-operator namespace, there is a kiali-operator pod running, but there is no kiali pod in the istio-system namespace. Any ideas what could be causing this? Are there compatibility issues with Kiali v1.16.0 and EKS 1.15.x?
ERROR: The Kiali operator is not running yet. Please make sure it was deployed successfully.
Something is wrong with the operator pod starting up. You’ll have to find out why it didn’t start (look at the k8s event logs as a start). The script won’t install the Kiali CR until it sees the operator come up. See:
Kiali Operator pod appears to start just fine. If I check my deployment logs hangs a long time deploying the operator, but the kiali operator seems to have started right away.
NAME READY STATUS RESTARTS AGE
kiali-operator-749bd695c8-fzk8s 2/2 Running 0 8m1s
Checked the events of the operator pod and don’t see anything out of the ordinary:
Events:
Type Reason Age From Message
Normal Scheduled 9m50s default-scheduler Successfully assigned kiali-operator/kiali-operator-749bd695c8-fzk8s to ip-xx-xx-93-xxx.ec2.internal
Normal Pulling 9m48s kubelet, ip-xx-xx-93-xxx.ec2.internal Pulling image “Quay”
Normal Pulled 9m34s kubelet, ip-xx-xx-93-xxx.ec2.internal Successfully pulled image “Quay”
Normal Created 9m30s kubelet, ip-xx-xx-93-xxx.ec2.internal Created container ansible
Normal Started 9m30s kubelet, ip-xx-xx-93-xxx.ec2.internal Started container ansible
Normal Pulled 9m30s kubelet, ip-xx-xx-93-xxx.ec2.internal Container image “Quay” already present on machine
Normal Created 9m30s kubelet, ip-xx-xx-93-xxx.ec2.internal Created container operator
Normal Started 9m30s kubelet, ip-xx-xx-93-xxx.ec2.internal Started container operator
Still not seeing the kaili pod created in istio-system namespace:
We will also take a look at changing the deployment script in the near future, but for now would like to get this working again. Thanks so much for your help.