Resilience Controller
请在任意节点执行以下步骤验证Resilience Controller的安装状态。
- 通过如下命令查看K8s集群中Resilience Controller的Pod,需要满足Pod的STATUS为Running,READY为1/1。
kubectl get pods -n mindx-dl -o wide | grep resilience-controller
回显示例:
1
resilience-controller-76f4476bb5-fs986 1/1 Running 0 6m52s 192.168.102.67 ubuntu <none> <none>
- 通过如下命令查看K8s集群中Resilience Controller的日志。
kubectl logs -n mindx-dl {Resilience组件Pod名称}
回显示例如下,表示组件正常运行。
1 2 3 4 5 6 7 8 9
root@ubuntu:~# kubectl logs -n mindx-dl resilience-controller-76f4476bb5-fs986 [INFO] 2022/11/17 17:18:46.697010 1 hwlog@v0.0.0/api.go:96 run.log's logger init success [INFO] 2022/11/17 17:18:46.697139 1 cmd/main.go:57 resilience-controller starting and the version is xxx_linux-x86_64 [INFO] 2022/11/17 17:18:47.227913 1 K8stool@v0.0.0/self_K8s_client.go:116 start to decrypt cfg [INFO] 2022/11/17 17:18:47.297559 1 K8stool@v0.0.0/self_K8s_client.go:125 Config loaded from file: ****tc/mindx-dl/resilience-controller/.config/config6 [INFO] 2022/11/17 17:18:47.300066 1 elastic/controller.go:45 Setting up elastic event handlers [INFO] 2022/11/17 17:18:47.300179 1 elastic/controller.go:63 Starting elastic controller, waiting for informer caches to sync [INFO] 2022/11/17 17:18:47.401246 1 cmd/main.go:80 elastic controller started ...
父主题: 组件状态确认