Health Management
Real-Life Health Management Strategies
Advanced Monitoring Setup
apiVersion: monitoring.coreos.com/v1 kind: PodMonitor metadata: name: app-metrics namespace: monitoring spec: selector: matchLabels: app.kubernetes.io/component: backend podMetricsEndpoints: - port: metrics interval: 15s scrapeTimeout: 10s namespaceSelector: matchNames: - production - staging# Monitor etcd health kubectl -n kube-system exec etcd-master -- etcdctl --endpoints=https://127.0.0.1:2379 \ --cacert=/etc/kubernetes/pki/etcd/ca.crt \ --cert=/etc/kubernetes/pki/etcd/server.crt \ --key=/etc/kubernetes/pki/etcd/server.key \ endpoint health # Check API server health kubectl get --raw='/healthz' # Check all component statuses kubectl get componentstatusesapiVersion: apps/v1 kind: DaemonSet metadata: name: node-problem-detector namespace: kube-system spec: selector: matchLabels: app: node-problem-detector template: metadata: labels: app: node-problem-detector spec: containers: - name: node-problem-detector image: k8s.gcr.io/node-problem-detector:v0.8.7 securityContext: privileged: true volumeMounts: - name: log mountPath: /var/log readOnly: true volumes: - name: log hostPath: path: /var/log
Proactive Health Maintenance
Cluster Recovery Procedures
Advanced Autoscaling
Best Practices
Cross-Cloud Health Management
References
Last updated