Patch nodes, rotate images, or upgrade the CSI plugin without interrupting workloads.
Select the maintenance guide you need from the navigation—node checklist, upgrade plan, and future topics all live underneath this page.
This is the multi-page printable view of this section. Click here to print.
Patch nodes, rotate images, or upgrade the CSI plugin without interrupting workloads.
Select the maintenance guide you need from the navigation—node checklist, upgrade plan, and future topics all live underneath this page.
kubectl cordon <node>
kubectl drain <node> --ignore-daemonsets --delete-emptydir-data
Because the Klustre CSI daemonset is a DaemonSet, it is unaffected by --ignore-daemonsets, but draining ensures your workloads move off the node before reboot.
kubectl get pods -n klustre-system -o wide | grep <node>
Expect the daemonset pod to terminate when the node drains and recreate once the node returns.
mount.lustre --version).kubectl uncordon <node>
If the node lost the lustre.csi.klustrefs.io/lustre-client=true label, reapply it after verifying Lustre connectivity.
kubectl rollout status daemonset/klustre-csi-node -n klustre-system
Use kubectl get pods for namespaces that rely on Lustre PVCs to ensure pods are running and mounts succeeded.
kubectl drain hangs due to pods using Lustre PVCs, identify them with kubectl get pods --all-namespaces -o wide | grep <node> and evict manually.Check the klustre-csi-plugin GitHub releases for breaking changes, minimum Kubernetes versions, and image tags.
image.tag and nodePlugin.registrar.image.tag in your values file, then run helm upgrade.manifests/configmap-klustre-csi-settings.yaml (nodeImage, registrarImage) and reapply the manifests.See Update the node daemonset image for detailed steps.
kubectl rollout restart daemonset/klustre-csi-node -n klustre-system
kubectl rollout status daemonset/klustre-csi-node -n klustre-system
The daemonset restarts one node at a time, keeping existing mounts available.
When upgrading kubelet:
kubectl exec into them and run df -h /mnt/lustre).FailedMount events exist.If the new version misbehaves:
nodeImage and related settings to the previous tag.kubectl rollout restart daemonset/klustre-csi-node -n klustre-system.