GPU Pod Rebuild Risks
Check Items
Check whether restarting kubelet during the cluster upgrade will cause GPU service pods in the cluster to be rebuilt.
Solution
Upgrade the cluster at a time when the impact on services is controllable, for example, during off-peak hours.
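Before choosing an upgrade window, it can help to know which pods might be affected. The following is a minimal sketch, not part of the pre-upgrade check itself: it lists pods whose containers request a GPU resource, using the JSON output of `kubectl get pods`. The resource name `nvidia.com/gpu` and the helper name `gpu_pods` are assumptions for illustration; adjust the resource name to whatever your GPU device plugin registers. The script does not predict whether a pod will be rebuilt, it only shows which workloads to watch.

```python
import json
import subprocess

# Assumption: GPUs are exposed under this extended resource name.
# Change it if your cluster's device plugin registers a different one.
GPU_RESOURCE = "nvidia.com/gpu"


def gpu_pods(pod_list, resource=GPU_RESOURCE):
    """Return (namespace, name, node) for each pod with a container requesting `resource`."""
    found = []
    for pod in pod_list.get("items", []):
        spec = pod.get("spec", {})
        containers = spec.get("containers", []) + spec.get("initContainers", [])
        for container in containers:
            resources = container.get("resources") or {}
            limits = resources.get("limits") or {}
            requests = resources.get("requests") or {}
            if resource in limits or resource in requests:
                meta = pod.get("metadata", {})
                found.append((meta.get("namespace"), meta.get("name"), spec.get("nodeName")))
                break  # one match per pod is enough
    return found


if __name__ == "__main__":
    # Requires kubectl to be installed and configured for the target cluster.
    raw = subprocess.run(
        ["kubectl", "get", "pods", "--all-namespaces", "-o", "json"],
        check=True, capture_output=True, text=True,
    ).stdout
    for namespace, name, node in gpu_pods(json.loads(raw)):
        print(f"{namespace}/{name} on node {node}")
```

Running the script before the upgrade gives a list of GPU workloads and the nodes they run on, which can be compared with the pod list after the upgrade to see whether any were rebuilt.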
If you need help, submit a service ticket to contact O&M personnel.