
Compute Cluster - Full global maintenance of the CLAIX HPC systems

Tuesday 30.09.2025 07:00 - Tuesday 30.09.2025 16:00

Our GPFS global filesystem needs to be updated, which will make the entire CLAIX HPC system unavailable. Please note the following:
- User access to the HPC system through login nodes, HPC JupyterHub, or any other connection will not be possible during the maintenance.
- No Slurm jobs or filesystem-dependent tasks will be able to run during the maintenance.
- Before the maintenance, Slurm will only start jobs that are guaranteed to finish before the maintenance begins; any running jobs must finish by then or might be terminated.
- Nodes might therefore remain empty in the lead-up to the maintenance, as Slurm clears them of user jobs.
- Waiting times before and after the maintenance might be longer than usual, as nodes are emptied beforehand and the queue of waiting jobs grows afterwards.
- Files in your personal or project directories will not be available during the maintenance.
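
As an illustration of the point about jobs having to finish before the maintenance, the following minimal sketch computes the longest time limit a job could request and still end before the window opens. It assumes that Slurm judges this by the job's requested time limit; the function name `max_time_limit` and the 10-minute `margin` are hypothetical choices for this example, not site policy.

```python
from datetime import datetime, timedelta

# Start of the maintenance window, taken from the announcement above.
MAINTENANCE_START = datetime(2025, 9, 30, 7, 0)


def max_time_limit(now: datetime, margin: timedelta = timedelta(minutes=10)) -> str:
    """Return the longest time limit (D-HH:MM:SS) that still ends before maintenance.

    `margin` is a hypothetical safety buffer for scheduling delay.
    """
    remaining = MAINTENANCE_START - now - margin
    if remaining <= timedelta(0):
        raise ValueError("no time left before the maintenance window")
    total = int(remaining.total_seconds())
    days, rest = divmod(total, 86400)      # whole days
    hours, rest = divmod(rest, 3600)       # whole hours
    minutes, seconds = divmod(rest, 60)    # minutes and leftover seconds
    return f"{days}-{hours:02d}:{minutes:02d}:{seconds:02d}"


# A job submitted exactly 24 hours before the window, with the default margin:
print(max_time_limit(datetime(2025, 9, 29, 7, 0)))  # -> 0-23:50:00
```

The resulting string is in the days-hours:minutes:seconds form accepted by `sbatch --time`, for example `sbatch --time=0-23:50:00 job.sh`, where `job.sh` stands for your own batch script. Jobs that request more time than this would be expected to wait until after the maintenance.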

Tue 16.09.2025 15:31

Compute Cluster - NFS disruption on the GPFS servers

Wednesday 17.09.2025 21:35 - Thursday 18.09.2025 10:06

All nodes are currently draining due to a disruption. We are working with the vendor to resolve it.

Thu 18.09.2025 09:16

Updates

The problem has been resolved, and the cluster is back in operation.

Thu 18.09.2025 10:06

Compute Cluster - $HOME and $WORK filesystems are unavailable again

Friday 29.08.2025 09:45 - Friday 29.08.2025 11:00

Due to issues with the underlying filesystem servers for $HOME and $WORK, the batch nodes are currently unavailable, and access to $HOME and $WORK on the login nodes is not possible.

Fri 29.08.2025 09:55

Updates

Access to the filesystems has been restored. We apologize for the inconvenience.

Fri 29.08.2025 11:38

Compute Cluster - $HOME and $WORK unavailable on login23-g-1

Tuesday 26.08.2025 15:00 - Wednesday 27.08.2025 11:30

Due to issues with the GPFS filesystem, $HOME and $WORK were not available on login23-g-1. The issue has been resolved.

Wed 27.08.2025 11:40