RWTH High Performance Computing (HPC)

You can find more information about the service in our documentation portal.


Recently expired reports

Slurm and HPC System Down

Outage
Wednesday 02/11/2026 09:19 AM - Wednesday 02/11/2026 12:00 PM

Dear Users,
We had to shut down our Slurm queuing system due to a potential security vulnerability.
New jobs will not start, and jobs already in the queue will not be cancelled. New jobs may still be accepted into the queue, but this is not guaranteed.
Currently running jobs may still be able to finish, but we might have to kill some long-running jobs.
The login nodes should still be available for normal data work.
All of this may change without notice during the day to accommodate further mitigation and resolution actions as needed.

11.02.2026 09:24
Updates

The vulnerability has been fixed and the cluster is back in operation.

13.02.2026 14:48

Connection to the cluster fails

Partial Outage
Saturday 02/14/2026 04:25 AM - Tuesday 02/17/2026 10:15 AM

Due to the limited availability of the GPFS, connections to the cluster could not be established.

The problem has been resolved and the cluster is available again.

17.02.2026 11:03

Maintenance Announcement for Munge Security Update

Maintenance
Tuesday 02/17/2026 12:00 PM - Tuesday 02/17/2026 01:00 PM

We would like to inform you that a security issue (CVE) has been discovered in the Munge software, which allows for the potential exposure of the Munge key. This key is critical for user authentication in Slurm jobs. To address this security vulnerability, we have rolled out a new version of Munge.
Although we assess the likelihood of the key being compromised as very low, we will be conducting maintenance on the cluster to exchange the key.
We kindly ask all users to review their jobs. If you notice any unknown jobs in your list, please delete them and inform us immediately.
For users with extremely sensitive data in their directories, we offer to temporarily disable job submission for your account upon request. Your account will be re-enabled after the maintenance is completed.
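For the job review requested above, the following is a minimal sketch of one possible approach, not an official tool. It assumes the standard Slurm commands squeue and scancel are available on the login node. The names KNOWN_NAMES, parse_squeue, unknown_jobs and list_my_jobs are hypothetical and chosen for illustration only. The script only lists jobs whose names you do not recognize and prints a suggested scancel command; it does not cancel anything itself.

```python
import getpass
import subprocess

# Hypothetical: job names you recognize as your own. Adjust before use.
KNOWN_NAMES = {"train_model", "postprocess"}


def parse_squeue(output):
    """Parse lines of the form 'jobid|state|name' into a list of dicts."""
    jobs = []
    for line in output.splitlines():
        line = line.strip()
        if not line:
            continue
        # The name comes last so that a '|' inside a job name is preserved.
        job_id, state, name = line.split("|", 2)
        jobs.append({"id": job_id, "state": state, "name": name})
    return jobs


def unknown_jobs(jobs, known_names):
    """Return the jobs whose name is not in the set of known names."""
    return [job for job in jobs if job["name"] not in known_names]


def list_my_jobs():
    """Query squeue for the current user's jobs (requires Slurm)."""
    result = subprocess.run(
        ["squeue", "-h", "-u", getpass.getuser(), "-o", "%i|%T|%j"],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_squeue(result.stdout)


if __name__ == "__main__":
    for job in unknown_jobs(list_my_jobs(), KNOWN_NAMES):
        print(f"Unrecognized job {job['id']} ({job['name']}, {job['state']})")
        print(f"  to cancel after checking: scancel {job['id']}")
```

Matching on job names is only a convenience for spotting candidates. Please check each listed job manually before cancelling it, and report anything you cannot explain.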

11.02.2026 11:53

Notice of Power Maintenance on CLAIX2023

Notice
Wednesday 02/04/2026 04:40 PM - Thursday 02/05/2026 10:00 AM

On February 5, 2026, from 09:00 to 10:00 AM, electrical work will be carried out that affects the air-cooled rack of CLAIX2023 as well as the InfiniBand switches of CLAIX2023. We would like to point out that the systems and switches are redundantly connected, so normally no service interruption is expected. However, in rare cases, brief interruptions may occur due to a misconfiguration.

04.02.2026 16:39