AImageLab SRV

Short power failure - resumption of operations


Dear users,

We are writing to inform you that a very brief power failure affected the entire campus.

The UPS worked as expected, so vital data was preserved, but the failure affected the SLURM controller, all web and virtualization services, and some production nodes.

At present, all non-computing services (web, virtualization) are back to normal and the all_serial nodes are available. We expect the production partitions to be back to normal within a few minutes.

Some jobs may have exited unexpectedly because of the power failure. Please check your jobs and restart them as needed.
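As a rough aid for checking your jobs, the sketch below filters SLURM accounting output for jobs that ended abnormally. It is only an illustration under stated assumptions: the function names, the selection of states treated as abnormal, and the start date (taken from the publication date of this notice) are ours, and it assumes that `sacct` is available on the login node and that job accounting is enabled.

```python
import getpass
import subprocess

# Assumption: states we treat as "did not finish normally".
ABNORMAL_STATES = {
    "FAILED", "NODE_FAIL", "CANCELLED", "TIMEOUT", "BOOT_FAIL", "OUT_OF_MEMORY",
}


def find_interrupted_jobs(sacct_output):
    """Return (job_id, job_name, state) tuples for jobs in an abnormal state.

    Expects pipe-separated lines with the fields JobID, JobName, State,
    as printed by `sacct -P -n --format=JobID,JobName,State`.
    """
    interrupted = []
    for line in sacct_output.splitlines():
        fields = line.split("|")
        if len(fields) < 3 or not fields[2].strip():
            continue  # skip blank or malformed lines
        job_id, job_name = fields[0], fields[1]
        # sacct can print e.g. "CANCELLED by 1000"; keep only the first word.
        state = fields[2].split()[0]
        if state in ABNORMAL_STATES:
            interrupted.append((job_id, job_name, state))
    return interrupted


def print_interrupted_jobs(since):
    """Query sacct for the current user's jobs started since `since` (YYYY-MM-DD)."""
    result = subprocess.run(
        [
            "sacct",
            "-u", getpass.getuser(),  # only the current user's jobs
            "-S", since,              # jobs from this start date onwards
            "-X",                     # one line per job allocation, no steps
            "-n", "-P",               # no header, pipe-separated output
            "--format=JobID,JobName,State",
        ],
        capture_output=True,
        text=True,
        check=True,
    )
    for job_id, job_name, state in find_interrupted_jobs(result.stdout):
        print(f"{job_id}\t{job_name}\t{state}")
```

Calling `print_interrupted_jobs("2024-10-08")` on a login node would print the jobs to review. Whether a listed job should be resubmitted depends on whether it saves checkpoints, so please treat the output as a starting point rather than a definitive list.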

Published: October 8, 2024