taki Cluster Environment Unplanned Downtime
To all taki users:
Starting at around 1:30pm EDT, portions of the taki cluster environment became unavailable.
Staff were able to determine that a portion of our network infrastructure failed. The issue has since been addressed and system administrators are currently in the process of rebooting all machines to ensure a uniform state of all hardware.
At this time, the login nodes are available and portions of the cluster hardware are still powering back on.
We apologize for any loss of running jobs or workflow.
As always, please let us know if you have any questions.
If you notice an issue, please submit a descriptive RT Help Ticket via the following link:
Thank you for your understanding,
Roy Prouty
System Administrator, UMBC HPCF
Posted: March 17, 2021, 6:19 PM