We will be doing maintenance on our virtual server infrastructure on Wednesday August 5th, starting at 8AM. This will require downtime for the virtual servers hosted on this infrastructure.
Pronto (just the login/head node, and not the full cluster) will be down for a couple hours starting at 7AM on July 21st (one week from today). Batch jobs will continue to run uninterrupted, but interactive sessions (srun, tmux, screen) will be terminated.
Services will be down for patching January 9th.
LSS maintenance is complete, and service has been restored. Please email email@example.com with any issues.
We are planning to take down LSS for maintenance on Wed July 17 2019.
During this maintenance window, we will be modifying the LSS networks
to comply with the new Netcom VLAN architecture, and will be applying
Updates are complete & things are back up and running.
Reminder: legion-[1-8] & gpu01 have moved behind pronto, and you'll
need to use slurm on pronto to submit jobs to those servers now.
We will be taking down the ResearchIT servers Wed Mar 20th at 9AM for
some updates & maintenance. We expect to be back up by late afternoon.
The issue with /work has been resolved
We are currently experiencing an issue with one of the lustre servers that composes the /work directories. We are investigating and will post here when we know more.
The issue on condo seems to be resolved, and jobs are running successfully again.
Condo is experiencing intermittent job failures this morning. The HPC team is aware, and looking into the issue.
Condo is operational again
The Condo cluster is currently experiencing an issue, and is being worked on.
They Cyence cluster head-node died last night, and is also in the process of being fixed.
For the past few years, usage of our #bitcom IRC channel has declined and we're now retiring that service. Please contact us through email to firstname.lastname@example.org for all support requests, which will ensure that our full group can respond, and t
# Research IT - Sep '18 Update
## Downtime for patching: Sep 25th
* What: Server downtime
* When: 9/25 9AM-5PM
# Research IT - June '18 Update
## Downtime for Patching June 26th
* What: Server maintenance
* When: 6/26 9AM
* How long: approx 8 hours
The LSS system will experience a brief outage October 17, 2017 at noon for an update in the data center related to connected systems dropping connections. The system will be unavailable during the update, and is expected to be back online in less than an hour.
For labs that have migrated from Isilon to LSS storage:
The LSS system will experience a brief outage tomorrow at noon for a network update in the data center. The system will be unavailable during the update, and is expected to be back online in less than an hour.
If you're a condo user, and haven't already seen the MOTD, please note that the cluster will go down for security patching on 7/5.