Research Computing and Data

Upcoming Maintenance for Summer 2023

The RCD team has planned a scheduled maintenance window for the Palmetto Cluster between Monday, July 10th and Friday, July 14th, 2023. Users should expect the cluster to be unavailable during this time, and any jobs running when maintenance begins will be canceled.

We’ll be working on:

  • Introducing a new mini-cluster as a test for the SLURM scheduler, which will replace PBS on Palmetto during a future maintenance window.
  • Updating configuration on compute nodes to improve I/O performance to the Indigo data lake.
  • Preparing for the end-of-life for /scratch1 and /fastscratch, which is planned for July of 2024.
  • Upgrading infiniband drivers to improve stability and NVIDIA drivers to support newer versions of CUDA.

Scratch Storage Update

An issue with the metadata servers caused an outage for /scratch1 and /fastscratch, starting on March 16th, 2023. Our storage architects have worked diligently this afternoon to re-initialize the /scratch1 and /fastscratch file systems.

While working on this issue, we have also brought our new /scratch storage system online. This is a new all-flash storage system that will be the new primary scratch storage space for Palmetto for the forseeable future.

All scratch file systems are now back online and available for use across the cluster.

However, we encourage users to continue using /scratch for their temporary storage needs. We plan to retire /scratch1 and /fastscratch during a future maintenance period, but no definitive retirement date has been set.