Overview
In response to ever increasing requirements for lower-latency and higher throughput file services to support modern AI/ML workloads, we have added an all-NVMe based WekaFS tier to our storage profile. This highest performance parallel filesystem is directly attached to our new HPC cluster serving I/O over HDR200/100 InfiniBand. The existing Isilon NAS has been moved exclusively to Campus-wide SMB service for the labs and desktops.
All storage systems, lab instruments, and collaborating institutions are connected via GLOBUS, a de-facto standard for data sharing platform among all premier Universities, research Labs and National data repositories. GLOBUS provides very secure & efficient data movements between any GLOBUS-connected end-point, within a room to across the globe, from the convenience of a web browser. This allows users to plan & move their data between Weka and Isilon systems, or anywhere else, in preparation for running their HPC jobs.
What’s NEW
...
with New C2B2 storage setup?
Storage Tiers | C2B2 Cluster | New Cluster |
---|---|---|
home | home/PI_gp/userUNI | users/userUNI |
data | data/PI_gp/userUNI | groups/PI_gp/userUNI |
archive | archive/PI_gp/userUNI | archive/PI_gp/userUNI |
scratch | scratch/PI_gp/userUNI | localscratch/userUNI |
...
No matter how fast filesystem you have, there is VERY HIGH overhead in opening a file for read or write ops. This will certainly add up if you have large number of small files, and you will be surprised how bad it could get. We cannot emphasize enough that one should avoid creating large number of small files at all cost!!! In addition, use only plain-English characters, without spaces, without special characters, to name the files & folders.
Groups and Archive tier storage are available in capacity increments of 1TB.
...