...
The main takeaway is that if your research data consists of many millions of small files spread across millions of directories, backing it up to a cloud provider can be both expensive and complex. In that case you may be better off keeping your data on Engram and letting Research Computing handle the backups.
Some Cloud Storage Provider Options and Pricing as of 11/2020 - The trade-offs are storage cost, API cost, and restore speed. "Archive" tier storage is listed, but we do not suggest using it for backups, primarily because of API-related costs.
Provider | Storage Cost per GB (per month) | API Request Cost | Restore Transfer Cost per GB |
---|---|---|---|
AWS S3 | $0.023 | $0.0004 - $0.005 per 10,000 depending on query type | $0.01 |
AWS Glacier | $0.004 | $0.0004 - $0.005 per 10,000 depending on query type | $0.01 |
Google Cloud Nearline Storage | $0.01 | $0.01 - $0.10 per 10,000 depending on query type | $0.01 |
Google Cloud Coldline Storage | $0.004 | $0.05 - $0.10 per 10,000 depending on query type | $0.01 |
Google Cloud Archive Storage | $0.0012 | $0.50 per 10,000 | $0.01 |
Azure Cool | $0.01 | $0.004 - $0.10 per 10,000 depending on query type | $0.01 |
Azure Archive | $0.00099 | $0.004 - $0.10 per 10,000 depending on query type; reads are $5 per 10,000 | $0.02 |
Backblaze B2 | $0.005 | $0.004 per 1,000 or $0.004 per 10,000 depending on query type | $0.01 |
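
To see why API costs can dominate for datasets made of many small files, here is a rough back-of-the-envelope sketch in Python. It plugs in the Google Cloud Archive Storage row from the table above. The dataset size (1,000 GB in 10 million files) is hypothetical, the function name `estimate_backup_cost` is made up for illustration, and the calculation assumes one API request per file uploaded, which is a simplification. Real backup tools may issue more requests per file, and providers may add fees not listed here.

```python
def estimate_backup_cost(total_gb, file_count, storage_per_gb,
                         api_per_10k, restore_per_gb):
    """Rough cost estimate in dollars, using rates as listed in the table.

    Assumes one API request per file uploaded (a simplification).
    Returns (monthly storage, one-time upload API cost, full-restore transfer).
    """
    monthly_storage = total_gb * storage_per_gb
    upload_api = file_count / 10_000 * api_per_10k
    restore_transfer = total_gb * restore_per_gb
    return monthly_storage, upload_api, restore_transfer


# Hypothetical dataset: 1,000 GB spread over 10 million small files,
# priced with the Google Cloud Archive Storage row of the table.
storage, api, restore = estimate_backup_cost(
    total_gb=1000,
    file_count=10_000_000,
    storage_per_gb=0.0012,
    api_per_10k=0.50,
    restore_per_gb=0.01,
)

print(f"Monthly storage:       ${storage:,.2f}")
print(f"Upload API requests:   ${api:,.2f}")
print(f"Full restore transfer: ${restore:,.2f}")
```

Under these assumptions the API requests for the initial upload cost far more than the monthly storage itself, which is the reason the archive tiers are not recommended for backups of this kind of data.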
...