Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: for Rstudio, specifically referenced srun to a GPU as an option.

Table of Contents

Ginsburg is currently running Red Hat Enterprise Linux release 8.2.

The table below shows software already installed on the cluster system-wide.

...

Name

Version

Location / Module

Category

AFNI23.1.05module load AFNI/23.1.05Analysis of Functional Neuro Images

Anaconda Python 3.11.5 2023.09

Python 3.11.5

module load anaconda/3-2023.09

Python for Scientific Computing

Anaconda Python  2.7.16 2019.10

Python 2.7.16

module load anaconda/2-2019.10

Python for Scientific Computing

ancestry_hmmancestry_hmmmodule load ancestry_hmm/0.94Program designed to infer adaptive introgression from population genomic data.
ANTs2.4.4module load ANTs/2.4.4ANTs computes high-dimensional mappings to capture the statistics of brain structure and function

BCFtools

1.18

module load bcftools/1.18

Reading/writing BCF2/VCF/gVCF files and calling/filtering/summarising SNP and short indel sequence variants

cactus

2.6.7

module load cactus/2.6.7

Comparative Genomics Toolkit

candi

9.4.2-r3

module load candi/9.4.2-r3

The candi.sh shell script downloads, configures, builds, and installs deal.II with common dependencies on linux-based systems

cuda

12.0

module load cuda12.0/toolkit

GPU Computing

cudnn8.0module load cudnn8.0-cuda11.1CUDA Deep Neural Network library
freesurfer7.4module load freesurfer/7.4Brain imaging software
FSL6.0.5.2module load FSL/6.0.5.2Analysis tools for FMRI, MRI and DTI brain imaging data
gcc13.0.1module load gcc/13.0.1Compiler - C / C++
glpk5.0module load glpk/5.0C library for solving large-scale linear programming (LP), mixed integer programming (MIP)
gurobi10.0.3module load gurobi/10/0/3Prescriptive analytics platform and a decision-making technology 
hdf51.10.1module load hdf5/1.10.1High performance data software library & file format
htslib

1.19

module load htslib/1.19

A C library for reading/writing high-throughput sequencing data
Intel oneAPI toolkit

various

module load intel-oneAPI-toolkit <library>

Core set of tools and libraries for developing high-performance, data-centric applications across diverse architectures. 
intel-parallel-studio

2020 Update 4

module load intel-parallel-studio/2020Intel Compiler 
julia1.5.3module load julia/1.5.3Programming Language
knitro13.2.0module load knitro/13.2.0Software package for solving large scale nonlinear mathematical optimization problems; short for "Nonlinear Interior point Trust Region Optimization"
lastz1.04.15module load lastz/1.04.15A program for aligning DNA sequences, a pairwise aligner.
LBPM22.08module load LBPM/22.08Model flow processes based on digital rock physics
leptonica

1.83.0

module load leptonica/1.83.0A ipedagogically-oriented open source library containing software that is broadly useful for image processing and image analysis 
libRadtran2.0.5modulel load libRadtran/2.0.5A library for radiative transfer
Mathematica13.2module load Mathematica/13.2Numerical Computing
Matlab2023amodule load matlab/2023aNumerical Computing
metis

5.1.0

module load metis/5.1.0

A set of serial programs for partitioning graphs, partitioning finite element meshes.
MUMPS

5.6.2

module load mumps/5.6.2

MUltifrontal Massively Parallel Sparse direct Solver
netcdf-fortran-intel4.5.3

module load netcdf-fortran-intel/4.5.3

Array Interface Library
netcdf/gcc4.7.4module load netcdf/gcc/64/gcc/64/4.7.4Array Interface Library
Nextflow

23.10.0

module load nextflow/23.10.0Enables scalable and reproducible scientific workflows using software containers.
occswV2022.3module load ocsswOcean Color Science Software, CLI version
octave5.2.0module load octaveInstalled on all compute nodes, start with 'octave'. Mathematics-oriented syntax with built-in 2D/3D plotting and visualization 
OpenFOAMv2206module load OpenFOAM/v2206Computational fluid dynamics.
openmpi4.1.5a1openmpi/gcc/64/4.1.5a1OpenMPI Compiler
Quantum Espresso7.2module load QE/7.2Quantum Espresso
R3.6.3module load R/3.6.3Programming Language
R4.3.1module load R/4.3.1Programming Language
samtools1.19module load samtools/1.19Suite of programs for interacting with high-throughput sequencing data

ScaLAPACK 

2.2.0module load scalapack/2.2.0Scalable Linear Algebra PACKage
Schrodinger2024-1module load schrodinger/2024-1Modeling, analysis and computational tasks

SeaDAS

8.4.1module load seadas/8.4.1

A comprehensive software package for the processing, display, analysis, and quality control of ocean color data. Requires XQuartz on a Mac or Mobaxterm on Windows.

Singularity3.7.1module load singularity/3.7.1

Run Docker-like containers

Stata18

module load stata/18

General-purpose statistical software
stopos0.93module load stoposCreate and manage computing tasks
tesseract5.3.1module load tesseractOCR
VCFTools0.1.17module load vcftools/0.1.17A set of tools written in Perl and C++ for working with VCF files
vim9.1module load vim/9.1vi improved test editor

Visual Studio Code Server


Not a moduleA server side Integrated Development Environment hosted on Ginsburg compute nodes
WIEN2kWIEN2k_21.1module load WIEN2k_21.1Perform electronic structure calculations of solids using density functional theory.
workbench1.5.0module load workbench/1.5.0Visualization and Discovery tool used to map neuro-imaging data 

...

On Ginsburg, rstudio can be loaded by leveraging an interactive session, i.e., srun, and using a Singularity container via the Rocker Project which provides containers from where this tutorial comes . First you will need to download the Singularity image. This will require two terminal sessions. One session will be used to connect to a compute host and run rstudio. The other terminal session will be used to initiate SSH Port Forwarding/Tunneling to access the resource. The first step is to request an interactive session so you can run rstudio server from a compute node as follows. Please be sure to change your group as noted by "-A".

Code Block
srun -X -A <GROUP> --pty -t 0-01:00 -X /bin/bash

OR, if you want a GPU node

srun -X -A <GROUP> --pty -t 0-01:00 -X --gres=gpu:1 /bin/bash

Load the Singularity module:

...

Code Block
singularity pull --name rstudio.simg docker://rocker/rstudio:4.3.1

In order for RStudio to start in a browser via an interactive session you will need the IP address of the compute node. Note that the IP below will likely be different for you:

Code Block
IP_ADDR=`hostname -i`
echo $IP_ADDR
10.197.16.39

From RStudio 4.2 and later, some added security features require binding of a locally created database file to the database in the container. Don't forget to change the password.

Code Block
mkdir -p run var-lib-rstudio-server

printf 'provider=sqlite\ndirectory=/var/lib/rstudio-server\n' > database.conf

PASSWORD='CHANGEME' singularity exec \
   --bind run:/run,var-lib-rstudio-server:/var/lib/rstudio-server,database.conf:/etc/rstudio/database.conf \
   rstudio.simg \
   /usr/lib/rstudio-server/bin/rserver --auth-none=0 --auth-pam-helper-path=pam-helper --server-user=$USER

This will run the Rstudio server in a Singularity container.


PART 2:  Port forwarding and access from your browser

...