wiki:cypress/software

Version 13 (modified by fuji, 44 hours ago) ( diff )

Available Software

'module command shows a list of available software and set-up environment variables.

Below are some of the software programs installed on Cypress.

Compiler and Programing Environment

name description URL license installed version(As of Nov 2024) latest version(As of Nov 2024)
MatlabMATLAB is a proprietary multi-paradigm programming language and numeric computing environment developed by MathWorks.https://www.mathworks.com/products/matlab.htmlcommercial (site license)R2013b, R2015a, R2015b, R2016a, R2017b, R2020a, R2022b, R2023aR2024b
Rsoftware environment for statistical computing and graphicshttps://www.r-project.org/Public3.1.2, 3.2.5, 3.3.1, 3.4.1, 3.5.2, 3.6.1, 4.1.0, 4.1.1, 4.3.24.4.2
Anacondadistribution of the Python and R programming languages for scientific computinghttps://www.anaconda.com/free2.1.0, 2.5.0, 4.0.0, 5.1.0, 2018.12, 2019.03, 2020.07, 2023.072024.2.1
StataStata is a general-purpose statistical software package developed by StataCorp for data manipulation, visualization, statistics, and automated reporting.https://www.stata.com/commercial (site license (sequential version only))14, 1518
SASstatistical software suitehttps://www.sas.com/Commercial (site license)9.49.4
gccGNU Compiler Collection includes front ends for C, C++, Objective-C, Fortran, Ada, Go, D and Modula-2 as well as libraries for these languages.https://gcc.gnu.org/GPL4.7.4. 4.8.2, 4.9.4, 6.3.0, 8.5.0, 9.5.014.2
rubyA dynamic, open-source programming language with a focus on simplicity and productivity.https://www.ruby-lang.org/en/BSD2.2.3, 2.5.13.3.5
JuliaJulia is a high-level, general-purpose dynamic programming language, still designed to be fast and productive, for e.g. data science, artificial intelligence, machine learning, modeling, and simulation, most used for numerical analysis and computational science.https://julialang.org/MIT1.5.21.11.1
Intel-psxeParallel Studio is composed of several component parts, Intel C/C++/Fortran compiler with OpenMP. Math Kernel Library (MKL), Intel MPI Library, etc.https://www.intel.com/commercial2015-update1, 2016, 2019-update1software rebranded to oneAPI toolkits.

Bioinformatics Tools

name description URL license installed version(As of Nov 2024) latest version(As of Nov 2024)
Ancestry_HMMprogram designed to infer adaptive introgression from population genomic datahttps://github.com/russcd/Ancestry_HMMGPL-3.01.0.21.0.2
Angsdanalyzing NGS datahttps://www.popgen.dk/angsd/index.php/ANGSDPublic0.9410.941
bam-readcountcount DNA sequence reads in BAM fileshttps://github.com/genome/bam-readcountMIT1.0.01.0.1
Samtoolsa suite of programs for interacting with high-throughput sequencing datahttps://www.htslib.org/public0.1.17, 0.1.19, 1.5, 1.10, 1.13, 1.16.11.21
Bcftoolsprograms for interacting with high-throughput sequencing datahttps://samtools.github.io/bcftools/public1.161.21
Htsliban implementation of a unified C library for accessing common file formats, such as SAM, CRAM, VCF, and BCF, used for high-throughput sequencing data. It is the core library used by samtools and bcftools.https://www.htslib.org/public1.13, 1.191.21
Bcl2fastq2demultiplex data and convert Illumina BCL files to FASTQ file formatshttps://emea.support.illumina.com/sequencing/sequencing_software/bcl2fastq-conversion-software.htmlMIT2.17.1, 2.202.2
Bedopsfast, highly scalable and easily-parallelizable genome analysis toolkithttps://github.com/bedops/bedopsGPL-22.4.412.4.41
Bedtools2a powerful toolset for genome arithmetic.https://bedtools.readthedocs.io/en/latest/MIT2.27.1, 2.30.02.31.1
Beaglephasing genotypes and imputing ungenotyped markershttps://faculty.washington.edu/browning/beagle/beagle.htmlGPL5.45.4
BgenBGEN is a compressed binary format for typed and imputed genotype datahttps://enkre.net/cgi-bin/code/bgen/dir?ci\=trunkBoost Software License v1.0, BSD1.1.71.1.7
bowtiean ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequenceshttps://bowtie-bio.sourceforge.net/bowtie2/index.shtmlpublic1.1.1, 2.3.3, 2.5.12.5.4
Bwasoftware package for mapping low-divergent sequences against a large reference genome, such as the human genomehttps://bio-bwa.sourceforge.net/GPL-30.7.15, 0.7.170.7.18
CellrangerA set of analysis pipelines that perform sample demultiplexing, barcode processing, single cell 3' and 5' gene counting, V(D)J transcript sequence assembly and annotation, and Feature Barcode analysis from single cell data.https://www.10xgenomics.com/softwarelimited1.1.0, 2.1.0, 3.0.1, 3.1.0, 5.1.0, 6.0.0, 7.1.0
GatkGenome Analysis Toolkithttps://gatk.broadinstitute.org/hc/en-uspublic4.0.4.0, 4.1.8.1, 4.5.0.04.6.0.0
ncbi-blastThe program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance.https://blast.ncbi.nlm.nih.gov/Blast.cgi?CMD\=Web&PAGE_TYPE\=BlastHomepublic2.5.0+, 2.10.0+, 2.12.0+2.16.0+
regeniewhole genome regression modelling of large genome-wide association studies.https://rgcgithub.github.io/regenie/MIT2.2.4, 3.2.73.6
RsemRNA-Seq by Expectation-Maximizationhttps://deweylab.github.io/RSEM/GPL1.2.19, 1.2.22, 1.2.311.3.3
Stackssoftware pipeline for building loci from short-read sequences, such as those generated on the Illumina platform. https://catchenlab.life.illinois.edu/stacks/GPL1.41, 2.22.68
starSpliced Transcripts Alignment to a Referencehttps://github.com/alexdobin/STARMIT2.3.0e, 2.4.0i, 2.4,2a, 2.5.2a, 2.7.19a, 2.7.11b2.7.11b
Trinityassembles transcript sequences from Illumina RNA-Seq datahttps://github.com/trinityrnaseq/trinityrnaseq/releasesBSD-32012-06-08, 2.4.0, 2.9.52.15.2
qiime2QIIME 2™ (pronounced “chime two” 🔔) is a microbiome multi-omics bioinformatics and data science platform that is trusted, free, open source, extensible, and community developed and supported.https://qiime2.org/2018.2, 2018.11, 2023.22024.1

Computational Chemistry and Physics

name description URL license installed version(As of Nov 2024) latest version(As of Nov 2024)
ROOThigh-energy physics toolshttps://root.cernLGPL v2.1+5.34.366.32.06
Berkeleygwcode that calculates the quasiparticle properties and optical responses of materialshttps://berkeleygw.org/BSD1.1-beta24
GaussianStarting from the fundamental laws of quantum mechanics, Gaussian 16 predicts the energies, molecular structures, vibrational frequencies and molecular properties of compounds and reactions in a wide variety of chemical environments.https://gaussian.com/commercial (IT purchased DVD)09, 1616
Namd2a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems.https://www.ks.uiuc.edu/Research/namd/public2.13b23.0.1
Gromacsmolecular dynamics package designed for simulations of proteins, lipids, and nucleic acids.https://www.gromacs.org/public2016.3, 2020.72024.3
LammpsLarge-scale Atomic/Molecular Massively Parallel Simulator is a molecular dynamics program from Sandia National Laboratories.https://www.lammps.org/#gsc.tab\=0GPL17Nov2016, 2Aug202329-Aug-24
Rosettaa software suite of algorithms for computational modeling and analysis of protein structures.https://rosettacommons.org/software/Non-Commercial License3.93.14
ComsolCOMSOL Multiphysics is a finite element analyzer, solver, and simulation software package for various physics and engineering applications, especially coupled phenomena and multiphysics.https://www.comsol.com/commercial (Dr. Escarra lab)5.2, 5.3, 5.3a, 5.4, 5.6, 6.0, 6.1, 6.2 6.2

Developers Tools and Libraries

name description URL license installed version(As of Nov 2024) latest version(As of Nov 2024)
OpenBLASOptimized BLAS (Basic Linear Algebra Subroutine) libraryhttps://www.openblas.net/BSD0.3.180.3.28
ArmadilloC++ library for linear algebra & scientific computinghttps://arma.sourceforge.net/Apache License 2.012.6.614.0.3
AtlasAutomatically Tuned Linear Algebra Softwarehttps://math-atlas.sourceforge.net/BSD3.10.23.10.3
FftwFFTW is a C subroutine library for computing the discrete Fourier transform (DFT).https://www.fftw.org/GPL2.1.5, 3.3.3, 3.3.43.3.10
HDF5Hierarchical Data Format (HDF) is a set of file formats designed to store and organize large amounts of data.https://www.hdfgroup.org/solutions/hdf5/free1.6.10, 1.8.12, 1.8.14, 1.0.51.14.5
Binutilscollection of binary toolshttps://www.gnu.org/software/binutils/GPL2.372.43
blcrBerkeley Lab Checkpoint/Restarthttps://crd.lbl.gov/divisions/amcr/computer-science-amcr/class/research/past-projects/BLCR/public0.8.50.8.5
BoostC++ source librarieshttps://www.boost.org/Boost Software License1.57.0, 1.76.01.86.0
Cmakebuild system generatorhttps://cmake.org/open-source3.0.2, 3.12.1, 3.24.1 3.30.5
OpenmpiMPI libraryhttps://www.open-mpi.org/BSD1.8.4, 4.1.6, 5.0.55.0.5
valgrindValgrind is an instrumentation framework for building dynamic analysis tools. There are Valgrind tools that can automatically detect many memory management and threading bugs, and profile your programs in detail. You can also use Valgrind to build new tools.https://valgrind.org/GPL23.21.03.24.0
eigenEigen is a C++ template library for linear algebra: matrices, vectors, numerical solvers, and related algorithms.https://eigen.tuxfamily.org/3.2.43.4.0
netcdfNetCDF (Network Common Data Form) is a set of software libraries and machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data.https://www.unidata.ucar.edu/software/netcdf/free4.3.24.9.2
GSLThe GNU Scientific Library (GSL) is a numerical library for C and C++ programmers.https://www.gnu.org/software/gsl/GPL2.62.8

Other

name description URL license installed version(As of Nov 2024) latest version(As of Nov 2024)
AwscliAWS Command Line Interfacehttps://aws.amazon.com/cli/Public1.29.11.35.11
BbcpSecurely and quickly copy data from source to targethttps://www.slac.stanford.edu/~abh/bbcp/GPL15.02.03.01.115.02.03.01.1
Cadavercommand-line WebDAV client for Unixhttps://notroj.github.io/cadaver/GPL-20.23.30.24
SingularityApptainer (formerly Singularity) simplifies the creation and execution of containers, ensuring software components are encapsulated for portability and reproducibility.https://apptainer.org/publicsingularity/3.9.0apptainer/1.3.4
SratoolkitThe SRA Toolkit and SDK from NCBI is a collection of tools and libraries for using data in the INSDC Sequence Read Archives.https://hpc.nih.gov/apps/sratoolkit.htmlpublic3.0.03.1.1
RcloneRclone is a command-line program to manage files on cloud storage such as Boxhttps://rclone.org/MIT1.49.31.68.1
Gnuparallelcommand-line utility allows the user to execute shell scripts or commands in parallel.https://www.gnu.org/software/parallel/GPL20180322, 2023012220241022
Globus Connect PersonalCreate a Globus collection Create a Globus collection .https://www.globus.org/globus-connect-personal/public
Note: See TracWiki for help on using the wiki.