wiki:cypress/WGSA

Version 2 (modified by fuji, 4 years ago) ( diff )

Installing and Setup WGSA in a local directory on Cypress

Decide a folder dedicated for the pipeline, for example '/lustre/project/group/WGSA'.

Setup an environment variable and create workspaces as

export $WGSA_DIR=/lustre/project/group/WGSA
mkdir $WGSA_DIR
cd $WGSA_DIR
mkdir work
mkdir tmp
chmod 777 work
chmod 777 tmp

Create a space for ANNOVAR,

mkdir $WGSA_DIR/annovar

Download the ANNOVAR main package from here. The package comes as annovar.latest.tar.gz, save it to $WGSA_DIR/annovar. Unzip it.

cd $WGSA_DIR/annovar
tar -zxvf annovar.latest.tar.gz

Download RefSeq and !Ensembl gene models for ANNOVAR:

cd $WGSA_DIR/annovar/annovar
perl annotate_variation.pl -buildver hg19 -downdb -webfrom annovar refGene humandb/
perl annotate_variation.pl -buildver hg19 -downdb -webfrom annovar ensGene humandb/
perl annotate_variation.pl -buildver hg19 -downdb -webfrom annovar knownGene humandb/
perl annotate_variation.pl -buildver hg38 -downdb -webfrom annovar refGene humandb/
perl annotate_variation.pl -buildver hg38 -downdb -webfrom annovar ensGene humandb/     
perl annotate_variation.pl -buildver hg38 -downdb -webfrom annovar knownGene humandb/    

Install SnpEff (required for annotating indels with SnpEff or annotating SNVs with SnpEff on-the-fly) Download SnpEff v4.3t main package and save the zip file to $WGSA_DIR/snpeff:

mkdir $WGSA_DIR/snpeff
cd $WGSA_DIR/snpeff
wget http://sourceforge.net/projects/snpeff/files/snpEff_v4_3t_core.zip
unzip snpEff_v4_3t_core.zip

To use a newer version of JavaSDK, you have to login to a computing node.

Start a interactive session:

idev

See here for more about 'idev'.

Once you get to a computing node, make sure your corrent directory is $WGSA_DIR/snpeff

Download RefSeq and Ensembl gene models for SnpEff:

module load java-openjdk/1.8.0
cd snpEff
java -jar snpEff.jar download -v hg19
java -jar snpEff.jar download -v GRCh37.75
java -jar snpEff.jar download -v hg38
java -jar snpEff.jar download -v GRCh38.86

Exit from the computing node:

exit

Install htslib, which is required for VEP API.

mkdir $WGSA_DIR/htslib
cd $WGSA_DIR/htslib
wget https://github.com/samtools/htslib/releases/download/1.9/htslib-1.9.tar.bz2
tar -vxjf htslib-1.9.tar.bz2
cd htslib-1.9
make prefix=$WGSA_DIR/htslib install

Setup the environmental variables

export PATH=$WGSA_DIR/htslib/bin:$PATH
export CPATH=$WGSA_DIR/htslib/include:$CPATH
export LD_LIBRARY_PATH=$WGSA_DIR/htslib/lib:$LD_LIBRARY_PATH

Install VEP (required for annotating indels with VEP or annotating SNVs with VEP on-the-fly)

Download VEP 94 main package and save it to $WGSA_DIR/vep:

mkdir $WGSA_DIR/vep
cd $WGSA_DIR/vep
wget https://github.com/Ensembl/ensembl-vep/archive/release/94.zip
unzip 94.zip

Install VEP API to /WGSA/vep and download RefSeq and Ensembl gene models to $WGSA_DIR/.vep

cd $WGSA_DIR/vep/ensembl-vep-release-94/
mkdir $WGSA_DIR/.vep
perl INSTALL.pl -c $WGSA_DIR/.vep --ASSEMBLY GRCh37

Go through the steps of the installing process and following the guidance at http://useast.ensembl.org/info/docs/tools/vep/script/vep_tutorial.html. When being asked for the cache files, choose “242 : homo_sapiens_merged_vep_94_GRCh37.tar.gz”. When being asked for fasta files, choose “27 : homo_sapiens”. When being asked for the plugins, choose "7:LOF". The fasta file downloading is required for the current version of WGSA.

perl INSTALL.pl -c $WGSA_DIR/.vep --ASSEMBLY GRCh38

When being asked for the cache files, choose "243 : homo_sapiens_merged_vep_94_GRCh38.tar.gz". When being asked for fasta files, choose “54: homo_sapiens”. When being asked for the plugins, choose "n" as LOF has already been installed.

chmod 777 $WGSA_DIR/.vep/Plugins
chmod 777 $WGSA_DIR/.vep/homo_sapiens/94_GRCh37
chmod 777 $WGSA_DIR/.vep/homo_sapiens/94_GRCh38

Install LOFTEE LOF plugin for VEP API

cd $WGSA_DIR/.vep/Plugins
wget https://github.com/konradjk/loftee/archive/v0.1.1-beta.zip
unzip -j v0.1.1-beta.zip
rm v0.1.1-beta.zip
Note: See TracWiki for help on using the wiki.