Repo for bioinformatics bootcamp fall 2020
Here's the protocol for setting up the lab machines:
- IMS sets up an Azure lab template
- Change the password to something easier to remember...
- Start the template and connect via ssh
- Install conda (saying 'yes' when prompted):
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash Miniconda3-latest-Linux-x86_64.sh
- Use conda to install
salmon
,refgenie
, andsra-tools
:
conda install -c bioconda -c conda-forge salmon sra-tools refgenie
- Pull the hg38 Salmon index
export REFGENIE='genome_config.yaml'
refgenie init -c $REFGENIE
refgenie pull hg38/salmon_partial_sa_index
- Download SRA file and convert to fastq
prefetch -O SRR1039521/ SRR1039521
fasterq-dump -e 14 -p -O SRR1039521/ SRR1039521/SRR1039521/SRR1039521.sra
- Align to transcriptome using
salmon
salmon quant -l A -1 SRR1039521/SRR1039521.sra_1.fastq -2 SRR1039521/SRR1039521.sra_2.fastq -i hg38/salmon_partial_sa_index/default/ -p 14 -o Salmon.out/SRR1039521
- To turn on all the machines before lecture (in powershell):
az account set --subscription research
az vm start --ids $(az vm list --resource-group m2600-workshop-basic-rg --query "[].id" -o tsv)
- To export the mahine IP addresses to a csv:
$results = (az network public-ip list --resource-group m2600-workshop-basic-rg --query "[].{name: name, address: ipAddress}" -o json) |ConvertFrom-Json
$results|Export-Csv -Path vmlist.csv -NoTypeInformation
- Sign in:
Username: labadmin
Password: 1234Temp1234
- To power off the machines
az vm deallocate --ids $(az vm list --resource-group m2600-workshop-basic-rg --query "[].id" -o tsv)