GitHub - gforg34/General_DNA_processing_scripts: This repository has different scripts related to the processing of DNA sequencing data

README file to give information about my scripts: Contact: [email protected] (external email)

picarding.sub --> script that tries to generate a dictionary genome file using the picard.jar software Note: picard path can be found only by using $PICARDPATH

Prefast.py --> python script (non-functional through the cluster/use it locally) to retreive the RNAseq data using prefetch (did it in my WUR thesis)

bam_statistics.sub --> bash script including multiple statistics using samtools: - samtools flagstat - samtools stats - samtools view - samtools checkquality

mummer_pipeline.sh --> runs mummer pipeline for seq alignnment: - samtools faidx - nucmer - mummerplot - show-diff - show-aligns

find_the_chimeric.sub --> bash script: uses the grep and awk functions to retreive the chimeric alignmens within the area of 37Mbp and 47Mbp of chromosome 2B(LSYQ02000004.1)

fill_the_Ns.sub --> bash script - tgsgapcall : to fill the NNNs of the Wild Emmer genome assembly using the long-read sequencing data I applied it using the TRI18485 southern Levant data against the WEW v2.0 genome

conda_script.sh --> download modules through the conda environment (my_env)

remove_dups.sub --> removes PCR duplicates from the sequencing data - picard tools Note: use the flagstats to check the PCR duplicates (generated: bam_statistics.sh

merge_fastqs.sub --> merges two fastq files, in order to be used in the tgsgapcloser script

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
genome_assembly		genome_assembly
pangenome_scripts		pangenome_scripts
transcriptomics		transcriptomics
README.md		README.md
conda_create_env_cp.sh		conda_create_env_cp.sh
conda_cutesv.sub		conda_cutesv.sub
merge_fastqs.sub		merge_fastqs.sub
move_files_with_rsync.sub		move_files_with_rsync.sub
mummer.sub		mummer.sub
picard_CreateSequenceDictionary.sub		picard_CreateSequenceDictionary.sub
remove_duplicates.sub		remove_duplicates.sub
run_Prefast.sub		run_Prefast.sub
run_Prefast_fastq_cp.sub		run_Prefast_fastq_cp.sub
run_python_scripts.sub		run_python_scripts.sub
tview.sh		tview.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

gforg34/General_DNA_processing_scripts

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages