Let's try to use regtools for this.
-
regtools junctions-extract function: https://regtools.readthedocs.io/en/latest/commands/junctions-extract/
-
parameters to specify:
-m 20 (should match the min intron parameter from hisat)
-s 0 (let's assume unstranded for now)
o [gggsss]_[assemblyname]_downsampled-RNA-Seq-alignments_[date].bed (output file should have the same name prefix as everything else, but with .bed extension
- input file should be the final, merged, indexed bam file
-
the output file needs to be moved over to our servers, and added to trackList.json.
perl flatfile-to-json.pl --bed OUTPUT-BED-FILE --trackLabel '[gggsss]_[assemblyname]_RNA-Seq-alignments_[date]_junctions' --config '{"style":{"showLabels": false}, "metadata": {BED METADATA BELOW}, "category":"RNA-Seq/Intronic splice junctions" }' --className feature3
BED METADATA
"Analysis provider": "i5k Workspace@NAL",
"Analysis method": "https://github.com/NAL-i5K/NAL_RNA_seq_annotation_pipeline/",
"Data source":"[comma-delimited SRA ACCESSIONS from 'Submission' column in .tsv file]",
"Publication status":"Analysis: NA; Source data: see individual SRA accessions",
"Track legend":"Intronic junction reads generated by Hisat2 aligner and regtools"
Let's try to use regtools for this.
regtools junctions-extract function: https://regtools.readthedocs.io/en/latest/commands/junctions-extract/
parameters to specify:
-m 20(should match the min intron parameter from hisat)-s 0(let's assume unstranded for now)o [gggsss]_[assemblyname]_downsampled-RNA-Seq-alignments_[date].bed(output file should have the same name prefix as everything else, but with .bed extensionthe output file needs to be moved over to our servers, and added to trackList.json.
perl flatfile-to-json.pl --bed OUTPUT-BED-FILE --trackLabel '[gggsss]_[assemblyname]_RNA-Seq-alignments_[date]_junctions' --config '{"style":{"showLabels": false}, "metadata": {BED METADATA BELOW}, "category":"RNA-Seq/Intronic splice junctions" }' --className feature3BED METADATA
"Analysis provider": "i5k Workspace@NAL",
"Analysis method": "https://github.com/NAL-i5K/NAL_RNA_seq_annotation_pipeline/",
"Data source":"[comma-delimited SRA ACCESSIONS from 'Submission' column in .tsv file]",
"Publication status":"Analysis: NA; Source data: see individual SRA accessions",
"Track legend":"Intronic junction reads generated by Hisat2 aligner and regtools"