Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Uninitialized value errors when running JUM_C.sh #30

Open
lindayqlin opened this issue May 5, 2020 · 8 comments
Open

Uninitialized value errors when running JUM_C.sh #30

lindayqlin opened this issue May 5, 2020 · 8 comments

Comments

@lindayqlin
Copy link

lindayqlin commented May 5, 2020

Hello,

I am encountering a lot of uninitialized value errors when running JUM_C.sh and am not sure what is wrong. These include:

  • in print at /DATA5/JUM/JUM_2.0.2/identify_gene_name_for_JUM_output_3.pl line 45, line 1278 on
  • Lots of things in /media/nkaplin1/DATA5/JUM/JUM_2.0.2/final_process_MXE_output.pl lines 191-262, line 1
  • $junctionID[10] in pattern match (m//) at /DATA5/JUM/JUM_2.0.2/identify_gene_name_for_JUM_output_1.pl line 50, line 428/435
  • in string eq at /DATA5/JUM/JUM_2.0.2/identify_gene_name_for_JUM_output_1.pl line 53, line 428/435
  • in print at /DATA5/JUM/JUM_2.0.2/identify_gene_name_for_JUM_output_2.pl line 60
  • in print at /DATA5/JUM/JUM_2.0.2/identify_gene_name_for_JUM_output_3.pl line 45, line 2 on

I am trying to use JUM to identify IR events. More than half of the identified IR events have gene="NONE" or even nothing (empty string?) in the final IR output. This behavior is not exactly expected because IR events should occur within genes, and some of these locations do map onto genes when inspected. I am guessing that these issues might be related to each other?

I tried to follow all the directions in the manual for v2.0.2 and ran the scripts as in the examples provided. I did not encounter any errors until this step, and I believe my refFlat file matches the format specifications. Would you be able to help me troubleshoot this please?

Thank you very much in advance!

@qqwang-berkeley
Copy link
Owner

qqwang-berkeley commented May 11, 2020 via email

@lindayqlin
Copy link
Author

Hi Qingqing,

Thank you for the reply! The first 50 lines are attached:

refFlat.txt

@qqwang-berkeley
Copy link
Owner

qqwang-berkeley commented May 14, 2020 via email

@lindayqlin
Copy link
Author

Hi Qingqing,

I see. Sure -- are these the input files that you need?
input_files.zip

I ran JUM_C.sh using the command
nohup bash /DATA5/JUM/JUM_2.0.2/JUM_C.sh --Folder /DATA5/JUM/JUM_2.0.2 --Test pvalue --Cutoff 0.05 --TotalCondition1FileNum 3 --TotalCondition2FileNum 3 --REF /DATA5/linda_JUM/Araport11_genePred_refFlat_formatted.txt > JUM_C.out &
from inside JUM_diff/FINAL_JUM_OUTPUT_pvalue_0.05/

Thank you for your help!

@qqwang-berkeley
Copy link
Owner

qqwang-berkeley commented May 25, 2020 via email

@lindayqlin
Copy link
Author

No worries. Thank you for all your help!

That worked well for me. I didn't realize that there was this discrepancy -- thanks for pointing it out.

I still get some (although way fewer) "NONE" results in the IR output. I inspected these visually and it appears that approximately half are annotated as exons/UTRs rather than introns, while a few do correspond to introns but didn't get identified as such. Do you know why this might be? (The rest are not annotated so "NONE" seems fitting for those.)

One other question for you: what is the criteria for calling "INF" for delta PSI?

Thank you in advance!

@qqwang-berkeley
Copy link
Owner

qqwang-berkeley commented Jun 18, 2020 via email

@lindayqlin
Copy link
Author

Apologies for the delayed response. Here are some IGV screenshots. They are named as comment_chr_pos. (I checked the expanded gene model track and there are no alternative isoforms with introns at these sites.)
None.zip

I'm still a bit unclear on the "INF" results. You write in the manual that IR "deltaPSI = intron_inclusion_isoform / (intron_inclusion_isoform+intron_exclusion_isoform) (under control) - intron_inclusion_isoform / (intron_inclusion_isoform+intron_exclusion_isoform) (under treatment)." Are you saying that (intron_inclusion_isoform+intron_exclusion_isoform) would be zero under a condition? (No gene expression whatsoever? If so, would it be possible to just make the fraction zero?) I'm not sure if I'm understanding this correctly.

Thank you in advance!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants