Handling ‘NA’ Gene IDs and Annotation of GTF Files in ESPRESSO #42

Open
yycc9897 opened this issue Nov 21, 2023 · 1 comment

@yycc9897

Hello,
Thank you for developing ESPRESSO! As a newcomer to this field, I am seeking some guidance. I am working with nanopore sequencing data from a local pig breed. The reference genome annotation for this breed is not complete. I ran ESPRESSO and discovered a total of 29,494 transcripts, of which 4,924 have gene IDs labeled as ‘NA’. I am unsure how to proceed with these ‘NA’ gene ID transcripts. Could you provide some advice on this?
Additionally, I am considering using software like StringTie2 or FLAIR to annotate the GTF file prior to running ESPRESSO. Would this be a beneficial step, or is it unnecessary?
I greatly appreciate any advice or suggestions you can provide.
Best wishes.

@EricKutschera
Contributor

For transcripts that are not in the GTF, ESPRESSO tries to assign a gene ID by looking for splice junctions in that transcript that also appear in some transcript from the GTF. If ESPRESSO doesn't find a shared splice junction, it uses NA as the gene ID. For transcripts without a gene ID, you can check whether their coordinates are near anything in the GTF.
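
As a rough illustration of the idea (a minimal sketch, not ESPRESSO's actual code), the snippet below assigns a gene ID by shared splice junctions and, for transcripts that end up as NA, lists annotated genes within a window of the transcript's coordinates. The dictionary layout, the function names, the comma-joined result for multiple matching genes, and the 10 kb default window are all assumptions made for this example; in practice the exon coordinates would be parsed from the GTF files.

```python
from collections import defaultdict

def junctions(chrom, strand, exons):
    """Return the splice junctions (exon end, next exon start) of a transcript."""
    exons = sorted(exons)
    return {(chrom, strand, left_end, right_start)
            for (_, left_end), (right_start, _) in zip(exons, exons[1:])}

def build_junction_index(annotated):
    """Map each annotated junction to the gene IDs whose transcripts use it."""
    index = defaultdict(set)
    for tx in annotated:
        for j in junctions(tx["chrom"], tx["strand"], tx["exons"]):
            index[j].add(tx["gene_id"])
    return index

def assign_gene_id(tx, index):
    """Return gene ID(s) sharing a junction with tx, or "NA" if none does."""
    genes = set()
    for j in junctions(tx["chrom"], tx["strand"], tx["exons"]):
        genes |= index.get(j, set())
    # Joining several matches with commas is an arbitrary choice for this sketch.
    return ",".join(sorted(genes)) if genes else "NA"

def nearby_genes(tx, annotated, window=10_000):
    """List annotated genes on the same chromosome within `window` bp of tx."""
    tx_start = min(s for s, _ in tx["exons"])
    tx_end = max(e for _, e in tx["exons"])
    hits = set()
    for ref in annotated:
        if ref["chrom"] != tx["chrom"]:
            continue
        ref_start = min(s for s, _ in ref["exons"])
        ref_end = max(e for _, e in ref["exons"])
        # Intervals overlap once the novel transcript is padded by `window`.
        if ref_start <= tx_end + window and ref_end >= tx_start - window:
            hits.add(ref["gene_id"])
    return sorted(hits)

# Hypothetical toy data: one annotated transcript and two novel ones.
annotated = [{"gene_id": "GENE_A", "chrom": "chr1", "strand": "+",
              "exons": [(100, 200), (300, 400), (500, 600)]}]
novel_shared = {"chrom": "chr1", "strand": "+",
                "exons": [(150, 200), (300, 380)]}
novel_unshared = {"chrom": "chr1", "strand": "+",
                  "exons": [(5000, 5100), (5200, 5300)]}

index = build_junction_index(annotated)
print(assign_gene_id(novel_shared, index))    # GENE_A
print(assign_gene_id(novel_unshared, index))  # NA
print(nearby_genes(novel_unshared, annotated))  # ['GENE_A']
```

A nearby gene found this way is only a candidate: whether an NA transcript belongs to it, or represents an unannotated gene, still needs to be judged case by case.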

Generating a GTF with another tool and then giving that GTF to ESPRESSO might help. ESPRESSO doesn't require a GTF file, but if one is provided, it treats the transcripts and splice junctions in that file as high confidence.
