Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Annotate papers with source organism #10

Open
blakesweeney opened this issue Jun 13, 2022 · 1 comment
Open

Annotate papers with source organism #10

blakesweeney opened this issue Jun 13, 2022 · 1 comment

Comments

@blakesweeney
Copy link
Member

We can mark all papers by what organism the paper is about. This will be key to integrating this into miRBase as they use this in their current approach (https://academic.oup.com/nar/article/47/D1/D155/5179337?login=false). This is needed because some miRNA names are not specific to a species. This means simple text mining may well match something from a different species than the intended species. To get around this they use the organism database (https://organisms.jensenlab.org/Search) which contains a mapping of taxid -> paper. We should import and annotate this. This would then be provided as a facet in the widget as well.

@carlosribas
Copy link
Contributor

I started this task and at least for now I am loading the data from the csv file into a table in the database (litscan_load_organism). That done, for the articles that LitScan found that have pmid, I check which organisms were annotated. The results are saved in the litscan_organism table.

I'm going to review this process and think about the best way to automate it, but I imagine it will be something similar to what we already do with the rest of the RNAcentral data.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants