-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ingest: Derive URL column during ingest #80
Ingest: Derive URL column during ingest #80
Conversation
ingest/rules/curate.smk
Outdated
@@ -107,6 +107,8 @@ rule curate: | |||
--abbr-authors-field {params.abbr_authors_field} \ | |||
| augur curate apply-geolocation-rules \ | |||
--geolocation-rules {input.all_geolocation_rules} \ | |||
| jq -c --arg GENBANK "{params.id_field}" \ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If a workflow used strain
as the id_field
, this would create invalid URLs...Hmm, might be best to add a new config param (e.g. genbank_accession_field
) that is explicitly for this command.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah, good point, especially if we use id_field=strain
for segmented viruses (e.g. flu or do you have an example? I assume in that case strain name is in the header line of sequences.fasta instead of genbank) to match isolates across tangle trees.
I can add an explicit genbank_accession_field
here
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, added an explicit config['curate']['genbank_accession']
field during b2936f8
58a9a99
to
edeee12
Compare
edeee12
to
b2936f8
Compare
After discussion: #76 (comment) Moved away from the |
Description of proposed changes
Add URL column during ingest so the node call out works automatically. Attempting the
jq
methodRelated issue(s)
url
column for accessions #76Checklist