Skip to content

Commit 436805f

Browse files
committed
WIP: Add rule to create column for accessions
1 parent a443038 commit 436805f

File tree

1 file changed

+18
-1
lines changed

1 file changed

+18
-1
lines changed

ingest/rules/curate.smk

Lines changed: 18 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -116,10 +116,27 @@ rule curate:
116116
--output-seq-field {params.sequence_field} ) 2>> {log}
117117
"""
118118

119+
rule add_metadata_columns:
120+
"""Add columns to metadata
121+
Notable columns:
122+
- [NEW] url: URL linking to the NCBI GenBank record ('https://www.ncbi.nlm.nih.gov/nuccore/*').
123+
"""
124+
input:
125+
metadata = "data/all_metadata.tsv"
126+
output:
127+
metadata = "data/all_metadata_added.tsv"
128+
shell:
129+
"""
130+
csvtk mutate2 -t \
131+
-n url \
132+
-e '"https://www.ncbi.nlm.nih.gov/nuccore/" + $accession' \
133+
{input.metadata} \
134+
> {output.metadata}
135+
"""
119136

120137
rule subset_metadata:
121138
input:
122-
metadata="data/all_metadata.tsv",
139+
metadata="data/all_metadata_added.tsv",
123140
output:
124141
subset_metadata="data/subset_metadata.tsv",
125142
params:

0 commit comments

Comments
 (0)