Commit

Merge pull request #77 from nextstrain/properly-handle-tsvs
ingest: properly handle TSVs with `csvtk`
joverlee521 authored Dec 9, 2024
2 parents f55c6c2 + 241b1bb commit aa3dd48
Showing 3 changed files with 4 additions and 6 deletions.
2 changes: 1 addition & 1 deletion ingest/rules/curate.smk
@@ -126,6 +126,6 @@ rule subset_metadata:
 metadata_fields=",".join(config["curate"]["metadata_columns"]),
 shell:
 """
-tsv-select -H -f {params.metadata_fields} \
+csvtk cut -t -f {params.metadata_fields} \
 {input.metadata} > {output.subset_metadata}
 """
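
For readers less familiar with `csvtk cut -t -f`, the column selection performed by `subset_metadata` can be approximated in Python. This is only an illustrative sketch and not part of the commit: the function name `cut_columns` and the sample data are hypothetical, and the sketch assumes the selected columns should be written in the order they are requested.

```python
import csv
import io


def cut_columns(tsv_text, fields):
    """Keep only the named columns of a TSV that has a header row.

    Hypothetical helper for illustration; it is not csvtk itself.
    """
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    # Fail loudly if a requested column is absent from the header.
    missing = [f for f in fields if f not in (reader.fieldnames or [])]
    if missing:
        raise KeyError(f"columns not found in header: {missing}")

    out = io.StringIO()
    writer = csv.DictWriter(
        out,
        fieldnames=fields,        # output order follows the requested order
        delimiter="\t",
        extrasaction="ignore",    # silently drop columns that were not requested
        lineterminator="\n",
    )
    writer.writeheader()
    for row in reader:
        writer.writerow(row)
    return out.getvalue()


if __name__ == "__main__":
    sample = "accession\tstrain\tdate\nA1.1\tx\t2024-01-01\n"
    print(cut_columns(sample, ["date", "accession"]), end="")
```

In the workflow itself, the field list comes from `config["curate"]["metadata_columns"]`, joined with commas as shown in the rule's `params`.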
6 changes: 2 additions & 4 deletions ingest/rules/fetch_from_ncbi.smk
@@ -97,11 +97,9 @@ rule format_ncbi_dataset_report:
 --fields {params.ncbi_datasets_fields:q} \
 --elide-header \
 | csvtk fix-quotes -Ht \
-| csvtk add-header -t -l -n {params.ncbi_datasets_fields:q} \
+| csvtk add-header -t -n {params.ncbi_datasets_fields:q} \
 | csvtk rename -t -f accession -n accession_version \
-| csvtk -t mutate -f accession_version -n accession -p "^(.+?)\." \
-| csvtk del-quotes -t \
-| tsv-select -H -f accession --rest last \
+| csvtk -t mutate -f accession_version -n accession -p "^(.+?)\." --at 1 \
 > {output.ncbi_dataset_tsv}
 """
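
The `mutate` step above derives an unversioned `accession` from `accession_version` using the capture group in `^(.+?)\.`, and `--at 1` places the new column first, which is what the removed `tsv-select -H -f accession --rest last` step previously did. A rough Python equivalent is sketched below. It is an illustration only: the function name `add_accession_column` and the sample accession are hypothetical, and falling back to the original value when the pattern does not match is an assumption about the default behaviour, not something this diff confirms.

```python
import csv
import io
import re


def add_accession_column(tsv_text, source="accession_version",
                         new="accession", pattern=r"^(.+?)\."):
    """Insert a new first column holding capture group 1 of `pattern`.

    Hypothetical helper for illustration; it is not csvtk itself.
    """
    regex = re.compile(pattern)
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t")
    header = next(reader)
    idx = header.index(source)

    out = io.StringIO()
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerow([new] + header)  # new column goes in position 1
    for row in reader:
        match = regex.search(row[idx])
        # Assumption: keep the original value when the pattern does not match.
        value = match.group(1) if match else row[idx]
        writer.writerow([value] + row)
    return out.getvalue()


if __name__ == "__main__":
    sample = "accession_version\tlength\nOQ123456.1\t1500\n"
    print(add_accession_column(sample), end="")
```

Because `.+?` is lazy, the capture stops at the first dot, so everything from the first `.` onward is dropped from the derived value.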

2 changes: 1 addition & 1 deletion ingest/rules/nextclade.smk
@@ -77,7 +77,7 @@ rule nextclade_metadata:
 --id-column {params.nextclade_id_field:q} \
 --field-map {params.nextclade_field_map:q} \
 --output-metadata - \
-| tsv-select --header --fields {params.nextclade_fields:q} \
+| csvtk cut -t --fields {params.nextclade_fields:q} \
 > {output.nextclade_metadata:q}
 """

