Add lineages coloring from https://dengue-lineages.org/ #93

Open
wants to merge 8 commits into main

j23414
Contributor

@j23414 j23414 commented Feb 14, 2025

Description of proposed changes

This PR updates the dengue lineages to match https://dengue-lineages.org in response to feedback, including a Slack message and several issues. Please see the following issue for further context:

The trees with updated lineage designations have been pushed to a staged site, linked in the table below as a convenience to PR reviewers.

The staged builds below use the Verity Hill, 2024 lineage coloring:

| serotype | genome | E |
| --- | --- | --- |
| all | all/genome | all/E |
| denv1 | denv1/genome | denv1/E |
| denv2 | denv2/genome | denv2/E |
| denv3 | denv3/genome | denv3/E |
| denv4 | denv4/genome | denv4/E |

Metadata with the new columns "genotype", "major_lineage", and "minor_lineage" is available at: https://data.nextstrain.org/files/workflows/dengue/trials/20250214hill/metadata_all.tsv.zst
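For reviewers who want to spot-check the new columns, here is a minimal sketch that tallies the values in "genotype", "major_lineage", and "minor_lineage" from a decompressed copy of the metadata. The function name `summarize_lineage_columns` and the sample rows are hypothetical, and the way the sample values are split across the three columns is an assumption for illustration only.

```python
import csv
import io
from collections import Counter

# The three columns this PR adds to the metadata.
LINEAGE_COLUMNS = ("genotype", "major_lineage", "minor_lineage")


def summarize_lineage_columns(tsv_text):
    """Return {column: Counter of values} for the three lineage columns.

    Hypothetical helper: expects the already-decompressed TSV as a string.
    """
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    missing = [c for c in LINEAGE_COLUMNS if c not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"metadata is missing columns: {missing}")
    counts = {c: Counter() for c in LINEAGE_COLUMNS}
    for row in reader:
        for column in LINEAGE_COLUMNS:
            counts[column][row[column]] += 1
    return counts


if __name__ == "__main__":
    # Made-up rows for illustration; not taken from the real metadata file.
    sample = (
        "strain\tgenotype\tmajor_lineage\tminor_lineage\n"
        "sample_1\t4II\t4II_A\t4II_A.1\n"
        "sample_2\t4II\t4II_A\t4II_A.2\n"
    )
    for column, counter in summarize_lineage_columns(sample).items():
        print(column, dict(counter))
```

The published file is zstd-compressed, so it would need to be decompressed first (for example with `zstd -d metadata_all.tsv.zst`) before its text is passed to the function.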

dengue-lineages.org displays lineages using distinct colors for Genotype, Major, and Minor lineages, as illustrated in the diagram below with dengue 3 as an example:

I have implemented a similar distinct lineage color scheme in our tree visualization, also showing dengue 3 below as an example:

dengue3_genotypemajorminor

Related issue(s)

Checklist

  • Checks pass

[edit: added on 2025-02-18, after emailing Grubaugh et al.]

Results:

  • 12,012 of 12,137 records matched (99% accuracy)
  • 67 records mismatched, but most were a child or parent of the correct call: dengue_mismatch.txt
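The child/parent distinction in these results can be sketched as a small classifier. This is not the script used for the validation; it assumes that a lineage name extends its parent with a `_` or `.` separator (as in 4II_A.1), and the names `is_ancestor`, `classify_call`, and `summarize` are hypothetical.

```python
from collections import Counter


def is_ancestor(ancestor, descendant):
    """True if `descendant` extends `ancestor` by a '_' or '.' separated suffix.

    Assumption: names are hierarchical, e.g. 4II -> 4II_A -> 4II_A.1.
    Requiring the separator keeps 4II_A.1 from matching 4II_A.10.
    """
    return any(descendant.startswith(ancestor + sep) for sep in ("_", "."))


def classify_call(expected, observed):
    """Classify one observed lineage call against the expected one."""
    if observed == expected:
        return "match"
    if is_ancestor(expected, observed):
        return "child"   # observed call is more specific than expected
    if is_ancestor(observed, expected):
        return "parent"  # observed call is less specific than expected
    return "other"


def summarize(pairs):
    """Count match/child/parent/other over (expected, observed) pairs."""
    return Counter(classify_call(expected, observed) for expected, observed in pairs)


if __name__ == "__main__":
    # Illustrative pairs only; the last one mirrors the OK040058 mismatch.
    pairs = [
        ("4II_A.1", "4II_A.1"),
        ("4II_A", "4II_A.1"),
        ("4II_A.1", "4II_A"),
        ("4II_A.1", "1I_B"),
    ]
    print(summarize(pairs))
```

Calls that land in "other", such as a mismatch across serotypes, are the ones that most likely warrant manual investigation.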

[edit: added on 2025-02-20]

  • Several of us investigated the mismatch for OK040058 (4II_A.1 vs 1I_B). After some digging, it turns out that version 1 was classified as a dengue 4 sequence and version 2 as a dengue 1 sequence.

  • Validate lineage calls against references listed in https://github.com/DENV-lineages/lineages-paper

@j23414 j23414 linked an issue Feb 14, 2025 that may be closed by this pull request
@j23414 j23414 force-pushed the 2025/hill_nextclade_dataset branch 2 times, most recently from 65e24f8 to 7d0e7b7 Compare February 14, 2025 22:32
@j23414 j23414 force-pushed the 2025/hill_nextclade_dataset branch from 7d0e7b7 to e0e3281 Compare February 14, 2025 23:17
@j23414 j23414 changed the title WIP: Add lineages coloring from https://dengue-lineages.org/ Add lineages coloring from https://dengue-lineages.org/ Feb 18, 2025
@j23414 j23414 marked this pull request as ready for review February 18, 2025 18:32
@j23414 j23414 force-pushed the 2025/hill_nextclade_dataset branch from 3535b0f to d08c4bc Compare February 18, 2025 23:24
Contributor

@joverlee521 joverlee521 left a comment

Totally support using existing community Nextclade datasets developed by those maintaining the dengue lineage system! As I've said in our discussions, I'm slightly worried that the community Nextclade datasets might not be maintained indefinitely, but we can tackle that when/if the time comes.

I only left a few non-blocking comments from a high-level review.

Successfully merging this pull request may close these issues.

Add lineages from https://dengue-lineages.org/
3 participants