Skip to content

Commit

Permalink
chore: release 2025-01-30--18-05-53Z
Browse files Browse the repository at this point in the history
Updated datasets:

- H5Nx clade 2.3.4.4 (community/moncla-lab/iav-h5/ha/2.3.4.4)
- H5Nx (all clades) (community/moncla-lab/iav-h5/ha/all-clades)
- H5Nx clade 2.3.2.1 (community/moncla-lab/iav-h5/ha/2.3.2.1)
  • Loading branch information
nextstrain-bot committed Jan 30, 2025
1 parent 82d1600 commit b0d27df
Show file tree
Hide file tree
Showing 28 changed files with 501,254 additions and 9 deletions.
2 changes: 1 addition & 1 deletion data/community/moncla-lab/iav-h5/ha/2.3.2.1/CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
## Unreleased
## 2025-01-30T18:05:53Z

Updates to README, adding reference to the bioRxiv preprint on A(H5) Nextclade datasets (https://doi.org/10.1101/2025.01.07.631789) and changes for consistency with the preprint

Expand Down
2 changes: 1 addition & 1 deletion data/community/moncla-lab/iav-h5/ha/2.3.4.4/CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
## Unreleased
## 2025-01-30T18:05:53Z

Updates to README, adding reference to the bioRxiv preprint on A(H5) Nextclade datasets (https://doi.org/10.1101/2025.01.07.631789) and changes for consistency with the preprint

Expand Down
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
## Unreleased
## 2025-01-30T18:05:53Z

Updates to README, adding reference to the bioRxiv preprint on A(H5) Nextclade datasets (https://doi.org/10.1101/2025.01.07.631789) and changes for consistency with the preprint

Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
## 2025-01-30T18:05:53Z

Updates to README, adding reference to the bioRxiv preprint on A(H5) Nextclade datasets (https://doi.org/10.1101/2025.01.07.631789) and changes for consistency with the preprint

## 2024-05-08T11:39:52Z

Initial release for Nextclade v3!

Read more about Nextclade datasets in the documentation: https://docs.nextstrain.org/projects/nextclade/en/stable/user/datasets.html
Original file line number Diff line number Diff line change
@@ -0,0 +1,53 @@
# A(H5) clade 2.3.2.1 dataset with A/duck/Vietnam/NCVD-1584/2012 reference

| attribute | value |
| -------------------- | ---------------------------------------- |
| authors |[Jordan Ort](https://lmoncla.github.io/monclalab/team/JordanOrt/), [Louise Moncla](https://lmoncla.github.io/monclalab/team/LouiseMoncla/)|
| dataset name | A(H5) clade 2.3.2.1 (provisional) |
| reference strain | A/duck/Vietnam/NCVD-1584/2012(H5N1) |
| reference accession | EPI424984 |

## Authors and contacts

Maintained by: [Jordan Ort](https://lmoncla.github.io/monclalab/team/JordanOrt/)

With the help of: [Louise Moncla](https://lmoncla.github.io/monclalab/team/LouiseMoncla/), Samuel Shephard, Sonja Zolonski, Tommy Lam, Todd Davis, and Richard Neher

## Reference

See [Ort et al. (preprint)](https://doi.org/10.1101/2025.01.07.631789) for more details on the development of this dataset. If you use these A(H5) Nextclade datasets, please cite: Ort, J. T., Shepard, S. S., Zolnoski, S. A., Lam, T. T.-Y., Davis, T., Neher, R., & Moncla, L. H. (2025). Development of avian influenza A(H5) virus datasets for Nextclade enables rapid and accurate clade assignment. bioRxiv, https://doi.org/10.1101/2025.01.07.631789.

## Scope of this dataset

This dataset uses an A(H5) candidate vaccine virus (CVV) from clade `2.3.2.1` (A/duck/Vietnam/NCVD-1584/2012) as a reference and is suitable for the analysis of A(H5) sequences belonging to clade `2.3.2.1` and its sub-clades `2.3.2.1a` through `2.3.2.1g`. Sequences belonging to other clades cannot be annotated by this dataset and will be left `unassigned`.

## Features

This dataset supports

* Assignment to clades and subclades based on the provisional nomenclature defined by the FAO/WHO/WOAH H5 Nomenclature Working Group
* Sequence quality control (QC)
* Phylogenetic placement
* Annotations for glycosylation sties, HA cleavage site sequence, and presence/absence of a polybasic cleavage site

## Clades of A(H5) avian influenza viruses

The FAO/WHO/WOAH H5 Nomenclature Working Group define "clades" using HA gene seguences, and define clades as genetically distinct, monophyletic groups of viruses. Viruses falling into a given clade share a common ancestor with significant bootstrap support and have low levels of within-clade diversity. [Past nomenclature updates](https://onlinelibrary.wiley.com/doi/10.1111/irv.12324) have required viruses in the same clade to be monophyletic with bootstrap suppor of at least 60%, with within-clade pairwise distances of less than 1.5%. These requirements are sometimes relaxed, and clades are periodically updated to account for expanding viral diversity.

Clade `2.3.2.1` viruses and their descendants have circulated since 2007 and are endemic in Southeast Asia. These viruses have diversified into eight additional sub-clades, named `2.3.2.1a` through `2.3.2.1g` due to high circulating diversity within the clade.
This Nextclade dataset incorporates these provisional `2.3.2.1` sub-clades.

## Alternative, and complementary approaches for A(H5) clade assignment

Two additional tools exist for assigning clades to A(H5) viruses that accommodate the recent `2.3.2.1` clade splits.

1. [LABEL](https://wonder.cdc.gov/amd/flu/label/): this command-line tool is built and maintained by Sam Shepard, and performs clade assignment for all current `2.3.2.1` and `2.3.4.4` clade splits.
2. [BVBRC Subspecies Classification Tool](https://www.bv-brc.org/app/SubspeciesClassification): this is a drag and drop tool that classifies a variety of viruses, including influenza A H1N1, H3N2, and H5N1.

The clade assignments in this Nextclade dataset were validated against LABEL assignments and shown to be generally well-matched across subclades. The figure below shows a direct comparison of assignments for 2,406 HA sequences from GISAID, performed using LABEL and this NextClade dataset for clade `2.3.2.1` and its subclades.

![Figure 1: Comparison between LABEL and Nextclade for 2.3.2.1 assignments](https://raw.githubusercontent.com/moncla-lab/h5-nextclade/refs/heads/main/figures-for-dataset-readmes/2321.svg)

## What is Nextclade dataset

Read more about Nextclade datasets in Nextclade documentation: https://docs.nextstrain.org/projects/nextclade/en/stable/user/datasets.html
Binary file not shown.
Loading

0 comments on commit b0d27df

Please sign in to comment.