-
Notifications
You must be signed in to change notification settings - Fork 27
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
chore: release 2025-01-30--18-05-53Z
Updated datasets: - H5Nx clade 2.3.4.4 (community/moncla-lab/iav-h5/ha/2.3.4.4) - H5Nx (all clades) (community/moncla-lab/iav-h5/ha/all-clades) - H5Nx clade 2.3.2.1 (community/moncla-lab/iav-h5/ha/2.3.2.1)
- Loading branch information
1 parent
82d1600
commit b0d27df
Showing
28 changed files
with
501,254 additions
and
9 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
9 changes: 9 additions & 0 deletions
9
...utput/community/moncla-lab/iav-h5/ha/2.3.2.1/2025-01-30--18-05-53Z/CHANGELOG.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,9 @@ | ||
## 2025-01-30T18:05:53Z | ||
|
||
Updates to README, adding reference to the bioRxiv preprint on A(H5) Nextclade datasets (https://doi.org/10.1101/2025.01.07.631789) and changes for consistency with the preprint | ||
|
||
## 2024-05-08T11:39:52Z | ||
|
||
Initial release for Nextclade v3! | ||
|
||
Read more about Nextclade datasets in the documentation: https://docs.nextstrain.org/projects/nextclade/en/stable/user/datasets.html |
53 changes: 53 additions & 0 deletions
53
data_output/community/moncla-lab/iav-h5/ha/2.3.2.1/2025-01-30--18-05-53Z/README.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,53 @@ | ||
# A(H5) clade 2.3.2.1 dataset with A/duck/Vietnam/NCVD-1584/2012 reference | ||
|
||
| attribute | value | | ||
| -------------------- | ---------------------------------------- | | ||
| authors |[Jordan Ort](https://lmoncla.github.io/monclalab/team/JordanOrt/), [Louise Moncla](https://lmoncla.github.io/monclalab/team/LouiseMoncla/)| | ||
| dataset name | A(H5) clade 2.3.2.1 (provisional) | | ||
| reference strain | A/duck/Vietnam/NCVD-1584/2012(H5N1) | | ||
| reference accession | EPI424984 | | ||
|
||
## Authors and contacts | ||
|
||
Maintained by: [Jordan Ort](https://lmoncla.github.io/monclalab/team/JordanOrt/) | ||
|
||
With the help of: [Louise Moncla](https://lmoncla.github.io/monclalab/team/LouiseMoncla/), Samuel Shephard, Sonja Zolonski, Tommy Lam, Todd Davis, and Richard Neher | ||
|
||
## Reference | ||
|
||
See [Ort et al. (preprint)](https://doi.org/10.1101/2025.01.07.631789) for more details on the development of this dataset. If you use these A(H5) Nextclade datasets, please cite: Ort, J. T., Shepard, S. S., Zolnoski, S. A., Lam, T. T.-Y., Davis, T., Neher, R., & Moncla, L. H. (2025). Development of avian influenza A(H5) virus datasets for Nextclade enables rapid and accurate clade assignment. bioRxiv, https://doi.org/10.1101/2025.01.07.631789. | ||
|
||
## Scope of this dataset | ||
|
||
This dataset uses an A(H5) candidate vaccine virus (CVV) from clade `2.3.2.1` (A/duck/Vietnam/NCVD-1584/2012) as a reference and is suitable for the analysis of A(H5) sequences belonging to clade `2.3.2.1` and its sub-clades `2.3.2.1a` through `2.3.2.1g`. Sequences belonging to other clades cannot be annotated by this dataset and will be left `unassigned`. | ||
|
||
## Features | ||
|
||
This dataset supports | ||
|
||
* Assignment to clades and subclades based on the provisional nomenclature defined by the FAO/WHO/WOAH H5 Nomenclature Working Group | ||
* Sequence quality control (QC) | ||
* Phylogenetic placement | ||
* Annotations for glycosylation sties, HA cleavage site sequence, and presence/absence of a polybasic cleavage site | ||
|
||
## Clades of A(H5) avian influenza viruses | ||
|
||
The FAO/WHO/WOAH H5 Nomenclature Working Group define "clades" using HA gene seguences, and define clades as genetically distinct, monophyletic groups of viruses. Viruses falling into a given clade share a common ancestor with significant bootstrap support and have low levels of within-clade diversity. [Past nomenclature updates](https://onlinelibrary.wiley.com/doi/10.1111/irv.12324) have required viruses in the same clade to be monophyletic with bootstrap suppor of at least 60%, with within-clade pairwise distances of less than 1.5%. These requirements are sometimes relaxed, and clades are periodically updated to account for expanding viral diversity. | ||
|
||
Clade `2.3.2.1` viruses and their descendants have circulated since 2007 and are endemic in Southeast Asia. These viruses have diversified into eight additional sub-clades, named `2.3.2.1a` through `2.3.2.1g` due to high circulating diversity within the clade. | ||
This Nextclade dataset incorporates these provisional `2.3.2.1` sub-clades. | ||
|
||
## Alternative, and complementary approaches for A(H5) clade assignment | ||
|
||
Two additional tools exist for assigning clades to A(H5) viruses that accommodate the recent `2.3.2.1` clade splits. | ||
|
||
1. [LABEL](https://wonder.cdc.gov/amd/flu/label/): this command-line tool is built and maintained by Sam Shepard, and performs clade assignment for all current `2.3.2.1` and `2.3.4.4` clade splits. | ||
2. [BVBRC Subspecies Classification Tool](https://www.bv-brc.org/app/SubspeciesClassification): this is a drag and drop tool that classifies a variety of viruses, including influenza A H1N1, H3N2, and H5N1. | ||
|
||
The clade assignments in this Nextclade dataset were validated against LABEL assignments and shown to be generally well-matched across subclades. The figure below shows a direct comparison of assignments for 2,406 HA sequences from GISAID, performed using LABEL and this NextClade dataset for clade `2.3.2.1` and its subclades. | ||
|
||
 | ||
|
||
## What is Nextclade dataset | ||
|
||
Read more about Nextclade datasets in Nextclade documentation: https://docs.nextstrain.org/projects/nextclade/en/stable/user/datasets.html |
Binary file added
BIN
+29.3 MB
data_output/community/moncla-lab/iav-h5/ha/2.3.2.1/2025-01-30--18-05-53Z/dataset.zip
Binary file not shown.
Oops, something went wrong.