Releases · nextstrain/augur

18 Nov 23:51

github-actions

32.1.0

58f8b1a

32.1.0 Latest

These release notes are automatically extracted from the full changelog.

Features

augur.io.read_metadata: Added a new parameter keep_id_as_column to keep the resolved id column as a column in addition to setting it as the DataFrame index. #1917 (@victorlin)
subsample: Filepaths in the config file can now be relative to the config file's parent directory in addition to the current working directory. Custom directories can also be specified using a new command line option --search-paths or environment variable AUGUR_SEARCH_PATHS. #1897 (@victorlin)
A helper function – augur.subsample.get_referenced_files – has been added to optimize usage of augur subsample in Snakemake workflows. This is experimental and not yet part of the public API. #1918 (@victorlin)
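
The path lookup described for subsample can be pictured with a small sketch. This is only an illustration under stated assumptions, not Augur's implementation: `resolve_config_path` is a hypothetical helper, and the lookup order shown (config file's parent directory, then the current working directory, then any custom search paths) is assumed for the example.

```python
from pathlib import Path
from typing import Iterable, Optional


def resolve_config_path(filepath: str, config_file: str,
                        search_paths: Iterable[str] = ()) -> Optional[Path]:
    """Return the first existing candidate for `filepath`, or None.

    Hypothetical helper. The lookup order below is an assumption made for
    illustration and may differ from what augur subsample actually does.
    """
    path = Path(filepath)
    if path.is_absolute():
        return path if path.exists() else None

    candidate_dirs = [
        Path(config_file).resolve().parent,  # the config file's parent directory
        Path.cwd(),                          # the current working directory
        *(Path(p) for p in search_paths),    # custom directories
    ]
    for directory in candidate_dirs:
        candidate = directory / path
        if candidate.exists():
            return candidate
    return None
```

In this sketch, `search_paths` stands in for directories supplied through `--search-paths` or `AUGUR_SEARCH_PATHS`; how Augur parses those values is not shown here.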

Bug fixes

filter: Previously, --query, --exclude-where, and --include-where did not work for the id column (strain, name, or other from --metadata-id-columns). This has been fixed. #1915 (@corneliusroemer, @victorlin)
export v2: Support export of URLs for non-string values. #1926 (@joverlee521)

Contributors

victorlin, corneliusroemer, and joverlee521

21 Oct 23:36

github-actions

32.0.0

b043eff

32.0.0

Major Changes

ancestral, translate: These will now error when the length of any reference gene is indivisible by 3, instead of silently padding with N to translate to 'X'. #1895 (@victorlin)
augur.utils.load_features is deprecated and will be removed in a future major version. Users should use augur.io.load_features instead. #1912 (@victorlin)
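
The stricter gene-length behavior in ancestral and translate amounts to a check like the one below. This is a minimal sketch: `check_gene_length` is a hypothetical name, and the exception type and message Augur actually uses may differ.

```python
def check_gene_length(name: str, sequence: str) -> None:
    """Raise ValueError if the gene cannot be split into whole codons.

    Hypothetical helper for illustration only.
    """
    remainder = len(sequence) % 3
    if remainder != 0:
        raise ValueError(
            f"Gene {name!r} has length {len(sequence)}, which is not "
            f"divisible by 3 ({remainder} trailing nucleotide(s))."
        )


check_gene_length("ok_gene", "ATGAAATAA")  # 9 nucleotides, passes silently
```

Failing early like this surfaces annotation problems in the reference instead of hiding them behind a trailing 'X' in the translation.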

Features

augur curate apply-record-annotations will now warn if an annotation was unnecessary, often indicative of the upstream data being updated. #1893 (@jameshadfield)

Contributors

jameshadfield and victorlin

17 Sep 23:30

github-actions

31.5.0

5aab0d5

31.5.0

A new command, augur subsample, supports complex subsampling using file-based configuration. See the updated Filtering and Subsampling guide for a comparison with augur filter. #635 (@victorlin)

Contributors

victorlin

14 Aug 20:44

github-actions

31.4.0

3ee0c31

31.4.0

Features

schema: Allow parentheses (()) in gene names. #1819 (@kimandrews)
geolocation rules: Add rules to define region per country to ensure that regions are labelled for all countries. This is especially useful for data sources that do not include region in the metadata. #1844 (@joverlee521)
Added support for NumPy v2 in addition to v1. #1855 (@corneliusroemer)
Added support for Python 3.13. #1857 (@corneliusroemer)
tree: Prefer iqtree3 binary over iqtree2 and iqtree when available. #1875 (@joverlee521)
export v2: URLs encoded in metadata (both TSV and node-data JSONs) will be associated with the value in the exported JSON. Given a column/key <X> then a valid URL in a column/key named <X>__url will be automatically used. This allows values to be a clickable link when viewed in Auspice. #1852 (@jameshadfield)
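
The `<X>__url` convention from the export v2 item can be illustrated with a short sketch. It is not Augur's code: `attach_urls` and `is_valid_url` are hypothetical helpers, the `{"value": ..., "url": ...}` output shape is an assumption about how a value and its link might be paired, and "valid URL" is simplified here to an http(s) URL with a host.

```python
from urllib.parse import urlparse

URL_SUFFIX = "__url"


def is_valid_url(text: str) -> bool:
    """Simplified validity check (assumption): http(s) scheme plus a host."""
    parsed = urlparse(text)
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)


def attach_urls(record: dict) -> dict:
    """Pair each key <X> with the URL found under <X>__url, if it is valid."""
    result = {}
    for key, value in record.items():
        if key.endswith(URL_SUFFIX):
            continue  # URL columns are consumed by their base column
        entry = {"value": value}
        url = record.get(key + URL_SUFFIX, "")
        if isinstance(url, str) and is_valid_url(url):
            entry["url"] = url
        result[key] = entry
    return result


row = {
    "accession": "MN908947",
    "accession__url": "https://www.ncbi.nlm.nih.gov/nuccore/MN908947",
    "country": "China",
}
linked = attach_urls(row)
```

Here `linked["accession"]` carries both the value and its URL, while `linked["country"]` carries only the value because no `country__url` key is present.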

Bug fixes

filter: Improved speed of using --group-by month on large datasets. #1845 (@victorlin)
merge: Added validation to require at least two sequence inputs for merging, consistent with metadata merging behavior. #1865 (@victorlin)
validate: Send all log messages to stderr. #1869 (@victorlin)
validate: Only print the entire merged Auspice config to stderr when there's a validation error. #1878 (@joverlee521)

Contributors

jameshadfield, victorlin, and 3 other contributors

03 Jul 21:25

github-actions

31.3.0

6f5cc08

31.3.0

Features

traits: Added new options --branch-labels and --branch-confidence to export branch labels for nodes which have a corresponding state change. These are useful for creating streamtrees which convey geographic jumps. #1814 (@jameshadfield)
filter, merge: Added a new option --nthreads to configure parallelism. Right now, it is only passed to SeqKit, but it may be used for other internal optimizations in the future. #1833 (@victorlin)
filter: Added a new option --skip-checks to bypass checks for duplicates in sequences and whether ids in metadata have a sequence entry. Mainly useful when working with larger files. #1833 (@victorlin)
Added a new AUGUR_PROFILE environment variable. If set, Augur will run with Python's cProfile profiler and save results to the value which should be a file path. This may result in slightly slower run times, and should only be used for debugging purposes. #1835 (@victorlin)
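
The idea behind an environment variable that switches on profiling can be sketched with the standard library's `cProfile`. This is an assumed pattern for illustration, not Augur's actual code; `run_with_optional_profile` is a hypothetical wrapper.

```python
import cProfile
import os


def run_with_optional_profile(func, env_var: str = "AUGUR_PROFILE"):
    """Run `func`, profiling it only when `env_var` holds an output file path.

    Hypothetical wrapper for illustration only.
    """
    output_path = os.environ.get(env_var)
    if not output_path:
        return func()  # variable unset or empty: run without profiling

    profiler = cProfile.Profile()
    try:
        # runcall() profiles the call and returns the function's result.
        return profiler.runcall(func)
    finally:
        # Save the collected statistics even if func raised an exception.
        profiler.dump_stats(output_path)
```

A file written this way can be inspected afterwards with Python's `pstats` module, for example by sorting on cumulative time.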

Bug fixes

filter, merge: Improved run time of sequence I/O operations, especially in the common use case of having a workflow manager run multiple invocations simultaneously. #1833 (@victorlin)
filter, merge: Previously, SeqKit was hardcoded to use its default of 4 threads per command, which could have resulted in oversubscription of resources in the common use case of having a workflow manager run multiple invocations simultaneously. The default behavior has been updated to use 1 thread per command to discourage oversubscription of resources. It is configurable with the new --nthreads option described above. #1833 (@victorlin)

Contributors

jameshadfield and victorlin

12 Jun 17:59

github-actions

31.2.1

8f7d126

31.2.1

Bug fixes

curate format-dates: Removed redundant warning messages that were previously displayed when using --failure-reporting "warn". #1816 (@victorlin)
filter: Improved performance of --output-sequences by using SeqKit internally. #1794 (@victorlin)
filter: Improved performance when using --sequences without --sequence-index by skipping indexing of --sequences when no sequence-based filters are used. #1827 (@victorlin)
filter: Fixed a bug that prevented proper checking of duplicates and sequence index mismatches on VCF inputs. #1826 (@victorlin)
merge: Fixed a performance bug where input sequence file validation unnecessarily loaded file contents into device memory. #1820 (@victorlin)
refine: Fixed a bug where inferred dates were being wrongly marked as not inferred. #1829 (@victorlin)

Contributors

victorlin

05 Jun 19:26

github-actions

31.2.0

1f6d64c

31.2.0

Features

merge: Support merging of sequence files with --sequences. #1579 (@victorlin)
read-file: Multiple files are now accepted. #1815 (@victorlin)
schema: Added fields for streamtrees and default zoom branch label. #1813 (@jameshadfield)

Bug fixes

Added a missing redirect for the environment variables documentation page from its previous location. #1812 (@tsibley)

Contributors

tsibley, jameshadfield, and victorlin

27 May 23:14

github-actions

31.1.0

6df50e3

31.1.0

Features

schema: Allow full stop character (.) in gene names. #955 (@jameshadfield)

Bug fixes

filter: Improved speed of using --group-by, --min-date, and --max-date on large datasets. #1792, #1811 (@victorlin)

Contributors

jameshadfield and victorlin

19 May 19:01

github-actions

31.0.0

9644574

31.0.0

Major Changes

augur mask --mask, augur tree --exclude-sites: BED files with inconsistent CHROM values (i.e., values in the first column of data lines) will throw an error, as Augur (implicitly) expects to be working on a single piece of DNA (chromosome, segment, etc), and multiple CHROM values in a BED file indicate a violation of this expectation. This is a breaking change. #945 (@genehack)
filter: Empty values in the metadata id column will result in an error that can only be resolved by editing the metadata file or by specifying a different id column with --metadata-id-columns. #1807 (@joverlee521)
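
The single-CHROM expectation for BED files can be expressed as a short check. The sketch below is illustrative only: `read_bed_chrom` is a hypothetical function, its header detection is deliberately simplified (lines starting with `#`, `browser`, or `track`), and Augur's real parser and error message may differ.

```python
from typing import Iterable, Optional

HEADER_PREFIXES = ("#", "browser", "track")


def read_bed_chrom(lines: Iterable[str]) -> Optional[str]:
    """Return the one CHROM value used by all data lines.

    Returns None when there are no data lines and raises ValueError when
    data lines disagree on CHROM. Hypothetical helper for illustration.
    """
    chroms = set()
    for line in lines:
        line = line.strip()
        if not line or line.startswith(HEADER_PREFIXES):
            continue  # skip blank lines and header lines
        chroms.add(line.split()[0])  # first column is CHROM
    if len(chroms) > 1:
        raise ValueError(
            f"BED file has inconsistent CHROM values: {sorted(chroms)}"
        )
    return next(iter(chroms), None)
```

Calling `read_bed_chrom(["chr1\t0\t10", "chr2\t5\t8"])` raises `ValueError`, because the two data lines name different chromosomes.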

Bug fixes

augur mask --mask, augur tree --exclude-sites: Providing an empty BED file, or one with only header lines and no data lines, will no longer cause an error to be thrown. #945 (@genehack)
augur.utils.read_bed_file() was rewritten for increased compliance with the BED file specification. In particular, header line detection is improved and multiple header lines are now supported. #945 (@genehack)
export v2: Improved the error message that is displayed when the metadata index column has duplicated values. #1791 (@genehack)
tree: Improved help text for --tree-builder-args to explain that some IQ-TREE options won't work because of defline rewriting. #875 (@genehack)
export v2: Automatically rename fields within the filters and colorings configs of the provided auspice config file to match the renamed fields in the exported nodes. #1804 (@joverlee521)
export v2: Divergence values are now exported with increased precision, showing up to 6 significant digits instead of 3. #1801 (@rneher)
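
To see what moving from 3 to 6 significant digits means for a small divergence value, consider the sketch below. `round_to_significant` is a hypothetical helper that uses Python's general float format; the source does not state how Augur performs the rounding.

```python
def round_to_significant(value: float, digits: int) -> float:
    """Round `value` to `digits` significant digits (illustrative helper)."""
    return float(f"{value:.{digits}g}")


divergence = 0.000123456789
three_digits = round_to_significant(divergence, 3)  # 0.000123
six_digits = round_to_significant(divergence, 6)    # 0.000123457
```

The extra digits matter most for very short branches, where rounding to 3 significant digits can make distinct divergence values indistinguishable.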

Contributors

genehack, rneher, and joverlee521

28 Apr 18:25

github-actions

30.0.1

5ba94d1

30.0.1

Bug fixes

filter: Removed the note that appeared in output when running with --sequences and without --sequence-index. The help text of both options has been updated to clarify the relationship between the two. #1797 (@victorlin)

Contributors

victorlin
