You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- Updated 'run_argnorm.py' to support the 'hamronization' tool option rather than the '--hamronization' flag
- Removed '@pytest.mark.parametrize('hamronized', [True, False])' in 'test_normalizers.py'. Added test_hamronization_normalizer() which normalizes and tests every file in examples/hamronized
- Updated example outputs in `outputs/` directory to match changes to input_gene and reference_gene processing in HamronizationNormalizer
- Notes on changes to example outputs:
1) `resfinder.resfinder.reads.tsv` and `resfinder.resfinder.orfs.tsv`: ant(6)-Ia, cat, tet(X), and tet(O/32/O) have been mapped to their exact entries in resfinder ARO mapping because their reference_accession is also being used
2) `abricate.resfinder.tsv`: aadA24_1 has been mapped to its exact entry in resfinder ARO mapping. blaCTX-M-63_1_EU660216 and blaCARB-4_1_FJ785525 don't exist in the resfinder database anymore. They were previously mapped as their gene_symbols (blaCTX-M-63 and blaCARB-4) could be mapped to other genes. Now, as their reference_accession is also used, they aren't mapped to AROs anymore.
3) `abricate.ncbi.tsv`: Many new genes have been mapped to the ARO (e.g. dfrB3). However, there has been no change to the preprocessing of ncbi or amrfinderplus inputs. These genes are getting mapped as HamronizationNormalizer combines ARO mapping tables of all dbs, and these genes are being mapped to ARO mapping tables of other dbs. This will be corrected in a future commit where the gene_name of ncbi and amrfinderplus inputs will be used (this will pinpoint the exact unique gene in NCBI ARO mapping).
0 commit comments