-
Notifications
You must be signed in to change notification settings - Fork 32
Open
Labels
Status: In ProgressHas been assigned and is being worked on.Has been assigned and is being worked on.
Description
Hi, I am looking to get some clarification on allele calling method.
At the completion of alleleCall module, it produces a log message with information such as:
Assigned identifiers to 174 new alleles for 162 loci.
Getting original sequence identifiers for new alleles...
Getting data for new representative alleles...
Adding the BLASTp self-score for the new representatives to ..schema/short/self_scores
Creating FASTA files with the new alleles...
Adding new alleles to schema...
Added 174 new alleles to the schema.
Added 1 new representative alleles to the schema.
I would like to know how a schema will be updated/effected if:
- an analysis dataset is discovered to have contaminated species
- low quality genomes
- running AlleleCall on subset of the whole dataset initially used
- re-run AlleleCall of duplicate dataset.
How is the schema altered in each of these scenarios, and whether there are internal quality checks in place before a new allele is added to the schema. Is it better to create a copy of the schema to use for test runs?
If you could provide some advice/suggestions on what should be the best practice, that will be great.
Thank you.
Metadata
Metadata
Assignees
Labels
Status: In ProgressHas been assigned and is being worked on.Has been assigned and is being worked on.