Skip to content

Addition of alleles to schema #222

@arshdeep1k

Description

@arshdeep1k

Hi, I am looking to get some clarification on allele calling method.

At the completion of alleleCall module, it produces a log message with information such as:

Assigned identifiers to 174 new alleles for 162 loci.
Getting original sequence identifiers for new alleles...
Getting data for new representative alleles...
Adding the BLASTp self-score for the new representatives to ..schema/short/self_scores
Creating FASTA files with the new alleles...
Adding new alleles to schema...
Added 174 new alleles to the schema.
Added 1 new representative alleles to the schema.

I would like to know how a schema will be updated/effected if:

  • an analysis dataset is discovered to have contaminated species
  • low quality genomes
  • running AlleleCall on subset of the whole dataset initially used
  • re-run AlleleCall of duplicate dataset.

How is the schema altered in each of these scenarios, and whether there are internal quality checks in place before a new allele is added to the schema. Is it better to create a copy of the schema to use for test runs?

If you could provide some advice/suggestions on what should be the best practice, that will be great.
Thank you.

Metadata

Metadata

Assignees

Labels

Status: In ProgressHas been assigned and is being worked on.

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions