monologue-underling variants get called as "alt-probable" although they should be called as confirmed. The reason seems to be that the MNP P13L is called as wild type even though it is present in the sequence:
{
"amino-acid-change": "P13L",
"codon-change": "CCC-CTT",
"gene": "N",
"one-based-reference-position": 28310,
"predicted-effect": "non-synonymous",
"protein": "nucleocapsid phosphoprotein",
"protein-codon-position": 13,
"reference-base": "CCC",
"type": "MNP",
"variant-base": "CTT",
"status": "no-detect"
}
In my example, both positions 28311 and 28312 are T. I suspect the problem is that the "one-based-reference-position" (28310) points to a base that matches the reference in the sample.
I am attaching the example I used.
example_barcode05.muscle.aln.fasta.zip
I spoke to the author of the definitions and we agreed that MNPs are denoted inconsistently across the yaml files. There will be an update so that all MNPs follow these rules:
- MNPs will always be per-codon, so length 3, even if they span two neighbouring codons (neighbouring codons = two MNPs)
- the "one-based-reference-position" will always be the position of the first nucleotide in the codon, whether that changes or not
- the "reference-base" and the "variant-base" will always have all 3 nucleotides, whether they change or not, essentially giving the same information as the "codon-change" field
- non-changing nucleotides in the MNP will be shown as N in the "variant-base" fields to indicate that the MNP is to be called no matter what the query sequence is at that position.
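To make the proposed convention concrete, here is a minimal sketch of how a caller could evaluate a per-codon MNP in which N acts as a wildcard. It is not the tool's actual implementation: the function name `call_mnp`, the "detect" status string, and the assumption that the query sequence is already in reference coordinates (aligned, without insertions) are all hypothetical. The P13L entry uses "NTT" as the variant, which is what the rules above would give for CCC-CTT.

```python
def call_mnp(definition, sequence):
    """Return "detect" if the per-codon MNP is present in `sequence`, else "no-detect".

    Hypothetical helper: `sequence` is assumed to be in reference coordinates,
    so that one-based position p corresponds to sequence[p - 1].
    """
    start = definition["one-based-reference-position"] - 1  # convert to 0-based
    variant = definition["variant-base"].upper()
    observed = sequence[start:start + len(variant)].upper()

    # A truncated sequence cannot confirm the variant.
    if len(observed) != len(variant):
        return "no-detect"

    # "N" in the definition matches any base in the query;
    # every other position must match exactly.
    matches = all(v == "N" or v == o for v, o in zip(variant, observed))
    return "detect" if matches else "no-detect"


# P13L written in the proposed convention: the position is the first base
# of the codon, and the unchanged first base is shown as N.
p13l = {
    "amino-acid-change": "P13L",
    "one-based-reference-position": 28310,
    "reference-base": "CCC",
    "variant-base": "NTT",
    "type": "MNP",
}

# Toy query sequences: filler up to position 28309, then the codon of interest.
mutated = "A" * 28309 + "CTT"
wild_type = "A" * 28309 + "CCC"

print(call_mnp(p13l, mutated))    # codon CTT matches NTT
print(call_mnp(p13l, wild_type))  # codon CCC does not match NTT
```

Because the whole codon is compared, an unchanged first base no longer causes the variant to be reported as wild type.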
Sorry about the faff.
Ulf