forked from datacarpentry/R-genomics
-
Notifications
You must be signed in to change notification settings - Fork 0
Files
/
Copy pathjournal.pone.0081760.s004.csv
Latest commit


Ryan Williams
Ryan Williams
403 lines (403 loc) · 60.7 KB
/
journal.pone.0081760.s004.csv
1 | LocusNum | Context | NonSynonymous | AnnotationType | AminoAcids | Codons | SNPPositiononProtein | RefGenomeID | PositionInRefGenome | Protein_GI | EC_number | locus_tag | gene | note | product | protein_id | function | primary_tag |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2 | 119 | AAAAAAATC.ATCAGGAAA | 0 | OnProtein | S_S | TCC_TCT | 346 | EcoO104H4_209EL-2071 | 3631161 | 407064277 | O3O_07785 | COG2199 FOG: GGDEF domain | membrane protein | AFS85324.1 | CDS | |||
3 | 188 | AAAAAACCA.CAACCTTGA | 0 | UnannotatedRegion | ||||||||||||||
4 | 264 | AAAAAACTG.TCAAAAACC | 0 | OnProtein | E_E | GAA_GAG | 140 | EcoO104H4_209EL-2071 | 78175 | 407067695 | O3O_25165 | COG0262 Dihydrofolate reductase | dihydrofolate reductase | AFS88742.1 | CDS | |||
5 | 975 | AAAAACGCG.TTTTTCTAC | 1 | OnProtein | A_T | ACG_GCG | 67 | EcoETEC_H10407 | 2141974 | 309702173 | ETEC_1985 | conserved hypothetical protein | CBJ01488.1 | CDS | ||||
6 | 1956 | AAAAATGCG.TTTCCGGTA | 0 | OnProtein | N_N | AAC_AAT | 76 | Eco53638 | 4261609 | 188490518 | Ec53638_4379 | conserved hypothetical protein | EDU65621.1 | CDS | ||||
7 | 2272 | AAAACAAGC.ATTACTTGG | 1 | OnProtein | N_Y | AAT_TAT | 52 | EcoO104H4_209EL-2071 | 3227526 | 407064655 | O3O_09685 | hypothetical protein | AFS85702.1 | CDS | ||||
8 | 2329 | AAAACACAG.AAAACTTAA | 0 | UnannotatedRegion | ||||||||||||||
9 | 2400 | AAAACACTG.CGCACTCCA | 0 | UnannotatedRegion | ||||||||||||||
10 | 3070 | AAAACCGGT.GGACTGAGT | 0 | OnProtein | V_V | GTC_GTT | 134 | EcoO104H4_209EL-2071 | 3196197 | 407064683 | O3O_09840 | hypothetical protein | AFS85730.1 | CDS | ||||
11 | 4036 | AAAACTTTG.CCGCGAAGC | 0 | OnProtein | G_G | GGA_GGG | 107 | EcoETEC_H10407 | 2134520 | 309702162 | ETEC_1974 | conserved hypothetical protein | CBJ01477.1 | CDS | ||||
12 | 4131 | AAAAGAAGA.CTGATTCAG | 0 | OnProtein | E_E | GAA_GAG | 35 | EcoO104H4_209EL-2071 | 3215465 | 407064667 | O3O_09745 | putative ATP-binding protein | AFS85714.1 | CDS | ||||
13 | 5742 | AAAATACGG.CAGGTGGAC | 0 | OnProtein | G_G | GGA_GGG | 414 | EcoNA114 | 909181 | 333968870 | ECNA114_0852 | Putative phage protein | AEG35675.1 | CDS | ||||
14 | 9552 | AAACCGCCG.GTCACCATG | 1 | OnProtein | A_S | GCG_TCG | 83 | Eco042 | 2611045 | 284922222 | 5.99.1.3 | EC042_2471 | gyrA | DNA gyrase subunit A | CBG35304.1 | CDS | ||
15 | 10548 | AAACGCTCC.GAAATAAAC | 0 | OnProtein | R_R | AGA_CGA | 25 | Eco042 | 890676 | 284920579 | EC042_0815 | ninE | prophage protein | CBG33641.1 | CDS | |||
16 | 11678 | AAACTGCAA.TACCTTCCA | 1 | OnProtein | K_N | AAA_AAT | 17 | Eco042 | 890654 | 284920579 | EC042_0815 | ninE | prophage protein | CBG33641.1 | CDS | |||
17 | 13044 | AAAGATCAT.ATGCGCTTT | 1 | OnProtein | H_Y | CAT_TAT | 97 | EcoO104H4_209EL-2071 | 3216728 | 407064666 | O3O_09740 | COG2301 Citrate lyase beta subunit | hypothetical protein | AFS85713.1 | CDS | |||
18 | 13260 | AAAGCAACG.GTGGTGAGT | 1 | OnProtein | C_R | CGT_TGT | 18 | EcoABU83972 | 1367983 | 307553145 | ECABU_c13590 | hypothetical protein encoded by prophage | ADN45920.1 | CDS | ||||
19 | 13886 | AAAGCGAAG.ACTATCAGC | 1 | OnProtein | D_N | AAC_GAC | 384 | Eco042 | 712504 | 284920405 | EC042_0641 | conserved hypothetical protein | CBG33466.1 | CDS | ||||
20 | 14707 | AAAGGCAGA.AAAGCAACG | 0 | OnProtein | E_E | GAA_GAG | 14 | EcoLF82 | 1002689 | 222032715 | LF82_099 | corresponding to LF82_p099 in publication : Miquel et al., PLoS One; CFT073; E2348 | hypothetical protein | CAP75454.1 | CDS | |||
21 | 15038 | AAAGGGATT.TGCTGCTGG | 1 | OnProtein | L_V | GTG_TTG | 236 | EcoO104H4_209EL-2071 | 2275955 | 407065619 | O3O_14595 | COG2199 FOG: GGDEF domain | diguanylate cyclase | AFS86666.1 | CDS | |||
22 | 16020 | AAAGTTAAA.CTGCATAAA | 0 | NotProteinCoding | Eco53638 | 4565074 | gene | locus_tag.location=4565053..4565232 | ||||||||||
23 | 16020 | AAAGTTAAA.CTGCATAAA | 0 | NotProteinCoding | Eco53638 | 4565074 | misc_RNA | locus_tag.location=4565053..4565232 | product.location=4565053..4565232 | |||||||||
24 | 16801 | AAATACACC.AAAACAAAA | 0 | UnannotatedRegion | ||||||||||||||
25 | 17368 | AAATATCCT.CAATCCACC | 1 | OnProtein | K_Q | AAG_CAG | 453 | Eco042 | 5078737 | 284924433 | EC042_4738 | conserved hypothetical protein | CBG37558.1 | CDS | ||||
26 | 17730 | AAATCAATT.GTTATTGAA | 0 | UnannotatedRegion | ||||||||||||||
27 | 20553 | AAATTGCCG.CATTTTACC | 1 | OnProtein | A_V | GCC_GTC | 216 | Eco53638 | 3390372 | 188489615 | 1.3.1.- | Ec53638_3450 | hcaB | identified by match to protein family HMM PF00106 | 2,3-dihydroxy-2,3-dihydrophenylpropionate dehydrogenase | EDU64718.1 | CDS | |
28 | 20627 | AAATTGGTA.TAACGTTGA | 1 | OnProtein | N_S | AAT_AGT | 140 | EcoO104H4_209EL-2071 | 2364253 | 407065535 | O3O_14155 | COG1045 Serine acetyltransferase | putative acetyltransferase | AFS86582.1 | CDS | |||
29 | 20998 | AACAAAAAA.CCATCAACC | 0 | NotProteinCoding | Eco53638 | 415300 | gene | gene.location=415231..415309 | locus_tag.location=415231..415309 | |||||||||
30 | 20998 | AACAAAAAA.CCATCAACC | 0 | NotProteinCoding | Eco53638 | 415300 | misc_RNA | gene.location=415231..415309 | locus_tag.location=415231..415309 | product.location=415231..415309 | ||||||||
31 | 22171 | AACACCAGC.AACTGCATG | 1 | OnProtein | *_E | GAA_TAA | 286 | EcoO104H4_209EL-2071 | 3222441 | 407064660 | O3O_09710 | COG4245 Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain | TerY3 | AFS85707.1 | CDS | |||
32 | 22976 | AACAGCCTG.CCGTCAGGA | 0 | OnProtein | G_G | GGC_GGT | 253 | EcoO104H4_209EL-2071 | 2390393 | 407065498 | O3O_13970 | COG0582 Integrase | integrase | AFS86545.1 | CDS | |||
33 | 26231 | AACCCTATA.AAACGTTCG | 1 | OnProtein | E_K | AAA_GAA | 718 | Eco53638 | 198513 | 188490250 | Ec53638_0242 | biofilm PGA synthesis protein PgaA | EDU65353.1 | CDS | ||||
34 | 27459 | AACCGTGCG.AGTGCTTCG | 0 | OnProtein | L_L | CTA_CTT | 98 | EcoETEC_H10407 | 2134493 | 309702162 | ETEC_1974 | conserved hypothetical protein | CBJ01477.1 | CDS | ||||
35 | 27733 | AACCTCTCC.GTTCGCTGT | 1 | OnProtein | C_R | CGT_TGT | 397 | EcoBL21DE3 | 311935 | 242376113 | B21_00283 | yahJ | predicted deaminase with metallo-dependent hydrolase domain | CAQ30800.1 | CDS | |||
36 | 28766 | AACGAGGTG.AAATGATAT | 0 | UnannotatedRegion | ||||||||||||||
37 | 30110 | AACGCTGAT.CTGTCTCAC | 0 | OnProtein | I_I | ATC_ATT | 54 | Eco042 | 3951128 | 284923451 | EC042_3722 | rpoH | RNA polymerase sigma-32 factor | CBG36546.1 | CDS | |||
38 | 30174 | AACGCTGGA.GCTGACGGC | 0 | OnProtein | E_E | GAA_GAG | 75 | EcoETEC_H10407 | 2148842 | 309702181 | ETEC_1993 | putative phage small terminase subunit | CBJ01496.1 | CDS | ||||
39 | 30530 | AACGGATTG.GTGGGCATC | 1 | OnProtein | H_R | CAC_CGC | 66 | EcoO104H4_209EL-2071 | 2388654 | 407065501 | O3O_13985 | hypothetical protein | AFS86548.1 | CDS | ||||
40 | 32531 | AACTATTGA.CAGAACGGT | 0 | OnProtein | E_E | GAA_GAG | 283 | Eco042 | 2451306 | 284922097 | EC042_2341 | fimbrial outer membrane usher protein | CBG35176.1 | CDS | ||||
41 | 33640 | AACTGTAAC.CTCTCCCCA | 0 | UnannotatedRegion | ||||||||||||||
42 | 33770 | AACTTAAAA.CTGCCGCTG | 0 | UnannotatedRegion | ||||||||||||||
43 | 33898 | AACTTCACG.GGAGAATGG | 0 | OnProtein | R_R | CGA_CGC | 242 | EcoO104H4_209EL-2071 | 3214176 | 407064669 | O3O_09755 | hypothetical protein | AFS85716.1 | CDS | ||||
44 | 35077 | AAGAATGAA.CCGGATATT | 1 | OnProtein | K_N | AAA_AAT | 2 | EcoABU83972 | 1170477 | 307552925 | ECABU_c11320 | hypothetical protein | ADN45700.1 | CDS | ||||
45 | 35148 | AAGAATTAT.AGAGCGGTA | 1 | OnProtein | E_Q | CAG_GAG | 38 | EcoO104H4_209EL-2071 | 2389346 | 407065499 | O3O_13975 | COG2944 Predicted transcriptional regulator | repressor protein C | AFS86546.1 | CDS | |||
46 | 35699 | AAGAGCAAT.TCCTCGTTC | 1 | OnProtein | D_E | GAA_GAT | 105 | EcoABU83972 | 1367720 | 307553145 | ECABU_c13590 | hypothetical protein encoded by prophage | ADN45920.1 | CDS | ||||
47 | 36332 | AAGATGATG.CAAACCTAC | 1 | OnProtein | A_T | ACA_GCA | 2 | EcoO104H4_209EL-2071 | 2542670 | 407065339 | O3O_13170 | hypothetical protein | AFS86386.1 | CDS | ||||
48 | 37114 | AAGCAGCCA.CTGACTGCC | 1 | OnProtein | R_S | AGA_AGC | 203 | EcoO104H4_209EL-2071 | 3215194 | 407064668 | O3O_09750 | COG0561 Predicted hydrolases of the HAD superfamily | hypothetical protein | AFS85715.1 | CDS | |||
49 | 37172 | AAGCAGGAT.AACGCCAGA | 1 | OnProtein | F_L | TTC_TTG | 54 | EcoO7K1_CE10 | 2514340 | 349738550 | CE10_2487 | hypothetical protein | AEQ13256.1 | CDS | ||||
50 | 39941 | AAGGAGCCA.TGATTATAT | 0 | OnProtein | Q_Q | CAA_CAG | 39 | EcoO104H4_209EL-2071 | 77872 | 407067695 | O3O_25165 | COG0262 Dihydrofolate reductase | dihydrofolate reductase | AFS88742.1 | CDS | |||
51 | 40417 | AAGGCGCGA.CCCGACGCT | 1 | OnProtein | N_T | AAC_ACC | 89 | EcoO104H4_209EL-2071 | 4949068 | 407063061 | O3O_01590 | COG3541 Predicted nucleotidyltransferase | hypothetical protein | AFS84108.1 | CDS | |||
52 | 41346 | AAGGTTCAC.CCGCTGTGA | 0 | OnProtein | G_G | GGG_GGT | 301 | EcoO104H4_209EL-2071 | 3213285 | 407064670 | O3O_09760 | COG0188 Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit | hypothetical protein | AFS85717.1 | CDS | |||
53 | 41563 | AAGTAAGTG.CGTCAGTGA | 1 | OnProtein | D_G | GAC_GGC | 45 | EcoETEC_H10407 | 2135432 | 309702165 | ETEC_1977 | conserved hypothetical protein | CBJ01480.1 | CDS | ||||
54 | 42126 | AAGTCGTCT.TAGGCGACT | 0 | UnannotatedRegion | ||||||||||||||
55 | 42745 | AAGTTAAAC.GTAAAATAA | 1 | OnProtein | G_S | AGT_GGT | 76 | EcoO104H4_209EL-2071 | 2364060 | 407065535 | O3O_14155 | COG1045 Serine acetyltransferase | putative acetyltransferase | AFS86582.1 | CDS | |||
56 | 44467 | AATAATAAA.AAATCTATC | 1 | OnProtein | K_N | AAA_AAT | 344 | EcoO104H4_209EL-2071 | 3631155 | 407064277 | O3O_07785 | COG2199 FOG: GGDEF domain | membrane protein | AFS85324.1 | CDS | |||
57 | 47253 | AATATGATA.ATAACAACG | 0 | UnannotatedRegion | ||||||||||||||
58 | 47307 | AATATGCGC.AAAGTATTA | 0 | OnProtein | F_F | TTC_TTT | 35 | Eco042 | 2871852 | 284922450 | EC042_2704 | putative signal transduction protein | CBG35537.1 | CDS | ||||
59 | 48258 | AATCAATGA.ATTTTCAAT | 0 | OnProtein | I_I | ATA_ATT | 3 | EcoO104H4_209EL-2071 | 77764 | 407067695 | O3O_25165 | COG0262 Dihydrofolate reductase | dihydrofolate reductase | AFS88742.1 | CDS | |||
60 | 52887 | AATCTTTTT.ACTGAATGA | 0 | OnProtein | V_V | GTA_GTG | 27 | EcoNA114 | 964767 | 333968938 | ECNA114_0920 | Hypothetical protein | AEG35743.1 | CDS | ||||
61 | 56603 | AATGGTCAT.TGCGTGACT | 1 | OnProtein | K_R | AAA_AGA | 67 | EcoETEC_H10407 | 2135498 | 309702165 | ETEC_1977 | conserved hypothetical protein | CBJ01480.1 | CDS | ||||
62 | 57380 | AATGTGGAC.CATCGTGGA | 0 | UnannotatedRegion | ||||||||||||||
63 | 57444 | AATGTGTTA.CAGGAATCC | 0 | UnannotatedRegion | ||||||||||||||
64 | 57838 | AATTAAATG.TTAATTTCA | 0 | UnannotatedRegion | ||||||||||||||
65 | 58333 | AATTACCGG.GTTGCAGGA | 1 | OnProtein | A_E | GAG_GCG | 306 | Eco042 | 2422313 | 284922070 | EC042_2315 | mdtC | multidrug resistance protein | CBG35149.1 | CDS | |||
66 | 58449 | AATTACTGG.GGCGAAGTG | 1 | OnProtein | G_R | AGG_GGG | 68 | Eco53638 | 1315236 | 188488068 | Ec53638_1350 | identified by match to protein family HMM PF00577 | fimbrial usher protein | EDU63171.1 | CDS | |||
67 | 58883 | AATTCACAG.CGAGGAAAT | 1 | OnProtein | A_D | GAC_GCC | 124 | EcoO104H4_209EL-2071 | 2582074 | 407065298 | O3O_12965 | COG3561 Phage anti-repressor protein | phage anti-repressor protein AntB | AFS86345.1 | CDS | |||
68 | 65394 | ACACGTCGC.GGTCCGCGT | 1 | OnProtein | L_P | CCG_CTG | 80 | EcoO104H4_209EL-2071 | 2381245 | 407065514 | O3O_14050 | hypothetical protein | AFS86561.1 | CDS | ||||
69 | 65656 | ACACTGTTC.CGATTGCCC | 1 | OnProtein | P_S | CCG_TCG | 656 | Eco042 | 4749922 | 284924152 | EC042_4427 | uvrA | UvrABC system protein A (excinuclease ABC subunit A) | CBG37251.1 | CDS | |||
70 | 66230 | ACAGATCAT.TATATGTCT | 1 | OnProtein | L_V | GTA_TTA | 94 | EcoO104H4_209EL-2071 | 78035 | 407067695 | O3O_25165 | COG0262 Dihydrofolate reductase | dihydrofolate reductase | AFS88742.1 | CDS | |||
71 | 66340 | ACAGATTGA.GCTGGCGAA | 1 | OnProtein | D_E | GAG_GAT | 33 | Eco53638 | 3846708 | 188491376 | Ec53638_3926 | identified by match to protein family HMM PF00005 | ABC transporter, ATP-binding protein | EDU66479.1 | CDS | |||
72 | 67053 | ACAGCGTGA.AAGCTGGCA | 0 | OnProtein | E_E | GAA_GAG | 77 | EcoO104H4_209EL-2071 | 2385468 | 407065511 | O3O_14035 | phage replication protein | AFS86558.1 | CDS | ||||
73 | 67258 | ACAGGAAAG.GATAAAACC | 0 | OnProtein | R_R | AGA_AGG | 139 | EcoO104H4_209EL-2071 | 2535550 | 407065353 | O3O_13240 | hypothetical protein | AFS86400.1 | CDS | ||||
74 | 67573 | ACAGGCCAG.CTAAAGGGC | 0 | OnProtein | R_R | AGA_AGG | 45 | EcoO104H4_209EL-2071 | 2375231 | 407065520 | O3O_14080 | terminase, endonuclease subunit | AFS86567.1 | CDS | ||||
75 | 68056 | ACAGGTGCC.AAAAATTAG | 1 | OnProtein | K_Q | AAA_CAA | 18 | EcoO104H4_209EL-2071 | 2367110 | 407065533 | O3O_14145 | COG5301 Phage-related tail fibre protein | putative side tail fiber protein | AFS86580.1 | CDS | |||
76 | 68232 | ACAGTAGCA.CGGATGTTG | 1 | OnProtein | A_S | GCG_TCG | 183 | EcoO104H4_209EL-2071 | 2364483 | 407065534 | O3O_14150 | putative tail fiber assembly protein | AFS86581.1 | CDS | ||||
77 | 68961 | ACATATTAG.AACCAACCA | 0 | NotProteinCoding | Eco042 | 2923451 | misc_feature | note.location=2922497..2927740 | ||||||||||
78 | 71392 | ACCACAAAA.CTGACGTTC | 0 | OnProtein | S_S | AGC_AGT | 268 | Eco53638 | 4536923 | 188489810 | Ec53638_4683 | identified by match to protein family HMM PF01408; match to protein family HMM PF02894 | gfo/idh/mocA family | EDU64913.1 | CDS | |||
79 | 71465 | ACCACACGG.CTGGCGTTG | 0 | OnProtein | G_G | GGA_GGG | 64 | EcoO104H4_209EL-2071 | 2385893 | 407065510 | O3O_14030 | hypothetical protein | AFS86557.1 | CDS | ||||
80 | 72032 | ACCACGTGC.CCCTGTAGC | 0 | OnProtein | G_G | GGG_GGT | 232 | EcoIHE3034 | 2659408 | 294491088 | ECOK1_2573 | identified by match to protein family HMM PF04860; match to protein family HMM TIGR01540 | phage portal protein, PBSX family | ADE89844.1 | CDS | |||
81 | 72381 | ACCAGAGCG.AGATAATCG | 1 | OnProtein | F_S | TCC_TTC | 9 | Eco042 | 1551330 | 284921246 | EC042_1488 | phage endopeptidase/lysis protein | CBG34312.1 | CDS | ||||
82 | 73513 | ACCATGCCT.GAATATCCG | 1 | OnProtein | L_P | CCA_CTA | 87 | Eco042 | 1156775 | 284920831 | EC042_1072 | torC | cytochrome c-type protein | CBG33894.1 | CDS | |||
83 | 73620 | ACCATTACC.GTTATGCGT | 0 | OnProtein | P_P | CCC_CCG | 62 | EcoO104H4_209EL-2071 | 3230006 | 407064652 | O3O_09670 | COG1794 Aspartate racemase | aspartate racemase | AFS85699.1 | CDS | |||
84 | 73949 | ACCCACCCC.CCTGACGGA | 0 | OnProtein | G_G | GGC_GGG | 292 | EcoO104H4_209EL-2071 | 3213258 | 407064670 | O3O_09760 | COG0188 Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit | hypothetical protein | AFS85717.1 | CDS | |||
85 | 74679 | ACCCGACAC.CATGTACGG | 0 | OnProtein | T_T | ACA_ACG | 7 | Eco042 | 5149649 | 284924499 | EC042_4810 | conserved hypothetical protein | CBG37633.1 | CDS | ||||
86 | 75556 | ACCCTGGTC.CGGGTTACC | 0 | OnProtein | R_R | CGC_CGT | 23 | EcoO104H4_209EL-2071 | 3213519 | 407064669 | O3O_09755 | hypothetical protein | AFS85716.1 | CDS | ||||
87 | 76481 | ACCGATCAC.CTAACCCGG | 0 | NotProteinCoding | EcoO104H4_209EL-2071 | 2527407 | misc_feature | note.location=2526505..2528577 | ||||||||||
88 | 77828 | ACCGCTGGC.CGGAAAATC | 0 | OnProtein | R_R | CGC_CGT | 279 | EcoO104H4_209EL-2071 | 4633614 | 407063356 | O3O_03115 | COG4227 Antirestriction protein | hypothetical protein | AFS84403.1 | CDS | |||
89 | 77956 | ACCGCTTTC.GGATGTGCT | 0 | OnProtein | P_P | CCA_CCT | 55 | EcoO104H4_209EL-2071 | 2581108 | 407065301 | O3O_12980 | hypothetical protein | AFS86348.1 | CDS | ||||
90 | 78567 | ACCGGCCCC.AGCCCAACG | 0 | OnProtein | L_L | CTA_CTT | 95 | EcoO104H4_209EL-2071 | 3213735 | 407064669 | O3O_09755 | hypothetical protein | AFS85716.1 | CDS | ||||
91 | 81072 | ACCTGCGCG.AGAAAAAGC | 0 | UnannotatedRegion | ||||||||||||||
92 | 81417 | ACCTGTTGG.GACGCTGAA | 1 | OnProtein | A_V | GCG_GTG | 165 | Eco53638 | 1668992 | 188489905 | 2.6.1.57 | Ec53638_1710 | tyrB | identified by match to protein family HMM PF00155 | aromatic-amino-acid transaminase | EDU65008.1 | CDS | |
93 | 82916 | ACGACCAGC.ATTGATCCG | 0 | OnProtein | A_A | GCC_GCT | 14 | EcoO104H4_209EL-2071 | 2385657 | 407065511 | O3O_14035 | phage replication protein | AFS86558.1 | CDS | ||||
94 | 83188 | ACGACGTCT.AGCCACCGG | 0 | OnProtein | L_L | CTA_CTC | 52 | EcoO104H4_209EL-2071 | 2542607 | 407065340 | O3O_13175 | NinE protein | AFS86387.1 | CDS | ||||
95 | 86029 | ACGCCTCCG.CGTTCACTC | 1 | OnProtein | A_G | GCC_GGC | 57 | EcoETEC_H10407 | 2168219 | 309702205 | ETEC_2018 | bacteriophage regulatory protein. bacteriophage regulatory protein | DNA-binding transcriptional regulator prophage remnant | CBJ01521.1 | CDS | |||
96 | 87196 | ACGCTAACC.TTTCGCTAC | 1 | OnProtein | H_R | CAT_CGT | 18 | EcoO104H4_209EL-2071 | 2388798 | 407065501 | O3O_13985 | hypothetical protein | AFS86548.1 | CDS | ||||
97 | 87509 | ACGCTGAAG.CGTATCAGA | 1 | OnProtein | A_T | ACG_GCG | 218 | EcoIHE3034 | 2654404 | 294492024 | ECOK1_2568 | identified by match to protein family HMM PF05840 | putative replication gene A protein | ADE90780.1 | CDS | |||
98 | 88192 | ACGGAAAAA.GCTTAATTG | 1 | OnProtein | H_R | CAT_CGT | 5 | EcoO104H4_209EL-2071 | 2387438 | 407065506 | O3O_14010 | hypothetical protein | AFS86553.1 | CDS | ||||
99 | 88531 | ACGGATACC.CCGACGCAG | 1 | OnProtein | A_T | ACC_GCC | 110 | EcoO104H4_209EL-2071 | 2368256 | 407065531 | O3O_14135 | COG3948 Phage-related baseplate assembly protein | baseplate assembly protein | AFS86578.1 | CDS | |||
100 | 88734 | ACGGCAACC.CACATAACT | 1 | OnProtein | A_V | GCG_GTG | 100 | EcoETEC_H10407 | 2163462 | 309702199 | ETEC_2012 | phage tail E family protein | CBJ01515.1 | CDS | ||||
101 | 90340 | ACGGTCGCA.CTGATGTTC | 1 | OnProtein | P_S | CCT_TCT | 248 | EcoO104H4_209EL-2071 | 2390376 | 407065498 | O3O_13970 | COG0582 Integrase | integrase | AFS86545.1 | CDS | |||
102 | 94136 | ACTCAGAAG.GGCATTATA | 0 | UnannotatedRegion | ||||||||||||||
103 | 94234 | ACTCAGTAA.CAGCTATAC | 0 | OnProtein | K_K | AAA_AAG | 145 | EcoO104H4_209EL-2071 | 2582010 | 407065298 | O3O_12965 | COG3561 Phage anti-repressor protein | phage anti-repressor protein AntB | AFS86345.1 | CDS | |||
104 | 95433 | ACTGAAAGA.CATGAACAG | 0 | OnProtein | E_E | GAA_GAG | 162 | EcoETEC_H10407 | 2155404 | 309702193 | ETEC_2005 | H | Similar to Bacteriophage P2 h probable tail fiber protein (gph). UniProt:P26700 (669 aa) fasta scores: E()=4.2e-63, 49.746% id in 788 aa, and C-terminus is similar to Shigella sonnei. bv' UniProt:Q53813 (EMBL:D00660 (318 aa) fasta scores: E()=2.3e-122, 95.597% id in 318 aa | probable tail fiber protein (gph) | CBJ01508.1 | CDS | ||
105 | 96582 | ACTGCACTG.CCCCCGATC | 1 | OnProtein | A_S | GCC_TCC | 102 | Eco042 | 789217 | 284920475 | EC042_0714 | putative esterase/lipase | CBG33537.1 | CDS | ||||
106 | 97613 | ACTGGAAGC.TTCCATAAA | 0 | OnProtein | A_A | GCC_GCT | 46 | EcoETEC_H10407 | 2168187 | 309702205 | ETEC_2018 | bacteriophage regulatory protein. bacteriophage regulatory protein | DNA-binding transcriptional regulator prophage remnant | CBJ01521.1 | CDS | |||
107 | 99689 | ACTGTTGGT.GCACTGCTG | 0 | OnProtein | A_A | GCA_GCG | 474 | EcoDH1 | 2448008 | 260449499 | EcDH1_2274 | PFAM: tail fiber repeat containing protein; tail fiber repeat 2 protein; Tail Collar domain protein; Prophage tail fibre domain protein; KEGG: ssn:SSON_2410 phage protein-related | Prophage tail fibre domain protein | ACX39921.1 | CDS | |||
108 | 101732 | AGAAAATAC.TTGATGATA | 1 | OnProtein | I_V | ATT_GTT | 37 | EcoO104H4_209EL-2071 | 2543424 | 407065337 | O3O_13160 | Protein ninH from prophage | AFS86384.1 | CDS | ||||
109 | 103161 | AGAACTGAA.TCTGATTAC | 0 | OnProtein | N_N | AAC_AAT | 155 | EcoO104H4_209EL-2071 | 2581980 | 407065298 | O3O_12965 | COG3561 Phage anti-repressor protein | phage anti-repressor protein AntB | AFS86345.1 | CDS | |||
110 | 104279 | AGAATAGGG.AATTCCGCC | 0 | UnannotatedRegion | ||||||||||||||
111 | 104464 | AGAATGACC.GGGGAGCCG | 1 | OnProtein | L_Q | CAG_CTG | 355 | EcoABU83972 | 2300316 | 307554037 | ECABU_c22930 | ibrA | immunoglobulin-binding regulator A | ADN46812.1 | CDS | |||
112 | 105054 | AGACATAAA.ACGCCTTAA | 0 | UnannotatedRegion | ||||||||||||||
113 | 105090 | AGACATCCC.GTCGCCCTC | 0 | OnProtein | P_P | CCC_CCT | 52 | EcoO104H4_209EL-2071 | 3216595 | 407064666 | O3O_09740 | COG2301 Citrate lyase beta subunit | hypothetical protein | AFS85713.1 | CDS | |||
114 | 105299 | AGACCCGGT.GCCCGTGCC | 0 | OnProtein | V_V | GTA_GTG | 95 | EcoO104H4_209EL-2071 | 2385414 | 407065511 | O3O_14035 | phage replication protein | AFS86558.1 | CDS | ||||
115 | 105737 | AGACGGACC.CATTTTCAG | 0 | OnProtein | P_P | CCA_CCG | 119 | EcoETEC_H10407 | 2150895 | 309702185 | ETEC_1997 | putative phage PS3 | CBJ01500.1 | CDS | ||||
116 | 106016 | AGACTCCCC.ATGAGCGGC | 0 | OnProtein | I_I | ATC_ATT | 634 | EcoIHE3034 | 2655654 | 294492024 | ECOK1_2568 | identified by match to protein family HMM PF05840 | putative replication gene A protein | ADE90780.1 | CDS | |||
117 | 110618 | AGATTTAAT.GTCACTCCG | 1 | OnProtein | C_S | AGT_TGT | 60 | EcoO104H4_209EL-2071 | 2542629 | 407065340 | O3O_13175 | NinE protein | AFS86387.1 | CDS | ||||
118 | 112294 | AGCACTGAG.CGTATACCG | 1 | OnProtein | A_T | ACG_GCG | 37 | EcoO104H4_209EL-2071 | 3190460 | 407064691 | O3O_09880 | hypothetical protein | AFS85738.1 | CDS | ||||
119 | 113559 | AGCATGCCC.CCTGCGCTC | 0 | OnProtein | G_G | GGA_GGC | 653 | EcoO104H4_209EL-2071 | 2365203 | 407065533 | O3O_14145 | COG5301 Phage-related tail fibre protein | putative side tail fiber protein | AFS86580.1 | CDS | |||
120 | 114668 | AGCCATAAA.TAAGCAACC | 0 | UnannotatedRegion | ||||||||||||||
121 | 114728 | AGCCATCAG.CACCTCCCA | 1 | OnProtein | A_V | GCC_GTC | 394 | EcoETEC_H10407 | 2164331 | 309702201 | ETEC_2014 | phage tail sheath protein | CBJ01517.1 | CDS | ||||
122 | 117385 | AGCGAGTCA.CTTGGGAAG | 0 | OnProtein | L_L | CTG_TTG | 171 | EcoO104H4_209EL-2071 | 1543566 | 407066321 | O3O_18175 | COG2207 AraC-type DNA-binding domain-containing proteins | putative DNA-binding protein, ARAC-type | AFS87368.1 | CDS | |||
123 | 118995 | AGCGGCATT.CATCACCAT | 1 | OnProtein | F_L | TTA_TTC | 48 | EcoABU83972 | 987132 | 307552749 | 2.7.4.14 | ECABU_c09480 | cmk | cytidylate kinase | ADN45524.1 | CDS | ||
124 | 120693 | AGCTCGCGT.ATTAGCAGA | 0 | OnProtein | V_V | GTA_GTG | 74 | EcoO104H4_209EL-2071 | 2581322 | 407065300 | O3O_12975 | hypothetical protein | AFS86347.1 | CDS | ||||
125 | 121137 | AGCTGGCAA.TGAAGGTAA | 1 | OnProtein | N_S | AAT_AGT | 102 | Eco042 | 5050095 | 284924410 | EC042_4714 | cybC | soluble cytochrome b562 | CBG37534.1 | CDS | |||
126 | 121209 | AGCTGGGCC.CAGGGCCCC | 0 | OnProtein | P_P | CCG_CCT | 367 | EcoO104H4_209EL-2071 | 2560392 | 407065318 | O3O_13065 | putative tail fiber protein | AFS86365.1 | CDS | ||||
127 | 121519 | AGCTTCCTG.TTTGTAGCC | 0 | OnProtein | K_K | AAA_AAG | 84 | EcoO83H1_NRG857C | 1009208 | 312945528 | NRG857_04640 | putative recombination protein | ADR26355.1 | CDS | ||||
128 | 121811 | AGGAAAAAT.TGATACATA | 0 | NotProteinCoding | Shifl_2a_301 | 3806674 | gene | db_xref.location=3592993..3952918 | locus_tag.location=3592993..3952918 | |||||||||
129 | 121811 | AGGAAAAAT.TGATACATA | 0 | NotProteinCoding | Shifl_2a_301 | 3806674 | misc_feature | note.location=3806606..3836346 | ||||||||||
130 | 122877 | AGGATATTT.TTGCACGTC | 0 | UnannotatedRegion | ||||||||||||||
131 | 123716 | AGGCAGGGT.ACGGACGTC | 0 | OnProtein | V_V | GTC_GTT | 26 | EcoO104H4_209EL-2071 | 2359364 | 407065540 | O3O_14180 | COG3498 Phage tail tube protein FII | putative tail tube protein | AFS86587.1 | CDS | |||
132 | 125518 | AGGCTTCTT.CTTCGCTTC | 1 | OnProtein | E_K | AAA_GAA | 53 | EcoO104H4_209EL-2071 | 2385542 | 407065511 | O3O_14035 | phage replication protein | AFS86558.1 | CDS | ||||
133 | 125779 | AGGGAGGTC.CTATGTTCC | 0 | UnannotatedRegion | ||||||||||||||
134 | 125844 | AGGGATCCG.ACGCGGCTG | 0 | OnProtein | V_V | GTA_GTC | 76 | EcoO104H4_209EL-2071 | 3214813 | 407064668 | O3O_09750 | COG0561 Predicted hydrolases of the HAD superfamily | hypothetical protein | AFS85715.1 | CDS | |||
135 | 126374 | AGGGGAAGA.TCGACATCG | 1 | OnProtein | I_T | ACT_ATT | 92 | EcoO104H4_209EL-2071 | 3205405 | 407064678 | O3O_09800 | COG2963 Transposase and inactivated derivatives | IS66 transposase | AFS85725.1 | CDS | |||
136 | 127204 | AGGTAGCGG.ACTGGAGCA | 1 | OnProtein | A_V | GCA_GTA | 188 | EcoO104H4_209EL-2071 | 2390197 | 407065498 | O3O_13970 | COG0582 Integrase | integrase | AFS86545.1 | CDS | |||
137 | 128865 | AGTAACATC.GCAAACTCA | 0 | OnProtein | A_A | GCA_GCT | 24 | EcoO104H4_209EL-2071 | 2389306 | 407065499 | O3O_13975 | COG2944 Predicted transcriptional regulator | repressor protein C | AFS86546.1 | CDS | |||
138 | 130133 | AGTCACGAC.TCACCCGGA | 1 | OnProtein | H_R | CAT_CGT | 165 | Eco53638 | 1844792 | 188489254 | Ec53638_1874 | identified by match to protein family HMM PF03406 | putative phage protein | EDU64357.1 | CDS | |||
139 | 130581 | AGTCCAGAG.AGTTTTACC | 1 | OnProtein | A_S | GCT_TCT | 60 | EcoO104H4_209EL-2071 | 49160 | 407067734 | O3O_25360 | hypothetical protein | AFS88781.1 | CDS | ||||
140 | 137281 | ATAAGAAAA.GTGAAAACA | 0 | UnannotatedRegion | ||||||||||||||
141 | 137579 | ATAAGGCAC.CTATCTCAC | 0 | UnannotatedRegion | ||||||||||||||
142 | 139119 | ATACATTCA.CAGCTTAAC | 1 | OnProtein | S_T | ACC_AGC | 292 | EcoNA114 | 3558063 | 333971535 | ECNA114_3501 | fliD | Flagellar capping protein FliD | AEG38340.1 | CDS | |||
143 | 139478 | ATACCGGAG.GAAGAGAAA | 0 | OnProtein | S_S | AGC_AGT | 87 | EcoO104H4_209EL-2071 | 2581283 | 407065300 | O3O_12975 | hypothetical protein | AFS86347.1 | CDS | ||||
144 | 139563 | ATACCTGCA.CCTAAATAA | 0 | UnannotatedRegion | ||||||||||||||
145 | 139736 | ATACGCCGC.AGGACGGAA | 1 | OnProtein | *_W | TAG_TGG | 366 | EcoO104H4_209EL-2071 | 3233912 | 407064645 | O3O_09635 | COG3969 Predicted phosphoadenosine phosphosulfate sulfotransferase | hypothetical protein | AFS85692.1 | CDS | |||
146 | 142127 | ATATAAGCG.ACGTTCACC | 0 | OnProtein | V_V | GTA_GTG | 166 | Eco042 | 680538 | 284920374 | EC042_0610 | pheP | phenylalanine-specific permease | CBG33435.1 | CDS | |||
147 | 142345 | ATATATAAC.AAAAAGAGC | 0 | OnProtein | T_T | ACA_ACT | 181 | EcoO104H4_209EL-2071 | 2364377 | 407065535 | O3O_14155 | COG1045 Serine acetyltransferase | putative acetyltransferase | AFS86582.1 | CDS | |||
148 | 143466 | ATATGGTGA.GGAATGCCG | 1 | OnProtein | D_E | GAG_GAT | 89 | EcoNA114 | 921458 | 333968887 | ECNA114_0869 | Hypothetical protein | AEG35692.1 | CDS | ||||
149 | 144786 | ATCAAATAA.CCCTGAAGA | 0 | OnProtein | G_G | GGA_GGG | 24 | EcoO104H4_209EL-2071 | 2386013 | 407065510 | O3O_14030 | hypothetical protein | AFS86557.1 | CDS | ||||
150 | 146894 | ATCAGCATC.AAACATTCC | 1 | OnProtein | F_L | TTA_TTC | 743 | EcoKO11FL | 890919 | 323377234 | EKO11_0853 | KEGG: eoh:ECO103_3453 putative selenate reductase subunit YgfK; TIGRFAM: selenate reductase YgfK; PFAM: FAD-dependent pyridine nucleotide-disulphide oxidoreductase | selenate reductase YgfK | ADX49502.1 | CDS | |||
151 | 147961 | ATCATGATG.GCTTTTAAC | 0 | OnProtein | A_A | GCA_GCG | 95 | EcoO104H4_209EL-2071 | 3216724 | 407064666 | O3O_09740 | COG2301 Citrate lyase beta subunit | hypothetical protein | AFS85713.1 | CDS | |||
152 | 148735 | ATCCAGATT.AACATCTCA | 1 | OnProtein | F_L | TTA_TTT | 31 | EcoCloneDi2 | 1307299 | 355419698 | i02_1315 | hypothetical protein | AER83895.1 | CDS | ||||
153 | 150442 | ATCCGCTTT.CTGACAGCG | 1 | OnProtein | R_S | AGG_AGT | 59 | EcoETEC_H10407 | 2163068 | 309702198 | ETEC_2011 | putative tail protein | CBJ01514.1 | CDS | ||||
154 | 150696 | ATCCGGGAT.GAAGAAATA | 0 | OnProtein | S_S | TCG_TCT | 42 | EcoO104H4_209EL-2071 | 3229946 | 407064652 | O3O_09670 | COG1794 Aspartate racemase | aspartate racemase | AFS85699.1 | CDS | |||
155 | 150888 | ATCCGTCAG.GTGACACTG | 0 | OnProtein | S_S | AGC_AGT | 129 | EcoETEC_H10407 | 2152951 | 309702189 | ETEC_2001 | putative phage baseplate assembly protein V | CBJ01504.1 | CDS | ||||
156 | 152670 | ATCGCCAAC.TGTATAATC | 0 | OnProtein | L_L | CTG_TTG | 39 | Eco042 | 743859 | 284920435 | EC042_0670 | mrdA | penicillin-binding protein 2 | CBG33496.1 | CDS | |||
157 | 156122 | ATCTGATGT.AAAACCTTC | 0 | OnProtein | V_V | GTA_GTT | 37 | EcoO104H4_209EL-2071 | 2542562 | 407065340 | O3O_13175 | NinE protein | AFS86387.1 | CDS | ||||
158 | 158202 | ATGAATTAG.GCATCATCA | 0 | OnProtein | A_A | GCA_GCG | 35 | EcoO55H7_RM12579 | 304362 | 374357194 | ECO55CA74_01345 | hypothetical protein | AEZ38901.1 | CDS | ||||
159 | 158547 | ATGACGAGT.TCGTAGGCC | 0 | OnProtein | E_E | GAA_GAG | 168 | EcoO104H4_209EL-2071 | 2581941 | 407065298 | O3O_12965 | COG3561 Phage anti-repressor protein | phage anti-repressor protein AntB | AFS86345.1 | CDS | |||
160 | 159439 | ATGATGCAG.CGCAGAACA | 1 | OnProtein | A_S | GCG_TCG | 524 | Eco53638 | 4003782 | 188489603 | 2.7.10.1 | Ec53638_4096 | wzc | identified by match to protein family HMM PF02706; match to protein family HMM TIGR01007 | tyrosine-protein kinase wzc | EDU64706.1 | CDS | |
161 | 162464 | ATGGAGAGA.CCGATTTGA | 1 | OnProtein | N_S | AAC_AGC | 28 | EcoABU83972 | 1367952 | 307553145 | ECABU_c13590 | hypothetical protein encoded by prophage | ADN45920.1 | CDS | ||||
162 | 164746 | ATGTAGCTG.CAGGGCCCC | 1 | OnProtein | C_W | TGC_TGG | 91 | EcoP12b | 496679 | 383101841 | P12B_c0469 | hypothetical protein | AFG39350.1 | CDS | ||||
163 | 166170 | ATTAAAATT.ATTTCATGA | 0 | OnProtein | I_I | ATC_ATT | 368 | Eco042 | 3998917 | 284923502 | EC042_3773 | conserved hypothetical protein | CBG36597.1 | CDS | ||||
164 | 166253 | ATTAAAGAC.CACCTCGCG | 0 | OnProtein | T_T | ACA_ACG | 145 | EcoO104H4_209EL-2071 | 3212817 | 407064670 | O3O_09760 | COG0188 Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit | hypothetical protein | AFS85717.1 | CDS | |||
165 | 166861 | ATTAATAGG.TCTGATGAA | 0 | OnProtein | E_E | GAA_GAG | 30 | Eco042 | 1972370 | 284921640 | EC042_1886 | arpB | putative type III effector protein (ankyrin repeat protein B) | CBG34712.1 | CDS | |||
166 | 168341 | ATTATAAAG.TCTAAGAAT | 0 | UnannotatedRegion | ||||||||||||||
167 | 168411 | ATTATATTC.TCGCACGCC | 0 | UnannotatedRegion | ||||||||||||||
168 | 169985 | ATTCATCAA.AAATATATC | 1 | OnProtein | F_I | ATT_TTT | 39 | EcoO104H4_209EL-2071 | 2355689 | 407065543 | O3O_14195 | hypothetical protein | AFS86590.1 | CDS | ||||
169 | 170394 | ATTCCCCTT.TACCTGTCC | 1 | OnProtein | K_Q | AAA_CAA | 159 | EcoIHE3034 | 2751852 | 294490080 | ECOK1_2677 | identified by match to protein family HMM PF04447; match to protein family HMM PF04448 | conserved hypothetical protein | ADE88836.1 | CDS | |||
170 | 170688 | ATTCCTTAA.TGTCATATC | 0 | UnannotatedRegion | ||||||||||||||
171 | 173056 | ATTGCCGGA.TAACAAAAA | 0 | UnannotatedRegion | ||||||||||||||
172 | 173341 | ATTGCGTGA.GCCTTCCAG | 0 | OnProtein | A_A | GCA_GCC | 10 | EcoABU83972 | 4043923 | 307555644 | ECABU_c39800 | hypothetical protein | ADN48419.1 | CDS | ||||
173 | 174447 | ATTGTGTGC.CTTTGGGTG | 1 | OnProtein | A_T | ACT_GCT | 67 | EcoO104H4_209EL-2071 | 3260121 | 407064624 | O3O_09530 | hypothetical protein | AFS85671.1 | CDS | ||||
174 | 174681 | ATTTAAAAT.AATATATTA | 0 | UnannotatedRegion | ||||||||||||||
175 | 175419 | ATTTATTCA.AAGCTTGCA | 0 | UnannotatedRegion | ||||||||||||||
176 | 176006 | ATTTCTCGC.TTATTATCC | 1 | OnProtein | L_P | CCT_CTT | 23 | EcoUMNK88 | 3599896 | 332344925 | UMNK88_3723 | hypothetical protein | AEE58259.1 | CDS | ||||
177 | 177365 | ATTTTGCAG.GTCTCTAAA | 0 | OnProtein | T_T | ACA_ACT | 720 | Eco53638 | 234627 | 188489187 | Ec53638_0275 | torS | identified by match to protein family HMM PF00072; match to protein family HMM PF00512; match to protein family HMM PF00672; match to protein family HMM PF01627; match to protein family HMM PF02518; match to protein family HMM TIGR02956 | sensor histidine kinase/response regulator TorS | EDU64290.1 | CDS | ||
178 | 177833 | ATTTTTGCA.TAAGCAGCG | 0 | OnProtein | L_L | CTA_TTA | 77 | EcoO104H4_209EL-2071 | 2388622 | 407065501 | O3O_13985 | hypothetical protein | AFS86548.1 | CDS | ||||
179 | 179606 | CAAACGCAT.CGCAAAAAA | 0 | OnProtein | I_I | ATC_ATT | 11 | EcoO104H4_209EL-2071 | 2389667 | 407065498 | O3O_13970 | COG0582 Integrase | integrase | AFS86545.1 | CDS | |||
180 | 181102 | CAAATCCAT.CCCGCCAGC | 0 | OnProtein | G_G | GGA_GGC | 43 | Eco042 | 5170043 | 284924513 | EC042_4826 | gntP | high-affinity gluconate transporter | CBG37650.1 | CDS | |||
181 | 183513 | CAACGAATG.CAATTAATC | 1 | OnProtein | *_W | TGA_TGG | 37 | EcoO7K1_CE10 | 5030807 | 349740883 | CE10_4923 | yjfL | inner membrane protein, UPF0719 family | AEQ15589.1 | CDS | |||
182 | 184116 | CAACGGCGA.AGGGGAGCG | 1 | OnProtein | I_V | ATC_GTC | 313 | EcoO104H4_209EL-2071 | 3214387 | 407064669 | O3O_09755 | hypothetical protein | AFS85716.1 | CDS | ||||
183 | 184275 | CAACGGTTG.CGGATGCGG | 1 | OnProtein | A_V | GCC_GTC | 10 | EcoKO11FL | 1849396 | 323378102 | EKO11_1751 | manually curated; KEGG: sfv:SFV_2308 ribonucleotide-diphosphate reductase subunit beta | hypothetical protein | ADX50370.1 | CDS | |||
184 | 185940 | CAAGGAGCG.AGCGACTGG | 0 | UnannotatedRegion | ||||||||||||||
185 | 188250 | CAATGCTGC.GGATGCGGC | 0 | UnannotatedRegion | ||||||||||||||
186 | 188696 | CAATTCACA.CCGAGGAAA | 1 | OnProtein | A_T | ACC_GCC | 124 | EcoO104H4_209EL-2071 | 2582075 | 407065298 | O3O_12965 | COG3561 Phage anti-repressor protein | phage anti-repressor protein AntB | AFS86345.1 | CDS | |||
187 | 189434 | CACAAGAAT.CGGTCAAAC | 0 | OnProtein | R_R | CGC_CGT | 347 | EcoO104H4_209EL-2071 | 3214491 | 407064669 | O3O_09755 | hypothetical protein | AFS85716.1 | CDS | ||||
188 | 192257 | CACCAGCCC.TCATTCATC | 0 | OnProtein | D_D | GAC_GAT | 538 | Eco042 | 4053526 | 284923549 | EC042_3819 | putative outer membrane assembly protein | CBG36644.1 | CDS | ||||
189 | 192991 | CACCATGGT.ATGGCCACA | 0 | OnProtein | V_V_V | GTA_GTG_GTT | 149 | EcoETEC_H10407 | 2155365 | 309702193 | ETEC_2005 | H | Similar to Bacteriophage P2 h probable tail fiber protein (gph). UniProt:P26700 (669 aa) fasta scores: E()=4.2e-63, 49.746% id in 788 aa, and C-terminus is similar to Shigella sonnei. bv' UniProt:Q53813 (EMBL:D00660 (318 aa) fasta scores: E()=2.3e-122, 95.597% id in 318 aa | probable tail fiber protein (gph) | CBJ01508.1 | CDS | ||
190 | 193489 | CACCCGCAA.CTTGACCGC | 0 | OnProtein | N_N | AAC_AAT | 89 | EcoBL21DE3 | 3723945 | 242379196 | B21_03483 | aec79 | aec79 | CAQ34000.1 | CDS | |||
191 | 194945 | CACCGGTAG.ATTCCTTAA | 0 | UnannotatedRegion | ||||||||||||||
192 | 195751 | CACCTGGCT.ATTGAAAAA | 0 | OnProtein | L_L | CTA_CTG | 53 | EcoO104H4_209EL-2071 | 2388692 | 407065501 | O3O_13985 | hypothetical protein | AFS86548.1 | CDS | ||||
193 | 198373 | CACGTCATT.ACGCGGGTC | 0 | OnProtein | V_V | GTC_GTT | 54 | EcoETEC_H10407 | 2165350 | 309702201 | ETEC_2014 | phage tail sheath protein | CBJ01517.1 | CDS | ||||
194 | 199495 | CACTGCCAC.GCAGGATCC | 1 | OnProtein | L_Q | CAG_CTG | 35 | Eco042 | 518316 | 284920244 | EC042_0472 | putative lipoprotein | CBG33303.1 | CDS | ||||
195 | 200828 | CAGAAACAT.AAAATCGGG | 1 | OnProtein | H_Y | CAT_TAT | 7 | Eco042 | 685484 | 284920379 | 6.3.-.- | EC042_0615 | ybdK | carboxylate-amine ligase | CBG33440.1 | CDS | ||
196 | 205278 | CAGCACTGA.ACGTATACC | 0 | OnProtein | E_E | GAA_GAG | 36 | EcoABU83972 | 1261123 | 307553029 | ECABU_c12400 | hypothetical protein | ADN45804.1 | CDS | ||||
197 | 206669 | CAGCATTGC.GCAAATACG | 0 | OnProtein | A_A | GCA_GCG | 89 | EcoO104H4_209EL-2071 | 2385818 | 407065510 | O3O_14030 | hypothetical protein | AFS86557.1 | CDS | ||||
198 | 207903 | CAGCCGGTT.TAACGCTGC | 1 | OnProtein | F_Y | TAT_TTT | 640 | EcoO104H4_209EL-2071 | 2365243 | 407065533 | O3O_14145 | COG5301 Phage-related tail fibre protein | putative side tail fiber protein | AFS86580.1 | CDS | |||
199 | 211403 | CAGCTCGCG.AATTAGCAG | 1 | OnProtein | A_V | GCA_GTA | 74 | EcoO104H4_209EL-2071 | 2581323 | 407065300 | O3O_12975 | hypothetical protein | AFS86347.1 | CDS | ||||
200 | 211659 | CAGCTGGCC.AGCATATGG | 0 | OnProtein | L_L | CTC_CTT | 108 | EcoO104H4_209EL-2071 | 3214909 | 407064668 | O3O_09750 | COG0561 Predicted hydrolases of the HAD superfamily | hypothetical protein | AFS85715.1 | CDS | |||
201 | 212097 | CAGGAAAAC.CTGTTGATG | 0 | NotProteinCoding | EcoO104H4_209EL-2071 | 2526741 | misc_feature | note.location=2526505..2528577 | ||||||||||
202 | 213625 | CAGGCGCTG.GAATCGATA | 0 | OnProtein | S_S | TCC_TCT | 193 | EcoO104H4_209EL-2071 | 3878275 | 407064033 | O3O_06555 | COG3696 Putative silver efflux pump | copper/silver efflux system, membrane component | AFS85080.1 | CDS |