Skip to content

Files

Latest commit

Ryan WilliamsRyan Williams
Ryan Williams
and
Ryan Williams
Jul 27, 2015
b3f33b7 · Jul 27, 2015

History

History
403 lines (403 loc) · 60.7 KB

journal.pone.0081760.s004.csv

File metadata and controls

403 lines (403 loc) · 60.7 KB
1
LocusNumContextNonSynonymousAnnotationTypeAminoAcidsCodonsSNPPositiononProteinRefGenomeIDPositionInRefGenomeProtein_GIEC_numberlocus_taggenenoteproductprotein_idfunctionprimary_tag
2
119AAAAAAATC.ATCAGGAAA0OnProteinS_STCC_TCT346EcoO104H4_209EL-20713631161407064277O3O_07785COG2199 FOG: GGDEF domainmembrane proteinAFS85324.1CDS
3
188AAAAAACCA.CAACCTTGA0UnannotatedRegion
4
264AAAAAACTG.TCAAAAACC0OnProteinE_EGAA_GAG140EcoO104H4_209EL-207178175407067695O3O_25165COG0262 Dihydrofolate reductasedihydrofolate reductaseAFS88742.1CDS
5
975AAAAACGCG.TTTTTCTAC1OnProteinA_TACG_GCG67EcoETEC_H104072141974309702173ETEC_1985conserved hypothetical proteinCBJ01488.1CDS
6
1956AAAAATGCG.TTTCCGGTA0OnProteinN_NAAC_AAT76Eco536384261609188490518Ec53638_4379conserved hypothetical proteinEDU65621.1CDS
7
2272AAAACAAGC.ATTACTTGG1OnProteinN_YAAT_TAT52EcoO104H4_209EL-20713227526407064655O3O_09685hypothetical proteinAFS85702.1CDS
8
2329AAAACACAG.AAAACTTAA0UnannotatedRegion
9
2400AAAACACTG.CGCACTCCA0UnannotatedRegion
10
3070AAAACCGGT.GGACTGAGT0OnProteinV_VGTC_GTT134EcoO104H4_209EL-20713196197407064683O3O_09840hypothetical proteinAFS85730.1CDS
11
4036AAAACTTTG.CCGCGAAGC0OnProteinG_GGGA_GGG107EcoETEC_H104072134520309702162ETEC_1974conserved hypothetical proteinCBJ01477.1CDS
12
4131AAAAGAAGA.CTGATTCAG0OnProteinE_EGAA_GAG35EcoO104H4_209EL-20713215465407064667O3O_09745putative ATP-binding proteinAFS85714.1CDS
13
5742AAAATACGG.CAGGTGGAC0OnProteinG_GGGA_GGG414EcoNA114909181333968870ECNA114_0852Putative phage proteinAEG35675.1CDS
14
9552AAACCGCCG.GTCACCATG1OnProteinA_SGCG_TCG83Eco04226110452849222225.99.1.3EC042_2471gyrADNA gyrase subunit ACBG35304.1CDS
15
10548AAACGCTCC.GAAATAAAC0OnProteinR_RAGA_CGA25Eco042890676284920579EC042_0815ninEprophage proteinCBG33641.1CDS
16
11678AAACTGCAA.TACCTTCCA1OnProteinK_NAAA_AAT17Eco042890654284920579EC042_0815ninEprophage proteinCBG33641.1CDS
17
13044AAAGATCAT.ATGCGCTTT1OnProteinH_YCAT_TAT97EcoO104H4_209EL-20713216728407064666O3O_09740COG2301 Citrate lyase beta subunithypothetical proteinAFS85713.1CDS
18
13260AAAGCAACG.GTGGTGAGT1OnProteinC_RCGT_TGT18EcoABU839721367983307553145ECABU_c13590hypothetical protein encoded by prophageADN45920.1CDS
19
13886AAAGCGAAG.ACTATCAGC1OnProteinD_NAAC_GAC384Eco042712504284920405EC042_0641conserved hypothetical proteinCBG33466.1CDS
20
14707AAAGGCAGA.AAAGCAACG0OnProteinE_EGAA_GAG14EcoLF821002689222032715LF82_099corresponding to LF82_p099 in publication : Miquel et al., PLoS One; CFT073; E2348hypothetical proteinCAP75454.1CDS
21
15038AAAGGGATT.TGCTGCTGG1OnProteinL_VGTG_TTG236EcoO104H4_209EL-20712275955407065619O3O_14595COG2199 FOG: GGDEF domaindiguanylate cyclaseAFS86666.1CDS
22
16020AAAGTTAAA.CTGCATAAA0NotProteinCodingEco536384565074genelocus_tag.location=4565053..4565232
23
16020AAAGTTAAA.CTGCATAAA0NotProteinCodingEco536384565074misc_RNAlocus_tag.location=4565053..4565232product.location=4565053..4565232
24
16801AAATACACC.AAAACAAAA0UnannotatedRegion
25
17368AAATATCCT.CAATCCACC1OnProteinK_QAAG_CAG453Eco0425078737284924433EC042_4738conserved hypothetical proteinCBG37558.1CDS
26
17730AAATCAATT.GTTATTGAA0UnannotatedRegion
27
20553AAATTGCCG.CATTTTACC1OnProteinA_VGCC_GTC216Eco5363833903721884896151.3.1.-Ec53638_3450hcaBidentified by match to protein family HMM PF001062,3-dihydroxy-2,3-dihydrophenylpropionate dehydrogenaseEDU64718.1CDS
28
20627AAATTGGTA.TAACGTTGA1OnProteinN_SAAT_AGT140EcoO104H4_209EL-20712364253407065535O3O_14155COG1045 Serine acetyltransferaseputative acetyltransferaseAFS86582.1CDS
29
20998AACAAAAAA.CCATCAACC0NotProteinCodingEco53638415300genegene.location=415231..415309locus_tag.location=415231..415309
30
20998AACAAAAAA.CCATCAACC0NotProteinCodingEco53638415300misc_RNAgene.location=415231..415309locus_tag.location=415231..415309product.location=415231..415309
31
22171AACACCAGC.AACTGCATG1OnProtein*_EGAA_TAA286EcoO104H4_209EL-20713222441407064660O3O_09710COG4245 Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domainTerY3AFS85707.1CDS
32
22976AACAGCCTG.CCGTCAGGA0OnProteinG_GGGC_GGT253EcoO104H4_209EL-20712390393407065498O3O_13970COG0582 IntegraseintegraseAFS86545.1CDS
33
26231AACCCTATA.AAACGTTCG1OnProteinE_KAAA_GAA718Eco53638198513188490250Ec53638_0242biofilm PGA synthesis protein PgaAEDU65353.1CDS
34
27459AACCGTGCG.AGTGCTTCG0OnProteinL_LCTA_CTT98EcoETEC_H104072134493309702162ETEC_1974conserved hypothetical proteinCBJ01477.1CDS
35
27733AACCTCTCC.GTTCGCTGT1OnProteinC_RCGT_TGT397EcoBL21DE3311935242376113B21_00283yahJpredicted deaminase with metallo-dependent hydrolase domainCAQ30800.1CDS
36
28766AACGAGGTG.AAATGATAT0UnannotatedRegion
37
30110AACGCTGAT.CTGTCTCAC0OnProteinI_IATC_ATT54Eco0423951128284923451EC042_3722rpoHRNA polymerase sigma-32 factorCBG36546.1CDS
38
30174AACGCTGGA.GCTGACGGC0OnProteinE_EGAA_GAG75EcoETEC_H104072148842309702181ETEC_1993putative phage small terminase subunitCBJ01496.1CDS
39
30530AACGGATTG.GTGGGCATC1OnProteinH_RCAC_CGC66EcoO104H4_209EL-20712388654407065501O3O_13985hypothetical proteinAFS86548.1CDS
40
32531AACTATTGA.CAGAACGGT0OnProteinE_EGAA_GAG283Eco0422451306284922097EC042_2341fimbrial outer membrane usher proteinCBG35176.1CDS
41
33640AACTGTAAC.CTCTCCCCA0UnannotatedRegion
42
33770AACTTAAAA.CTGCCGCTG0UnannotatedRegion
43
33898AACTTCACG.GGAGAATGG0OnProteinR_RCGA_CGC242EcoO104H4_209EL-20713214176407064669O3O_09755hypothetical proteinAFS85716.1CDS
44
35077AAGAATGAA.CCGGATATT1OnProteinK_NAAA_AAT2EcoABU839721170477307552925ECABU_c11320hypothetical proteinADN45700.1CDS
45
35148AAGAATTAT.AGAGCGGTA1OnProteinE_QCAG_GAG38EcoO104H4_209EL-20712389346407065499O3O_13975COG2944 Predicted transcriptional regulatorrepressor protein CAFS86546.1CDS
46
35699AAGAGCAAT.TCCTCGTTC1OnProteinD_EGAA_GAT105EcoABU839721367720307553145ECABU_c13590hypothetical protein encoded by prophageADN45920.1CDS
47
36332AAGATGATG.CAAACCTAC1OnProteinA_TACA_GCA2EcoO104H4_209EL-20712542670407065339O3O_13170hypothetical proteinAFS86386.1CDS
48
37114AAGCAGCCA.CTGACTGCC1OnProteinR_SAGA_AGC203EcoO104H4_209EL-20713215194407064668O3O_09750COG0561 Predicted hydrolases of the HAD superfamilyhypothetical proteinAFS85715.1CDS
49
37172AAGCAGGAT.AACGCCAGA1OnProteinF_LTTC_TTG54EcoO7K1_CE102514340349738550CE10_2487hypothetical proteinAEQ13256.1CDS
50
39941AAGGAGCCA.TGATTATAT0OnProteinQ_QCAA_CAG39EcoO104H4_209EL-207177872407067695O3O_25165COG0262 Dihydrofolate reductasedihydrofolate reductaseAFS88742.1CDS
51
40417AAGGCGCGA.CCCGACGCT1OnProteinN_TAAC_ACC89EcoO104H4_209EL-20714949068407063061O3O_01590COG3541 Predicted nucleotidyltransferasehypothetical proteinAFS84108.1CDS
52
41346AAGGTTCAC.CCGCTGTGA0OnProteinG_GGGG_GGT301EcoO104H4_209EL-20713213285407064670O3O_09760COG0188 Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunithypothetical proteinAFS85717.1CDS
53
41563AAGTAAGTG.CGTCAGTGA1OnProteinD_GGAC_GGC45EcoETEC_H104072135432309702165ETEC_1977conserved hypothetical proteinCBJ01480.1CDS
54
42126AAGTCGTCT.TAGGCGACT0UnannotatedRegion
55
42745AAGTTAAAC.GTAAAATAA1OnProteinG_SAGT_GGT76EcoO104H4_209EL-20712364060407065535O3O_14155COG1045 Serine acetyltransferaseputative acetyltransferaseAFS86582.1CDS
56
44467AATAATAAA.AAATCTATC1OnProteinK_NAAA_AAT344EcoO104H4_209EL-20713631155407064277O3O_07785COG2199 FOG: GGDEF domainmembrane proteinAFS85324.1CDS
57
47253AATATGATA.ATAACAACG0UnannotatedRegion
58
47307AATATGCGC.AAAGTATTA0OnProteinF_FTTC_TTT35Eco0422871852284922450EC042_2704putative signal transduction proteinCBG35537.1CDS
59
48258AATCAATGA.ATTTTCAAT0OnProteinI_IATA_ATT3EcoO104H4_209EL-207177764407067695O3O_25165COG0262 Dihydrofolate reductasedihydrofolate reductaseAFS88742.1CDS
60
52887AATCTTTTT.ACTGAATGA0OnProteinV_VGTA_GTG27EcoNA114964767333968938ECNA114_0920Hypothetical proteinAEG35743.1CDS
61
56603AATGGTCAT.TGCGTGACT1OnProteinK_RAAA_AGA67EcoETEC_H104072135498309702165ETEC_1977conserved hypothetical proteinCBJ01480.1CDS
62
57380AATGTGGAC.CATCGTGGA0UnannotatedRegion
63
57444AATGTGTTA.CAGGAATCC0UnannotatedRegion
64
57838AATTAAATG.TTAATTTCA0UnannotatedRegion
65
58333AATTACCGG.GTTGCAGGA1OnProteinA_EGAG_GCG306Eco0422422313284922070EC042_2315mdtCmultidrug resistance proteinCBG35149.1CDS
66
58449AATTACTGG.GGCGAAGTG1OnProteinG_RAGG_GGG68Eco536381315236188488068Ec53638_1350identified by match to protein family HMM PF00577fimbrial usher proteinEDU63171.1CDS
67
58883AATTCACAG.CGAGGAAAT1OnProteinA_DGAC_GCC124EcoO104H4_209EL-20712582074407065298O3O_12965COG3561 Phage anti-repressor proteinphage anti-repressor protein AntBAFS86345.1CDS
68
65394ACACGTCGC.GGTCCGCGT1OnProteinL_PCCG_CTG80EcoO104H4_209EL-20712381245407065514O3O_14050hypothetical proteinAFS86561.1CDS
69
65656ACACTGTTC.CGATTGCCC1OnProteinP_SCCG_TCG656Eco0424749922284924152EC042_4427uvrAUvrABC system protein A (excinuclease ABC subunit A)CBG37251.1CDS
70
66230ACAGATCAT.TATATGTCT1OnProteinL_VGTA_TTA94EcoO104H4_209EL-207178035407067695O3O_25165COG0262 Dihydrofolate reductasedihydrofolate reductaseAFS88742.1CDS
71
66340ACAGATTGA.GCTGGCGAA1OnProteinD_EGAG_GAT33Eco536383846708188491376Ec53638_3926identified by match to protein family HMM PF00005ABC transporter, ATP-binding proteinEDU66479.1CDS
72
67053ACAGCGTGA.AAGCTGGCA0OnProteinE_EGAA_GAG77EcoO104H4_209EL-20712385468407065511O3O_14035phage replication proteinAFS86558.1CDS
73
67258ACAGGAAAG.GATAAAACC0OnProteinR_RAGA_AGG139EcoO104H4_209EL-20712535550407065353O3O_13240hypothetical proteinAFS86400.1CDS
74
67573ACAGGCCAG.CTAAAGGGC0OnProteinR_RAGA_AGG45EcoO104H4_209EL-20712375231407065520O3O_14080terminase, endonuclease subunitAFS86567.1CDS
75
68056ACAGGTGCC.AAAAATTAG1OnProteinK_QAAA_CAA18EcoO104H4_209EL-20712367110407065533O3O_14145COG5301 Phage-related tail fibre proteinputative side tail fiber proteinAFS86580.1CDS
76
68232ACAGTAGCA.CGGATGTTG1OnProteinA_SGCG_TCG183EcoO104H4_209EL-20712364483407065534O3O_14150putative tail fiber assembly proteinAFS86581.1CDS
77
68961ACATATTAG.AACCAACCA0NotProteinCodingEco0422923451misc_featurenote.location=2922497..2927740
78
71392ACCACAAAA.CTGACGTTC0OnProteinS_SAGC_AGT268Eco536384536923188489810Ec53638_4683identified by match to protein family HMM PF01408; match to protein family HMM PF02894gfo/idh/mocA familyEDU64913.1CDS
79
71465ACCACACGG.CTGGCGTTG0OnProteinG_GGGA_GGG64EcoO104H4_209EL-20712385893407065510O3O_14030hypothetical proteinAFS86557.1CDS
80
72032ACCACGTGC.CCCTGTAGC0OnProteinG_GGGG_GGT232EcoIHE30342659408294491088ECOK1_2573identified by match to protein family HMM PF04860; match to protein family HMM TIGR01540phage portal protein, PBSX familyADE89844.1CDS
81
72381ACCAGAGCG.AGATAATCG1OnProteinF_STCC_TTC9Eco0421551330284921246EC042_1488phage endopeptidase/lysis proteinCBG34312.1CDS
82
73513ACCATGCCT.GAATATCCG1OnProteinL_PCCA_CTA87Eco0421156775284920831EC042_1072torCcytochrome c-type proteinCBG33894.1CDS
83
73620ACCATTACC.GTTATGCGT0OnProteinP_PCCC_CCG62EcoO104H4_209EL-20713230006407064652O3O_09670COG1794 Aspartate racemaseaspartate racemaseAFS85699.1CDS
84
73949ACCCACCCC.CCTGACGGA0OnProteinG_GGGC_GGG292EcoO104H4_209EL-20713213258407064670O3O_09760COG0188 Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunithypothetical proteinAFS85717.1CDS
85
74679ACCCGACAC.CATGTACGG0OnProteinT_TACA_ACG7Eco0425149649284924499EC042_4810conserved hypothetical proteinCBG37633.1CDS
86
75556ACCCTGGTC.CGGGTTACC0OnProteinR_RCGC_CGT23EcoO104H4_209EL-20713213519407064669O3O_09755hypothetical proteinAFS85716.1CDS
87
76481ACCGATCAC.CTAACCCGG0NotProteinCodingEcoO104H4_209EL-20712527407misc_featurenote.location=2526505..2528577
88
77828ACCGCTGGC.CGGAAAATC0OnProteinR_RCGC_CGT279EcoO104H4_209EL-20714633614407063356O3O_03115COG4227 Antirestriction proteinhypothetical proteinAFS84403.1CDS
89
77956ACCGCTTTC.GGATGTGCT0OnProteinP_PCCA_CCT55EcoO104H4_209EL-20712581108407065301O3O_12980hypothetical proteinAFS86348.1CDS
90
78567ACCGGCCCC.AGCCCAACG0OnProteinL_LCTA_CTT95EcoO104H4_209EL-20713213735407064669O3O_09755hypothetical proteinAFS85716.1CDS
91
81072ACCTGCGCG.AGAAAAAGC0UnannotatedRegion
92
81417ACCTGTTGG.GACGCTGAA1OnProteinA_VGCG_GTG165Eco5363816689921884899052.6.1.57Ec53638_1710tyrBidentified by match to protein family HMM PF00155aromatic-amino-acid transaminaseEDU65008.1CDS
93
82916ACGACCAGC.ATTGATCCG0OnProteinA_AGCC_GCT14EcoO104H4_209EL-20712385657407065511O3O_14035phage replication proteinAFS86558.1CDS
94
83188ACGACGTCT.AGCCACCGG0OnProteinL_LCTA_CTC52EcoO104H4_209EL-20712542607407065340O3O_13175NinE proteinAFS86387.1CDS
95
86029ACGCCTCCG.CGTTCACTC1OnProteinA_GGCC_GGC57EcoETEC_H104072168219309702205ETEC_2018bacteriophage regulatory protein. bacteriophage regulatory proteinDNA-binding transcriptional regulator prophage remnantCBJ01521.1CDS
96
87196ACGCTAACC.TTTCGCTAC1OnProteinH_RCAT_CGT18EcoO104H4_209EL-20712388798407065501O3O_13985hypothetical proteinAFS86548.1CDS
97
87509ACGCTGAAG.CGTATCAGA1OnProteinA_TACG_GCG218EcoIHE30342654404294492024ECOK1_2568identified by match to protein family HMM PF05840putative replication gene A proteinADE90780.1CDS
98
88192ACGGAAAAA.GCTTAATTG1OnProteinH_RCAT_CGT5EcoO104H4_209EL-20712387438407065506O3O_14010hypothetical proteinAFS86553.1CDS
99
88531ACGGATACC.CCGACGCAG1OnProteinA_TACC_GCC110EcoO104H4_209EL-20712368256407065531O3O_14135COG3948 Phage-related baseplate assembly proteinbaseplate assembly proteinAFS86578.1CDS
100
88734ACGGCAACC.CACATAACT1OnProteinA_VGCG_GTG100EcoETEC_H104072163462309702199ETEC_2012phage tail E family proteinCBJ01515.1CDS
101
90340ACGGTCGCA.CTGATGTTC1OnProteinP_SCCT_TCT248EcoO104H4_209EL-20712390376407065498O3O_13970COG0582 IntegraseintegraseAFS86545.1CDS
102
94136ACTCAGAAG.GGCATTATA0UnannotatedRegion
103
94234ACTCAGTAA.CAGCTATAC0OnProteinK_KAAA_AAG145EcoO104H4_209EL-20712582010407065298O3O_12965COG3561 Phage anti-repressor proteinphage anti-repressor protein AntBAFS86345.1CDS
104
95433ACTGAAAGA.CATGAACAG0OnProteinE_EGAA_GAG162EcoETEC_H104072155404309702193ETEC_2005HSimilar to Bacteriophage P2 h probable tail fiber protein (gph). UniProt:P26700 (669 aa) fasta scores: E()=4.2e-63, 49.746% id in 788 aa, and C-terminus is similar to Shigella sonnei. bv' UniProt:Q53813 (EMBL:D00660 (318 aa) fasta scores: E()=2.3e-122, 95.597% id in 318 aaprobable tail fiber protein (gph)CBJ01508.1CDS
105
96582ACTGCACTG.CCCCCGATC1OnProteinA_SGCC_TCC102Eco042789217284920475EC042_0714putative esterase/lipaseCBG33537.1CDS
106
97613ACTGGAAGC.TTCCATAAA0OnProteinA_AGCC_GCT46EcoETEC_H104072168187309702205ETEC_2018bacteriophage regulatory protein. bacteriophage regulatory proteinDNA-binding transcriptional regulator prophage remnantCBJ01521.1CDS
107
99689ACTGTTGGT.GCACTGCTG0OnProteinA_AGCA_GCG474EcoDH12448008260449499EcDH1_2274PFAM: tail fiber repeat containing protein; tail fiber repeat 2 protein; Tail Collar domain protein; Prophage tail fibre domain protein; KEGG: ssn:SSON_2410 phage protein-relatedProphage tail fibre domain proteinACX39921.1CDS
108
101732AGAAAATAC.TTGATGATA1OnProteinI_VATT_GTT37EcoO104H4_209EL-20712543424407065337O3O_13160Protein ninH from prophageAFS86384.1CDS
109
103161AGAACTGAA.TCTGATTAC0OnProteinN_NAAC_AAT155EcoO104H4_209EL-20712581980407065298O3O_12965COG3561 Phage anti-repressor proteinphage anti-repressor protein AntBAFS86345.1CDS
110
104279AGAATAGGG.AATTCCGCC0UnannotatedRegion
111
104464AGAATGACC.GGGGAGCCG1OnProteinL_QCAG_CTG355EcoABU839722300316307554037ECABU_c22930ibrAimmunoglobulin-binding regulator AADN46812.1CDS
112
105054AGACATAAA.ACGCCTTAA0UnannotatedRegion
113
105090AGACATCCC.GTCGCCCTC0OnProteinP_PCCC_CCT52EcoO104H4_209EL-20713216595407064666O3O_09740COG2301 Citrate lyase beta subunithypothetical proteinAFS85713.1CDS
114
105299AGACCCGGT.GCCCGTGCC0OnProteinV_VGTA_GTG95EcoO104H4_209EL-20712385414407065511O3O_14035phage replication proteinAFS86558.1CDS
115
105737AGACGGACC.CATTTTCAG0OnProteinP_PCCA_CCG119EcoETEC_H104072150895309702185ETEC_1997putative phage PS3CBJ01500.1CDS
116
106016AGACTCCCC.ATGAGCGGC0OnProteinI_IATC_ATT634EcoIHE30342655654294492024ECOK1_2568identified by match to protein family HMM PF05840putative replication gene A proteinADE90780.1CDS
117
110618AGATTTAAT.GTCACTCCG1OnProteinC_SAGT_TGT60EcoO104H4_209EL-20712542629407065340O3O_13175NinE proteinAFS86387.1CDS
118
112294AGCACTGAG.CGTATACCG1OnProteinA_TACG_GCG37EcoO104H4_209EL-20713190460407064691O3O_09880hypothetical proteinAFS85738.1CDS
119
113559AGCATGCCC.CCTGCGCTC0OnProteinG_GGGA_GGC653EcoO104H4_209EL-20712365203407065533O3O_14145COG5301 Phage-related tail fibre proteinputative side tail fiber proteinAFS86580.1CDS
120
114668AGCCATAAA.TAAGCAACC0UnannotatedRegion
121
114728AGCCATCAG.CACCTCCCA1OnProteinA_VGCC_GTC394EcoETEC_H104072164331309702201ETEC_2014phage tail sheath proteinCBJ01517.1CDS
122
117385AGCGAGTCA.CTTGGGAAG0OnProteinL_LCTG_TTG171EcoO104H4_209EL-20711543566407066321O3O_18175COG2207 AraC-type DNA-binding domain-containing proteinsputative DNA-binding protein, ARAC-typeAFS87368.1CDS
123
118995AGCGGCATT.CATCACCAT1OnProteinF_LTTA_TTC48EcoABU839729871323075527492.7.4.14ECABU_c09480cmkcytidylate kinaseADN45524.1CDS
124
120693AGCTCGCGT.ATTAGCAGA0OnProteinV_VGTA_GTG74EcoO104H4_209EL-20712581322407065300O3O_12975hypothetical proteinAFS86347.1CDS
125
121137AGCTGGCAA.TGAAGGTAA1OnProteinN_SAAT_AGT102Eco0425050095284924410EC042_4714cybCsoluble cytochrome b562CBG37534.1CDS
126
121209AGCTGGGCC.CAGGGCCCC0OnProteinP_PCCG_CCT367EcoO104H4_209EL-20712560392407065318O3O_13065putative tail fiber proteinAFS86365.1CDS
127
121519AGCTTCCTG.TTTGTAGCC0OnProteinK_KAAA_AAG84EcoO83H1_NRG857C1009208312945528NRG857_04640putative recombination proteinADR26355.1CDS
128
121811AGGAAAAAT.TGATACATA0NotProteinCodingShifl_2a_3013806674genedb_xref.location=3592993..3952918locus_tag.location=3592993..3952918
129
121811AGGAAAAAT.TGATACATA0NotProteinCodingShifl_2a_3013806674misc_featurenote.location=3806606..3836346
130
122877AGGATATTT.TTGCACGTC0UnannotatedRegion
131
123716AGGCAGGGT.ACGGACGTC0OnProteinV_VGTC_GTT26EcoO104H4_209EL-20712359364407065540O3O_14180COG3498 Phage tail tube protein FIIputative tail tube proteinAFS86587.1CDS
132
125518AGGCTTCTT.CTTCGCTTC1OnProteinE_KAAA_GAA53EcoO104H4_209EL-20712385542407065511O3O_14035phage replication proteinAFS86558.1CDS
133
125779AGGGAGGTC.CTATGTTCC0UnannotatedRegion
134
125844AGGGATCCG.ACGCGGCTG0OnProteinV_VGTA_GTC76EcoO104H4_209EL-20713214813407064668O3O_09750COG0561 Predicted hydrolases of the HAD superfamilyhypothetical proteinAFS85715.1CDS
135
126374AGGGGAAGA.TCGACATCG1OnProteinI_TACT_ATT92EcoO104H4_209EL-20713205405407064678O3O_09800COG2963 Transposase and inactivated derivativesIS66 transposaseAFS85725.1CDS
136
127204AGGTAGCGG.ACTGGAGCA1OnProteinA_VGCA_GTA188EcoO104H4_209EL-20712390197407065498O3O_13970COG0582 IntegraseintegraseAFS86545.1CDS
137
128865AGTAACATC.GCAAACTCA0OnProteinA_AGCA_GCT24EcoO104H4_209EL-20712389306407065499O3O_13975COG2944 Predicted transcriptional regulatorrepressor protein CAFS86546.1CDS
138
130133AGTCACGAC.TCACCCGGA1OnProteinH_RCAT_CGT165Eco536381844792188489254Ec53638_1874identified by match to protein family HMM PF03406putative phage proteinEDU64357.1CDS
139
130581AGTCCAGAG.AGTTTTACC1OnProteinA_SGCT_TCT60EcoO104H4_209EL-207149160407067734O3O_25360hypothetical proteinAFS88781.1CDS
140
137281ATAAGAAAA.GTGAAAACA0UnannotatedRegion
141
137579ATAAGGCAC.CTATCTCAC0UnannotatedRegion
142
139119ATACATTCA.CAGCTTAAC1OnProteinS_TACC_AGC292EcoNA1143558063333971535ECNA114_3501fliDFlagellar capping protein FliDAEG38340.1CDS
143
139478ATACCGGAG.GAAGAGAAA0OnProteinS_SAGC_AGT87EcoO104H4_209EL-20712581283407065300O3O_12975hypothetical proteinAFS86347.1CDS
144
139563ATACCTGCA.CCTAAATAA0UnannotatedRegion
145
139736ATACGCCGC.AGGACGGAA1OnProtein*_WTAG_TGG366EcoO104H4_209EL-20713233912407064645O3O_09635COG3969 Predicted phosphoadenosine phosphosulfate sulfotransferasehypothetical proteinAFS85692.1CDS
146
142127ATATAAGCG.ACGTTCACC0OnProteinV_VGTA_GTG166Eco042680538284920374EC042_0610phePphenylalanine-specific permeaseCBG33435.1CDS
147
142345ATATATAAC.AAAAAGAGC0OnProteinT_TACA_ACT181EcoO104H4_209EL-20712364377407065535O3O_14155COG1045 Serine acetyltransferaseputative acetyltransferaseAFS86582.1CDS
148
143466ATATGGTGA.GGAATGCCG1OnProteinD_EGAG_GAT89EcoNA114921458333968887ECNA114_0869Hypothetical proteinAEG35692.1CDS
149
144786ATCAAATAA.CCCTGAAGA0OnProteinG_GGGA_GGG24EcoO104H4_209EL-20712386013407065510O3O_14030hypothetical proteinAFS86557.1CDS
150
146894ATCAGCATC.AAACATTCC1OnProteinF_LTTA_TTC743EcoKO11FL890919323377234EKO11_0853KEGG: eoh:ECO103_3453 putative selenate reductase subunit YgfK; TIGRFAM: selenate reductase YgfK; PFAM: FAD-dependent pyridine nucleotide-disulphide oxidoreductaseselenate reductase YgfKADX49502.1CDS
151
147961ATCATGATG.GCTTTTAAC0OnProteinA_AGCA_GCG95EcoO104H4_209EL-20713216724407064666O3O_09740COG2301 Citrate lyase beta subunithypothetical proteinAFS85713.1CDS
152
148735ATCCAGATT.AACATCTCA1OnProteinF_LTTA_TTT31EcoCloneDi21307299355419698i02_1315hypothetical proteinAER83895.1CDS
153
150442ATCCGCTTT.CTGACAGCG1OnProteinR_SAGG_AGT59EcoETEC_H104072163068309702198ETEC_2011putative tail proteinCBJ01514.1CDS
154
150696ATCCGGGAT.GAAGAAATA0OnProteinS_STCG_TCT42EcoO104H4_209EL-20713229946407064652O3O_09670COG1794 Aspartate racemaseaspartate racemaseAFS85699.1CDS
155
150888ATCCGTCAG.GTGACACTG0OnProteinS_SAGC_AGT129EcoETEC_H104072152951309702189ETEC_2001putative phage baseplate assembly protein VCBJ01504.1CDS
156
152670ATCGCCAAC.TGTATAATC0OnProteinL_LCTG_TTG39Eco042743859284920435EC042_0670mrdApenicillin-binding protein 2CBG33496.1CDS
157
156122ATCTGATGT.AAAACCTTC0OnProteinV_VGTA_GTT37EcoO104H4_209EL-20712542562407065340O3O_13175NinE proteinAFS86387.1CDS
158
158202ATGAATTAG.GCATCATCA0OnProteinA_AGCA_GCG35EcoO55H7_RM12579304362374357194ECO55CA74_01345hypothetical proteinAEZ38901.1CDS
159
158547ATGACGAGT.TCGTAGGCC0OnProteinE_EGAA_GAG168EcoO104H4_209EL-20712581941407065298O3O_12965COG3561 Phage anti-repressor proteinphage anti-repressor protein AntBAFS86345.1CDS
160
159439ATGATGCAG.CGCAGAACA1OnProteinA_SGCG_TCG524Eco5363840037821884896032.7.10.1Ec53638_4096wzcidentified by match to protein family HMM PF02706; match to protein family HMM TIGR01007tyrosine-protein kinase wzcEDU64706.1CDS
161
162464ATGGAGAGA.CCGATTTGA1OnProteinN_SAAC_AGC28EcoABU839721367952307553145ECABU_c13590hypothetical protein encoded by prophageADN45920.1CDS
162
164746ATGTAGCTG.CAGGGCCCC1OnProteinC_WTGC_TGG91EcoP12b496679383101841P12B_c0469hypothetical proteinAFG39350.1CDS
163
166170ATTAAAATT.ATTTCATGA0OnProteinI_IATC_ATT368Eco0423998917284923502EC042_3773conserved hypothetical proteinCBG36597.1CDS
164
166253ATTAAAGAC.CACCTCGCG0OnProteinT_TACA_ACG145EcoO104H4_209EL-20713212817407064670O3O_09760COG0188 Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunithypothetical proteinAFS85717.1CDS
165
166861ATTAATAGG.TCTGATGAA0OnProteinE_EGAA_GAG30Eco0421972370284921640EC042_1886arpBputative type III effector protein (ankyrin repeat protein B)CBG34712.1CDS
166
168341ATTATAAAG.TCTAAGAAT0UnannotatedRegion
167
168411ATTATATTC.TCGCACGCC0UnannotatedRegion
168
169985ATTCATCAA.AAATATATC1OnProteinF_IATT_TTT39EcoO104H4_209EL-20712355689407065543O3O_14195hypothetical proteinAFS86590.1CDS
169
170394ATTCCCCTT.TACCTGTCC1OnProteinK_QAAA_CAA159EcoIHE30342751852294490080ECOK1_2677identified by match to protein family HMM PF04447; match to protein family HMM PF04448conserved hypothetical proteinADE88836.1CDS
170
170688ATTCCTTAA.TGTCATATC0UnannotatedRegion
171
173056ATTGCCGGA.TAACAAAAA0UnannotatedRegion
172
173341ATTGCGTGA.GCCTTCCAG0OnProteinA_AGCA_GCC10EcoABU839724043923307555644ECABU_c39800hypothetical proteinADN48419.1CDS
173
174447ATTGTGTGC.CTTTGGGTG1OnProteinA_TACT_GCT67EcoO104H4_209EL-20713260121407064624O3O_09530hypothetical proteinAFS85671.1CDS
174
174681ATTTAAAAT.AATATATTA0UnannotatedRegion
175
175419ATTTATTCA.AAGCTTGCA0UnannotatedRegion
176
176006ATTTCTCGC.TTATTATCC1OnProteinL_PCCT_CTT23EcoUMNK883599896332344925UMNK88_3723hypothetical proteinAEE58259.1CDS
177
177365ATTTTGCAG.GTCTCTAAA0OnProteinT_TACA_ACT720Eco53638234627188489187Ec53638_0275torSidentified by match to protein family HMM PF00072; match to protein family HMM PF00512; match to protein family HMM PF00672; match to protein family HMM PF01627; match to protein family HMM PF02518; match to protein family HMM TIGR02956sensor histidine kinase/response regulator TorSEDU64290.1CDS
178
177833ATTTTTGCA.TAAGCAGCG0OnProteinL_LCTA_TTA77EcoO104H4_209EL-20712388622407065501O3O_13985hypothetical proteinAFS86548.1CDS
179
179606CAAACGCAT.CGCAAAAAA0OnProteinI_IATC_ATT11EcoO104H4_209EL-20712389667407065498O3O_13970COG0582 IntegraseintegraseAFS86545.1CDS
180
181102CAAATCCAT.CCCGCCAGC0OnProteinG_GGGA_GGC43Eco0425170043284924513EC042_4826gntPhigh-affinity gluconate transporterCBG37650.1CDS
181
183513CAACGAATG.CAATTAATC1OnProtein*_WTGA_TGG37EcoO7K1_CE105030807349740883CE10_4923yjfLinner membrane protein, UPF0719 familyAEQ15589.1CDS
182
184116CAACGGCGA.AGGGGAGCG1OnProteinI_VATC_GTC313EcoO104H4_209EL-20713214387407064669O3O_09755hypothetical proteinAFS85716.1CDS
183
184275CAACGGTTG.CGGATGCGG1OnProteinA_VGCC_GTC10EcoKO11FL1849396323378102EKO11_1751manually curated; KEGG: sfv:SFV_2308 ribonucleotide-diphosphate reductase subunit betahypothetical proteinADX50370.1CDS
184
185940CAAGGAGCG.AGCGACTGG0UnannotatedRegion
185
188250CAATGCTGC.GGATGCGGC0UnannotatedRegion
186
188696CAATTCACA.CCGAGGAAA1OnProteinA_TACC_GCC124EcoO104H4_209EL-20712582075407065298O3O_12965COG3561 Phage anti-repressor proteinphage anti-repressor protein AntBAFS86345.1CDS
187
189434CACAAGAAT.CGGTCAAAC0OnProteinR_RCGC_CGT347EcoO104H4_209EL-20713214491407064669O3O_09755hypothetical proteinAFS85716.1CDS
188
192257CACCAGCCC.TCATTCATC0OnProteinD_DGAC_GAT538Eco0424053526284923549EC042_3819putative outer membrane assembly proteinCBG36644.1CDS
189
192991CACCATGGT.ATGGCCACA0OnProteinV_V_VGTA_GTG_GTT149EcoETEC_H104072155365309702193ETEC_2005HSimilar to Bacteriophage P2 h probable tail fiber protein (gph). UniProt:P26700 (669 aa) fasta scores: E()=4.2e-63, 49.746% id in 788 aa, and C-terminus is similar to Shigella sonnei. bv' UniProt:Q53813 (EMBL:D00660 (318 aa) fasta scores: E()=2.3e-122, 95.597% id in 318 aaprobable tail fiber protein (gph)CBJ01508.1CDS
190
193489CACCCGCAA.CTTGACCGC0OnProteinN_NAAC_AAT89EcoBL21DE33723945242379196B21_03483aec79aec79CAQ34000.1CDS
191
194945CACCGGTAG.ATTCCTTAA0UnannotatedRegion
192
195751CACCTGGCT.ATTGAAAAA0OnProteinL_LCTA_CTG53EcoO104H4_209EL-20712388692407065501O3O_13985hypothetical proteinAFS86548.1CDS
193
198373CACGTCATT.ACGCGGGTC0OnProteinV_VGTC_GTT54EcoETEC_H104072165350309702201ETEC_2014phage tail sheath proteinCBJ01517.1CDS
194
199495CACTGCCAC.GCAGGATCC1OnProteinL_QCAG_CTG35Eco042518316284920244EC042_0472putative lipoproteinCBG33303.1CDS
195
200828CAGAAACAT.AAAATCGGG1OnProteinH_YCAT_TAT7Eco0426854842849203796.3.-.-EC042_0615ybdKcarboxylate-amine ligaseCBG33440.1CDS
196
205278CAGCACTGA.ACGTATACC0OnProteinE_EGAA_GAG36EcoABU839721261123307553029ECABU_c12400hypothetical proteinADN45804.1CDS
197
206669CAGCATTGC.GCAAATACG0OnProteinA_AGCA_GCG89EcoO104H4_209EL-20712385818407065510O3O_14030hypothetical proteinAFS86557.1CDS
198
207903CAGCCGGTT.TAACGCTGC1OnProteinF_YTAT_TTT640EcoO104H4_209EL-20712365243407065533O3O_14145COG5301 Phage-related tail fibre proteinputative side tail fiber proteinAFS86580.1CDS
199
211403CAGCTCGCG.AATTAGCAG1OnProteinA_VGCA_GTA74EcoO104H4_209EL-20712581323407065300O3O_12975hypothetical proteinAFS86347.1CDS
200
211659CAGCTGGCC.AGCATATGG0OnProteinL_LCTC_CTT108EcoO104H4_209EL-20713214909407064668O3O_09750COG0561 Predicted hydrolases of the HAD superfamilyhypothetical proteinAFS85715.1CDS
201
212097CAGGAAAAC.CTGTTGATG0NotProteinCodingEcoO104H4_209EL-20712526741misc_featurenote.location=2526505..2528577
202
213625CAGGCGCTG.GAATCGATA0OnProteinS_STCC_TCT193EcoO104H4_209EL-20713878275407064033O3O_06555COG3696 Putative silver efflux pumpcopper/silver efflux system, membrane componentAFS85080.1CDS