6
6
7
7
8
8
9
- # Workshop materials
9
+ # Materiales del seminario
10
10
11
11
* All slides, code and data are available at
12
12
[ http://github.com/PoonLab/courses ] ( http://github.com/PoonLab/courses )
43
43
44
44
45
45
46
- # Let's do a little UNIX
46
+ # Hagamos un poco de UNIX
47
47
48
48
```
49
-
49
+ $ pwd
50
+ /Users/art/git/courses/Cuernavaca
51
+ $ ls
52
+ data scripts slides
53
+ $ mkdir sandbox
50
54
```
51
55
52
56
60
64
61
65
62
66
63
- # The Illumina data files
67
+ # Los archivos de datos Illumina
64
68
65
69
* Raw data files are too large to allow these to accumulate uncompressed
66
70
* Compression reduces file size by about X fold
@@ -73,7 +77,7 @@ gunzip --keep data/Zika-envelope.n1E4.R1.fastq.gz
73
77
74
78
75
79
76
- # FASTQ format
80
+ # Formato FASTQ
77
81
78
82
```
79
83
% head -n4 data/Zika-envelope.n1E4.R1.fastq
@@ -91,7 +95,7 @@ CCGGGATCTTGTTGATTGTGAACGCTGCGGTACCTAAGGATGACACGCCTTTCAATCCATGTTTGTCCGTT
91
95
92
96
93
97
94
- # Quality scores
98
+ # Puntuaciones de calidad
95
99
96
100
* An empirical estimate of the probability that the base call is incorrect:
97
101
@@ -102,7 +106,7 @@ CCGGGATCTTGTTGATTGTGAACGCTGCGGTACCTAAGGATGACACGCCTTTCAATCCATGTTTGTCCGTT
102
106
103
107
104
108
105
- # Clinical significance of error rates
109
+ # Significaci & oacute ; n cl & iacute ; nica
106
110
107
111
* Every subtype B sample in a MiSeq run had a substantial frequency (~ 3%) of resistance mutation E138A
108
112
* Quality scores were fine!
@@ -388,13 +392,13 @@ python scripts/slice-sam.py -refname NC_012532.1 \
388
392
389
393
390
394
391
- # Much better !
395
+ # & iexcl ; Mucho mejor !
392
396
393
397
![ ] ( slice-1700-2000.png )
394
398
395
399
396
400
397
- # Now we can study evolution
401
+ # Ahora podemos estudiar la evoluci & oacute ; n
398
402
399
403
```
400
404
mafft sandbox/slice-1700-2000.fa > \
0 commit comments