1
- # 'An alignment of HIV1, subtype B sequences
1
+ # ' An alignment of HIV1, subtype B sequences
2
2
# '
3
- # 'A dataset containing 10 HIV1, subtype B polymerase sequences collected in Northern Alberta Canada.
4
- # 'This a 10 sequence sample from popset# 1033910942 on NCBI's genbank Archive
3
+ # ' A dataset containing 10 HIV1, subtype B polymerase sequences collected in Northern
4
+ # ' Alberta Canada. This a 10 sequence sample from popset# 1033910942 on NCBI's genbank
5
+ # ' Archive. The sequence headers also include meta-data for sequences.
5
6
# '
6
- # ' @format An ape DNA object: 10 DNA sequences in binary format stored in a list. All sequences of same length: 1017
7
+ # ' @format An ape DNA object: 10 DNA sequences in binary format stored in a list.
8
+ # ' All sequences of same length: 1017
7
9
# ' @source \url{ https://www.ncbi.nlm.nih.gov/popset?DbFrom=nuccore&Cmd=Link&LinkName=nuccore_popset&IdsFromResult=1033912042 }
8
10
" alignment.ex"
9
11
10
- # 'An example set of sequence meta.data corresponding to alignment.ex
12
+ # ' An example set of sequence meta.data corresponding to alignment.ex
11
13
# '
12
- # 'A dataset describing 10 different HIV1 pol sequences collected in Northern Alberta Canada.
14
+ # ' Built from alignment.ex, the example 10 sequence alignment using pull.headers.
15
+ # ' The date of each sequence's collection, it's genbank unique accession ID, and
16
+ # ' sequence subtype are referenced within the header
13
17
# '
14
- # ' @format A data.table object with 9 variables:
18
+ # ' @format A data.table object with 4 variables:
15
19
# ' \describe{
16
20
# ' \item{ID}{Accession IDs (characters) of sequences}
17
21
# ' \item{CollectionDate}{Collection date of sequences. Full dates given as yyyy-mm-dd}
18
- # ' \item{Subtype}{Subtypes (factors) within a cluster}
19
- # ' \item{Header}{The original headers from the alignement. This matches meta data to sequences}
22
+ # ' \item{Subtype}{Subtypes (factors) of sequences}
23
+ # ' \item{Header}{The original headers from the alignement. This matches meta
24
+ # ' data to sequences in original alignment}
20
25
# ' }
21
26
" seq.info.ex"
22
27
23
- # 'An example set of clusters, built using component.cluster
28
+ # ' An example set of clusters, built using component.cluster
24
29
# '
25
- # 'A dataset describing 5 different clusters. Their member headers are listed, as well as the growth they experienced
26
- # '(ie. the number of new sequences forming clusters with old sequences.). See component.cluster for further information on
27
- # 'how these were assigned based on graph.ex as an input
30
+ # ' A dataset describing 5 different clusters. The headers (from alignment.ex), and
31
+ # ' associated meta data (from seq.info.ex) of cluster members is captured, as well
32
+ # ' as several cluster-level traits, such as growth and size. See component.cluster
33
+ # ' or further information onhow these were assigned based on graph.ex as an input
28
34
# '
29
35
# ' @format A data.table object with 9 variables:
30
36
# ' \describe{
31
- # ' \item{ClusterID}{ The unique identifier number for this cluster. A numberic}
32
- # ' \item{ID}{A list of vectors, each containing the accession IDs (characters) of sequences within a cluster}
33
- # ' \item{CollectionDate}{A list of vectors, each containing the collection date of sequences within a cluster}
34
- # ' \item{Subtype}{A list of vectors, each containing the subtypes (factors) within a cluster}
35
- # ' \item{Header}{A list of vectors, each containing the original headers from the alignement used to build this set of clusters}
36
- # ' \item{Size}{The original size of this cluster before being updated with new cases. This simply the number of sequences within the cluster}
37
+ # ' \item{ClusterID}{ The unique identifier number for this cluster. A numeric}
38
+ # ' \item{ID}{A list of vectors, each containing the accession IDs (characters)
39
+ # ' of sequences within a cluster}
40
+ # ' \item{CollectionDate}{A list of vectors, each containing the collection date
41
+ # ' of sequences within a cluster}
42
+ # ' \item{Subtype}{A list of vectors, each containing the subtypes (factors) within
43
+ # ' a cluster}
44
+ # ' \item{Header}{A list of vectors, each containing the original headers from the
45
+ # ' alignement used to build this set of clusters}
46
+ # ' \item{Size}{The original size of this cluster before being updated with new cases.
47
+ # ' This simply the number of sequences within the cluster}
37
48
# ' \item{Growth}{The growth of the cluster after new cases are added}
38
- # ' \item{DistThresh}{The pairwise distance threshold used to create this complete set of clusters. Corresponds to a setID as an input parameter}
49
+ # ' \item{DistThresh}{The pairwise distance threshold used to create this complete
50
+ # ' set of clusters. Corresponds to a setID as an input parameter}
39
51
# ' \item{SetID}{The unique identifier for this set of clusters. A numeric}
40
52
# ' }
41
53
" cluster.ex"
42
54
43
- # 'An example graph, built based on pairwise TN93 distances
55
+ # ' An example graph, built based on pairwise TN93 distances
44
56
# '
45
- # 'This implementation of a graph is a list, describing a set of sequences and the distances between them.
46
- # 'See create.graph for more information on how this graph was created using alignment.ex as input
57
+ # ' This implementation of a graph is a list, describing a set of sequences and the
58
+ # ' distances between them. See create.graph for more information on how this graph
59
+ # ' was created using alignment.ex as input
47
60
# '
48
61
# ' @format A list of 3 variables
49
62
# ' \describe{
56
69
# ' }
57
70
" graph.ex"
58
71
59
- # 'A tree built based on a subset of alignment.ex
72
+ # ' A tree built based on a subset of alignment.ex
60
73
# '
61
- # 'This is a maximum likelyhood tree built using IQ-TREE with model selection and 1000 parametric bootstraps.
62
- # 'The log information for this tree is stored in data/IQTREE_log_ex.txt. A subset of six older sequences
63
- # '(collected before January 1st 2012) from alignment.ex was used to construct this tree
74
+ # ' A maximum likelihood tree built using IQ-TREE with model selection and 1000
75
+ # ' parametric bootstraps. The log information for this tree is stored in data/IQTREE_log_ex.txt.
76
+ # ' A subset of six older sequences (collected before January 1st 2012) from alignment.ex
77
+ # ' was used to construct this tree.
64
78
# '
65
79
# '
66
- # ' @format An unrooted, phylogenetic tree with 6 tips and 4 internal nodes. Node labels represent certainty
67
- # ' See ape's implementation of phylogenetic tree objects for information about tags within this object
80
+ # ' @format An unrooted, phylogenetic tree with 6 tips and 4 internal nodes.
81
+ # ' Node labels represent certainty. See ape's implementation of phylogenetic tree
82
+ # ' objects for information about tags within this object
68
83
" old.tree.ex"
69
84
70
- # 'A tree built from alignment.ex
85
+ # ' A tree built from alignment.ex
71
86
# '
72
- # 'This is a maximum likelihood tree built using IQ-TREE with automatic model selection and 1000 parametric bootstraps.
87
+ # ' This is a maximum likelihood tree built using IQ-TREE with automatic model
88
+ # ' selection and 1000 parametric bootstraps. Contrasting old.tree.ex. This is a
89
+ # ' complete tree containing all sequences in alignment.ex
73
90
# '
74
- # ' @format An unrooted, phylogenetic tree with 10 tips and 8 internal nodes. Node labels represent certainty
75
- # ' See ape's implementation of phylogenetic tree objects for information about tags within this object
91
+ # ' @format An unrooted, phylogenetic tree with 10 tips and 8 internal nodes. Node
92
+ # ' labels represent certainty. See ape's implementation of phylogenetic tree objects
93
+ # ' for information about tags within this object.
76
94
" full.tree.ex"
77
95
78
- # 'An extension of an ape tree object which can be used to create clusters
96
+ # ' An extension of an ape tree object which can be used to create clusters
79
97
# '
80
- # 'This is a maximum likelihood tree built using IQ-TREE with automatic model selection and 1000 parametric bootstraps.
81
- # 'Additional functions within tree.setup.R were used to annotate information useful for clustering
98
+ # ' An extension of old.tree.ex maximum likelihood tree built using IQ-TREE with automatic
99
+ # ' model selection and 1000 parametric bootstraps. growth information and additional
100
+ # ' information useful for cluster identification were added by extend.tree.
82
101
# '
83
- # ' @format A , phylogenetic tree with 6 tips and 4 internal nodes. Node labels represent certainty
84
- # ' See ape's implementation of phylogenetic tree objects for information about tags within this object.
85
- # ' In addition, there are 4 new objects created by functions within tree.setup.R
102
+ # ' @format A phylogenetic tree with 6 tips and 4 internal nodes. Node labels represent
103
+ # ' certainty. See ape's implementation of phylogenetic tree objects for information
104
+ # ' about tags within this object. In addition, there are 4 new objects created by
105
+ # ' functions within tree.setup.R
86
106
# ' \describe{
87
107
# ' \item{seq.info}{ See seq.info.ex, a data.table containing sequence meta data}
88
- # ' \item{node.info}{ Grouping of the meta.data present in seq.info assigned to various nodes in the tree,
89
- # ' coupled with information important to clustering, such as mean divergence from root, or node certainty }
90
- # ' \item{path.info}{ Information regarding the path of edges from tips to the root of the tree.
91
- # ' This is also necessary for some clustering algorithms, specifically step.cluster}
92
- # ' \item{growth.info}{ a data.table pairing new sequences, to a single node in the tree based on placements assigned by guppy and pplacer.}
108
+ # ' \item{node.info}{ Grouping of the meta.data present in seq.info assigned to
109
+ # ' various nodes in the tree, coupled with information important to clustering,
110
+ # ' such as mean divergence from root, or node certainty }
111
+ # ' \item{path.info}{ Information regarding the path of edges from tips to the root
112
+ # ' of the tree. This is also necessary for some clustering algorithms, specifically
113
+ # ' step.cluster}
114
+ # ' \item{growth.info}{ a data.table pairing new sequences, to a single node in the
115
+ # ' tree based on placements assigned by guppy and pplacer. The certainty of this placement,
116
+ # ' terminal branch length, neighbour, and branch length from new internal node to
117
+ # ' new neighbour are described}
93
118
# ' }
94
119
" extended.tree.ex"
0 commit comments