@@ -9,12 +9,6 @@ by iteratively creating alignments using a divide-and-conquer strategy of the ML
9
9
tree from the previous iteration, and then computing a new ML tree on the new
10
10
alignment.
11
11
12
- Currently RAxML is used for tree inference from aligned sequences. The 'GTRMIX'
13
- model option in RAxML (searching under the 'GTRCAT', but scoring the final tree
14
- under 'GTRGAMMA') is used for DNA sequences, while the 'PROTMIXWAGF' model is
15
- used for amino acid sequences.
16
-
17
-
18
12
The references for the algorithmic approach are:
19
13
20
14
Kevin Liu, Sindhu Raghavan, Serita Nelesen, C. Randal Linder, and Tandy
@@ -71,14 +65,21 @@ Starting conditions
71
65
If you give SATe a starting tree, it will go directly to the iterative portion
72
66
of the algorithm.
73
67
74
- If you do NOT give it a starting tree sequences, then SATe will use RAxML to
75
- infer the initial tree. This requires an alignment. If all of the input
76
- sequences are of the same length, then SATe will assume that you are providing
77
- it with an alignment matrix; it will realign the data during the course of the
78
- algorithm, but the initial tree search will be conducted on the alignment that
79
- you supply. If your initial sequences do not have the same length and you do
80
- not supply a tree, then SATe will use the alignment tool that you have selected
81
- to produce an initial alignment for the entire dataset (this can be slow).
68
+ If you do NOT give it a starting tree, then SATe will use the specified "Tree
69
+ estimator" external tool to infer the initial tree. This requires an alignment.
70
+ If all of the input sequences are of the same length, then SATe will assume
71
+ that you are providing it with an alignment matrix; it will realign the data
72
+ during the course of the algorithm, but the initial tree search will be
73
+ conducted on the alignment that you supply. If your initial sequences do not
74
+ have the same length and you do not supply a tree, then SATe will use the
75
+ alignment tool that you have selected to produce an initial alignment for the
76
+ entire dataset (this can be slow).
77
+
78
+ If the initial alignment is very slow, you might want to use the PartTree tool
79
+ in MAFFT (http://bioinformatics.oxfordjournals.org/content/23/3/372.abstract) to
80
+ estimate a rough starting tree. By providing SATe with the tree estimated by
81
+ PartTree, your analysis will bypass the initial alignment/tree-search, and will
82
+ immediately begin the first iteration of the SATe algorithm.
82
83
83
84
Soon, we will implement an option that allows you to specify an aligner for
84
85
the initial alignment operation and a different aligner for the subproblem
@@ -105,8 +106,12 @@ tools used for each step:
105
106
* "Merger" is used to select the multiple sequence alignment tool used to
106
107
merge the alignments of subproblems into a larger alignment.
107
108
108
- * "Tree estimator" will allow you to choose the software for tree inference
109
- from a fixed alignment (currently only RAxML is supported).
109
+ * "Tree Estimator" will allow you to choose the software for tree inference
110
+ from a fixed alignment.
111
+
112
+ * "Model" allows you to select the substitution model that will be used
113
+ by the tree estimator during tree inference. The options in the drop down
114
+ are contingent on the specified "Tree Estimator".
110
115
111
116
###############################
112
117
Sequences and Tree (lower left)
0 commit comments