Skip to content

Commit c2a165d

Browse files
committed
Updated README: incorporated Tandy's comments and added blurb about using
PartTree if the initial alignment is slow.
1 parent 6a47f2b commit c2a165d

File tree

1 file changed

+21
-16
lines changed

1 file changed

+21
-16
lines changed

Diff for: doc/README.txt

+21-16
Original file line numberDiff line numberDiff line change
@@ -9,12 +9,6 @@ by iteratively creating alignments using a divide-and-conquer strategy of the ML
99
tree from the previous iteration, and then computing a new ML tree on the new
1010
alignment.
1111

12-
Currently RAxML is used for tree inference from aligned sequences. The 'GTRMIX'
13-
model option in RAxML (searching under the 'GTRCAT', but scoring the final tree
14-
under 'GTRGAMMA') is used for DNA sequences, while the 'PROTMIXWAGF' model is
15-
used for amino acid sequences.
16-
17-
1812
The references for the algorithmic approach are:
1913

2014
Kevin Liu, Sindhu Raghavan, Serita Nelesen, C. Randal Linder, and Tandy
@@ -71,14 +65,21 @@ Starting conditions
7165
If you give SATe a starting tree, it will go directly to the iterative portion
7266
of the algorithm.
7367

74-
If you do NOT give it a starting tree sequences, then SATe will use RAxML to
75-
infer the initial tree. This requires an alignment. If all of the input
76-
sequences are of the same length, then SATe will assume that you are providing
77-
it with an alignment matrix; it will realign the data during the course of the
78-
algorithm, but the initial tree search will be conducted on the alignment that
79-
you supply. If your initial sequences do not have the same length and you do
80-
not supply a tree, then SATe will use the alignment tool that you have selected
81-
to produce an initial alignment for the entire dataset (this can be slow).
68+
If you do NOT give it a starting tree, then SATe will use the specified "Tree
69+
estimator" external tool to infer the initial tree. This requires an alignment.
70+
If all of the input sequences are of the same length, then SATe will assume
71+
that you are providing it with an alignment matrix; it will realign the data
72+
during the course of the algorithm, but the initial tree search will be
73+
conducted on the alignment that you supply. If your initial sequences do not
74+
have the same length and you do not supply a tree, then SATe will use the
75+
alignment tool that you have selected to produce an initial alignment for the
76+
entire dataset (this can be slow).
77+
78+
If the initial alignment is very slow, you might want to use the PartTree tool
79+
in MAFFT (http://bioinformatics.oxfordjournals.org/content/23/3/372.abstract) to
80+
estimate a rough starting tree. By providing SATe with the tree estimated by
81+
PartTree, your analysis will bypass the initial alignment/tree-search, and will
82+
immediately begin the first iteration of the SATe algorithm.
8283

8384
Soon, we will implement an option that allows you to specify an aligner for
8485
the initial alignment operation and a different aligner for the subproblem
@@ -105,8 +106,12 @@ tools used for each step:
105106
* "Merger" is used to select the multiple sequence alignment tool used to
106107
merge the alignments of subproblems into a larger alignment.
107108

108-
* "Tree estimator" will allow you to choose the software for tree inference
109-
from a fixed alignment (currently only RAxML is supported).
109+
* "Tree Estimator" will allow you to choose the software for tree inference
110+
from a fixed alignment.
111+
112+
* "Model" allows you to select the substitution model that will be used
113+
by the tree estimator during tree inference. The options in the drop down
114+
are contingent on the specified "Tree Estimator".
110115

111116
###############################
112117
Sequences and Tree (lower left)

0 commit comments

Comments
 (0)