In this dialog box you can select and view desired options in the Options Summary. Options are organized in logical sections. A yellow row indicates that you have a choice for that particular attribute. The primary sets of options available in this dialog box are:
Clock Settings
The RelTime analysis provides several options that are specific to divergence time estimation.
Clock Type
A RelTime analysis assumes that there is no global clock. You cannot change this parameter.
Clock Stringency
This selects how much separation clocks need to have before they can be merged. All Clocks keeps all of the clocks without merging any. Many Clocks merges all clocks which are within 1 standard deviation of each other. Few merges within 2 standard deviations, and Fewer merges within 3 standard deviations.
Variance Estimation Method
This offers the same options as the Phylogeny Test option in regular ML Tree construction (Bootstrap and Analytical). For RelTime trees, MEGA will use the selected option to calculate the variance for estimated divergence times.
Phylogeny Test and Options
To assess the reliability of a phylogenetic tree, MEGA provides the Bootstrap test. This uses the bootstrap re-sampling strategy, so you need to enter the number of replicates.
Substitution Model
In this set of options, you can choose various attributes of the substitution models for DNA and protein sequences.
Substitutions Type
Here you may select a substitutions type of Nucleotide, Syn-Nonsynonymous, or Amino Acid. The selection in this row affects the available models in the model row.
Model
Here you select a stochastic model for estimating evolutionary distance by clicking on the row then selecting a model for the current Substitutions Type.
Substitutions to Include
Depending on the distance model or method selected, the evolutionary distance can be teased into two or more components. By clicking on the row, you will be provided with a list of components relevant to the chosen model.
Transition /Transversion Ratio
This option will be visible if the chosen model requires you to provide a value for the Transition /Transversion ratio (R) .
Pattern among Lineages
This option becomes available if the selected model has formulas that allow the relaxation of the assumption of homogeneity of substitution patterns among lineages.
Rates among Sites
This option becomes available if the selected distance model has formulas that allow rate variation among sites. If you choose gamma-distributed rates, then an option to specify the number of discrete categories for modeling the gamma distribution becomes visible.
Data Subset to Use
These are options for handling gaps and missing data, including or excluding codon positions, and restricting the analysis to labeled sites , if applicable.
Gaps and Missing Data
You may choose to remove all sites containing alignment gaps and missing information before the calculation begins (Complete-deletion option). Alternatively, you may choose to retain all such sites initially, excluding them as necessary in the pairwise distance estimation (Pairwise-deletion option), or you may use Partial Deletion (Site coverage) as a percentage.
Codon Positions
Check or uncheck the boxes for any combination of 1st, 2nd, 3rd, and non-coding positions for analysis. This option is available only if the nucleotide sequences contain protein-coding regions and you have selected a nucleotide-by-nucleotide analysis.
This option is available only if some or all of the sites have associated labels. By clicking on the row, you will be provided with the option of including sites with selected labels. If you choose to include only labeled sites , then these sites will be the first extracted from the data. Then all other options mentioned above will be enforced. Note that labels associated with all three positions in the codon must be included for a full codon to be incorporated in the analysis.
Tree Inference Options
ML Heuristic Method
There are two options available Nearest-Neighbor-Interchange (NNI) or Subtree-Pruning-Regrafting (SPR) . For SPR, two search levels are available; search level 3, which searches less exhaustively and search level 5 which performs an extensive search of the tree space.
Initial Tree for ML
The initial tree for ML can be supplied in several ways. Mega can simply create it on the fly using Neighbor Joining (NJ), BioNJ, Maximum Parsimony, or try both NJ and BioNJ and select the superior tree or you may specify your own tree by providing a newick file.
For analyses that use a Maximum Likelihood approach, the ‘branch swap filter’ analysis setting is used to set the stringency of optimization with respect to branch lengths and improvements in log likelihood values. A weaker filter will result in more exhaustive optimization so that the search may take longer but potentially, a larger search space will be explored. A stronger filter will result in less exhaustive optimization so that the search may require less time but potentially, a smaller search space will be explored.
System Resource Usage
Number of Threads
Beginning in MEGA5.1, multithreaded ML phylogeny inference can be performed on computers that have multiple CPUs or cores. If MEGA detects multiple processing resources, an option for setting the number of search threads will be provided in the analysis preferences dialog. However, this option is only available when running a single ML inference analysis. If you wish to run multiple ML based analyses simultaneously, set the number of threads to ‘1’ when initializing the first analysis.