Skip to main content
Advertisement

< Back to Article

Table 1.

Comparative analysis of the 36 individual phylogenetic trees obtained with the initial and the curated datasets.

More »

Table 1 Expand

Fig 1.

Comparison and concatenation of different subsets of the 36 universal proteins.

a. Diagram of the amino-acid lengths of the 36 universal proteins, obtained after alignment and trimming from the curated dataset (details in S1 Table). Ribosomal and non-ribosomal proteins are indicated in solid and hashed-bars, respectively. The markers for which the monophyly of Archaea was obtained in their phylogenetic tree are indicated in red, whereas those related to the paraphyly of Archaea are indicated in blue. * indicates alignments that statistically support in AU test the Woese’s or eocyte topology (in red and blue, respectively) b. Maximum Likelihood (ML) phylogenetic tree of the concatenation of the 11 Woese’s proteins (3,499 positions). c. ML phylogenetic tree of the concatenation of the 25 eocyte proteins (4,868 positions). Detailed trees in S3 and S4 Figs. The scale-bars represent the average number of substitutions per site. Values at nodes represent support calculated by nonparametric bootstrap (out of 100).

More »

Fig 1 Expand

Fig 2.

Eukaryotic-like insertions in the lokiarchaeal EF2 proteins.

a. ML phylogenetic tree of EF2 with the initial dataset (626 positions). The scale-bar represents the average number of substitutions per site. Values at nodes represent support calculated by nonparametric bootstrap (out of 100). b. Schematic representations of the three lokiarchaeal EF2 proteins with the five different domains indicated by colored lines and the positions of the specific eukaryotic insertions indicated blue triangles. c. Alignments of the 6 observed insertions of the EF2 protein (arCOG01559) are showed. Organisms’ names corresponding to Archaea and Eukarya are respectively indicated in black and blue, and lokiarchaeal sequences are surrounded in yellow. The A1, 2, 3 and C3 insertions are aligned with eukaryotic Ria sequences (EF2 paralog), whereas B3 and D3 are aligned with eukaryotic EF2 and Snu5 sequences (EF2 paralog), respectively. Detailed alignments in S13S16 Figs.

More »

Fig 2 Expand

Fig 3.

EF2 phylogenetic trees, based on the curated dataset after inclusion of bathyarchaeal sequences.

a. ML phylogenetic tree of the complete sequence (626 positions). b. ML phylogenetic tree of the C-terminal part only (394 positions). Eury, Thaum, Cren and Euka stand for Euryarchaeota, Thaumarchaeota, Crenarchaeota and Eukaryotes. Detailed trees in S17 and S18 Figs. The scale-bars represent the average number of substitutions per site. Values at nodes represent support calculated by nonparametric bootstrap (out of 100) and ultrafast bootstrap approximation (1,000 replicates), in black and red, respectively.

More »

Fig 3 Expand

Fig 4.

Impact of the EF2 protein on the original concatenated alignment.

a. ML phylogenetic tree of the original concatenated alignment of the 36 markers (10,547 positions). b. ML phylogenetic tree of the original concatenated alignment after removal of the EF2 protein (9,831 positions). c. ML phylogenetic tree of the original concatenated alignment after removal of the Loki 3 EF2 sequence (10,547 positions). Detailed trees in S19, S21 and S23 Figs. The scale-bars represent the average number of substitutions per site. Values at nodes represent support calculated by nonparametric bootstrap (out of 100).

More »

Fig 4 Expand

Fig 5.

Impact of the EF2 protein on the concatenation of the curated datasets.

a. ML phylogenetic tree of the concatenated curated datasets (8,367 positions). b. ML phylogenetic tree of the concatenated curated datasets after removal of the EF2 protein (7,724 positions). c. ML phylogenetic tree of the concatenated curated datasets after removal of the Loki 3 EF2 sequence (8,425 positions). Detailed trees in S20, S22 and S25 Figs. The scale-bars represent the average number of substitutions per site. Values at nodes represent support calculated by nonparametric bootstrap (out of 100).

More »

Fig 5 Expand

Fig 6.

Position of Candidatus Thorarchaeota archaea in the Tree of Life.

ML phylogenetic tree of the concatenated alignments of the 34 markers present in the two most complete thorarchaeal genomes. Detailed tree in S28 Fig. The scale-bar represents the average number of substitutions per site. Values at nodes represent support calculated by nonparametric bootstrap (out of 100).

More »

Fig 6 Expand

Fig 7.

RNA polymerase phylogeny.

Bayesian phylogeny (LG model + Γ4) of the concatenated alignments of the two largest RNA polymerase subunits (1,463 positions) from an equal number (39) of Archaea, Eukaryotes (blue) and Bacteria (red). Among the Archaea, Thaumarchaeota, Crenarchaeota, group I Euryarchaeota and group II Euryarchaeota are indicated in pink, orange, light-green and dark-green, respectively. Values at nodes represent the Bayesian posterior probabilities. Detailed tree in S30 Fig. See S31 Fig for CAT-GTR model tree, and S32 Fig for ML tree. The scale-bar represents the average number of substitutions per site. A red arrow indicates the Lokiarchaea position in the tree. The A subunit status (split or fused) is indicated by adjacency of colored squares. The green arrow indicates the position of the split event among the archaeal phylogeny.

More »

Fig 7 Expand

Fig 8.

RNA polymerase phylogeny with the Asgards archaea.

Tree representing the combined phylogenies obtained in ML (LG model + Γ4) and Bayesian inference (CAT-GTR model) analyses of the two largest RNA polymerase subunits after inclusion of the Asgards archaea in the dataset (detailed trees in S36 and S37 Figs). Bacterial and eukaryotic sequences are indicated in red and blue, respectively. Among the Archaea, Thaumarchaeota, Crenarchaeota, and Euryarchaeota are indicated in pink, orange, and olive-green respectively. Values over the branches (in black) correspond to the posterior probabilities (PP) of the corresponding nodes obtained from Bayesian inferences, while the values below the branches (in grey) represent supports calculated by non parametric bootstrap (BS) from the ML analysis. Branch lengths in this tree are derived from the tree obtained from the Bayesian inference (S37 Fig), and the scale-bar represents the average number of substitutions per site. From base to tips, the three * correspond to 0.95/53, 0.92/61, and 1/100, respectively (PP/BS).

More »

Fig 8 Expand