banner

Blog

Jul 31, 2023

Emergence of putative energy parasites within Clostridia revealed by genome analysis of a novel endosymbiotic clade

The ISME Journal (2023)Cite this article

Metrics details

The Clostridia is a dominant bacterial class in the guts of various animals and are considered to nutritionally contribute to the animal host. Here, we discovered clostridial endosymbionts of cellulolytic protists in termite guts, which have never been reported with evidence. We obtained (near-)complete genome sequences of three endosymbiotic Clostridia, each associated with a different parabasalid protist species with various infection rates: Trichonympha agilis, Pseudotrichonympha grassii, and Devescovina sp. All these protists are previously known to harbor permanently-associated, mutualistic Endomicrobia or Bacteroidales that supplement nitrogenous compounds. The genomes of the endosymbiotic Clostridia were small in size (1.0–1.3 Mbp) and exhibited signatures of an obligately-intracellular parasite, such as an extremely limited capability to synthesize amino acids, cofactors, and nucleotides and a disrupted glycolytic pathway with no known net ATP-generating system. Instead, the genomes encoded ATP/ADP translocase and, interestingly, regulatory proteins that are unique to eukaryotes in general and are possibly used to interfere with host cellular processes. These three genomes formed a clade with metagenome-assembled genomes (MAGs) derived from the guts of other animals, including human and ruminants, and the MAGs shared the characteristics of parasites. Gene flux analysis suggested that the acquisition of the ATP/ADP translocase gene in a common ancestor was probably key to the emergence of this parasitic clade. Taken together, we provide novel insights into the multilayered symbiotic system in the termite gut by adding the presence of parasitism and present an example of the emergence of putative energy parasites from a dominant gut bacterial clade.

Clostridia, a class in the phylum Bacillota (Firmicutes), is one of the dominant bacterial taxa in the gastrointestinal tract of various animals and contributes to host nutrition via the degradation of food materials and homeostasis via the production of various compounds [1, 2]. For instance, two clusters of Clostridia corresponding to Clostridium clusters IV and XIVa [3] occupy a large part (10–40%) of human-gut bacterial microbiota, and their production of short-chain fatty acids and catecholamines contributes to immune homeostasis in the gut environment and gut-brain axis [2]. Clostridia are also dominant in the gut microbiota of various termite species [4,5,6,7,8]. Termites thrive exclusively on dead plant matter and play a key role in the terrestrial carbon cycle in temperate to tropical regions [9], while being notorious as pests of wooden buildings. The ability of termites to efficiently digest lignocellulose and survive on nitrogen-poor food is mostly attributable to the activities of their gut microbiota, comprising protists, bacteria, and archaea [10]. The protists play a central role in lignocellulose digestion in the gut of “lower”, or phylogenetically basal, termites, and Clostridia as well as other bacterial assemblages contribute in reductive acetogenesis [11], polysaccharide degradation [12, 13], nitrogen fixation [14], and nitrogen recycling via uric acid [15].

Certain members of Clostridia colonize the mucus layer of the human and murine guts [16], and in rumens, Clostridia attach to plant fibers and are involved in polysaccharide degradation [17]. In termite guts, previous 16S rRNA-based analyses revealed that Clostridia inhabit the gut wall or luminal fluid [18, 19]. In the gut of lower termites, the cellulolytic protists generally harbor phylogenetically diverse prokaryotic symbionts within the cytoplasm [20,21,22,23,24], nucleoplasm [25], and/or on the surface of the cell [26,27,28,29,30,31,32]. Many studies have reported the mutualistic functions of those endo- (i.e., intracellular) and ecto- (i.e., extracellularly attached) symbionts based on genome analyses. For example, “Candidatus Endomicrobium trichonymphae” (phylum Elusimicrobiota) is an obligate endosymbiont of the protist Trichonympha agilis and supplements various amino acids and cofactors [20]. “Candidatus Azobacteroides pseudotrichonymphae” (phylum Bacteroidota), an obligate endosymbiont of the protist Pseudotrichonympha grassii, fixes dinitrogen and recycles nitrogen via uric acid [21], and “Candidatus Armantifilum devescovinae” (Bacteroidota), an obligate ectosymbiont of the protist Devescovina spp., also fixes dinitrogen [32]. Although direct microscopic evidence for the presence of clostridial endosymbionts of termite-gut protists has never been reported, clostridial 16S rRNA gene sequences have occasionally been obtained from cell suspension of certain protist species [23, 33]. Apart from the termite gut microbiota, it has been reported that an endosymbiont of the aquatic ciliate Metopus sp. is closely related to “Clostridium aminobutyricum” (family “Anaerovoracaceae”) [34] and that an endosymbiont of the ciliate Tryimyema compressum belongs to the family Syntrophomonadaceae [35]. These two are only known examples of endosymbionts belonging to the class Clostridia, and no information on their functions is available.

Here, we report the discovery of endosymbiotic Clostridia within the cells of protists in termite guts and our examination of their ecology and evolution based on morphological, phylogenetic, and genomic characteristics. Our results indicated that the endosymbiotic Clostridia were obligately-intracellular parasites; this is the first report of parasites infecting cellulolytic protists in the termite gut with genome-based evidence. In addition, we found that these endosymbiotic Clostridia constituted a clade with metagenome-assembled genomes (MAGs) exhibiting similar parasitic traits from the gastrointestinal tracts of various animals, including humans and ruminants. Our results revealed another factor, i.e., parasitism, within the multilayered symbiotic system in the termite gut and extend the phylogenetic, ecological, and physiological diversity of Clostridia, providing novel insights into the evolutionary emergence of a parasitic lineage within a free-living bacterial clade.

We collected three lower termite species: Reticulitermes speratus (family Rhinotermitidae) from Saitama Prefecture, Japan; Coptotermes formosanus (Rhinotermitidae) from Chiba, Kagoshima, and Okinawa Prefectures, Japan; and Neotermes sugioi (formerly N. koshunensis) (family Kalotermitidae) from Okinawa Prefecture, Japan. The entire gut of a worker termite was removed, and the gut contents were suspended in Trager’s solution U [36]. Single cells of the parabasalid protists Trichonympha agilis, Pseudotrichonympha grassii, and Devescovina sp. in the guts of R. speratus, C. formosanus, and N. sugioi, respectively, were physically isolated using a Leica AM6000 inverted micromanipulation microscope system.

The isolated protist cells were subjected to whole-genome amplification (WGA) using the Illustra GenomiPhi V2 Kit (GE Healthcare) as described previously [37]. Near full-length 16S rRNA genes were amplified by PCR using Phusion Hi-Fidelity DNA Polymerase (New England BioLabs) and a bacteria-specific primer set, 27F-mix and 1492R or 1390R [29]. Purification of PCR products, cloning, and Sanger sequencing were performed as described previously [25].

Obtained clostridial sequences were aligned to reference sequences (Table S1) retrieved from the SILVA database r138 [38], the All-Species Living Tree Project database LTP_06_2022 [39], and the Type Strains Genome Database [40], using MAFFT v7.490 with the “linsi” option [41]. The 16S rRNA genes detected in MAGs (see Supplementary Methods) were also added to the dataset. Alignments were trimmed using TrimAL v1.4rev22 [42] with the “gappyout” option, and a maximum-likelihood tree was constructed using IQ-TREE with the GTR+I+G4 model and the ultrafast bootstrap and SH-aLRT tests with 1000 replicates [43].

Oligonucleotide probes specific to the respective clostridial 16S rRNA phylotypes were designed using ARB: RsTa-C01-76 (5′-GTGCCTTGCAAAACTCCG-3′), CfP3-15-656 (5′-CCGTTCGCCTCTACTTTAC-3′), and NkDv07-142 (5′-TCCATCAGCTATCTCCCAC-3′). Fluorescence in situ hybridization (FISH) was conducted as described previously [27, 44] with hybridization at 50 °C for 1.5 h. Cells were observed under an Olympus BX51 epifluorescence microscope. Samples for transmission electron microscopy (TEM) were prepared as described previously [28, 45] and observed under a JEM-1400Plus transmission electron microscope (JEOL).

Collected single protist cells were washed several times in solution U and then transferred to water droplets containing 0.01% Tween 20 (Nacalai Tesque, Kyoto, Japan). Bacterial cells that leaked from the ruptured host cell were collected with a glass capillary attached to the micromanipulation system and subjected to WGA using Thermo Scientific EquiPhi29 DNA Polymerase. The reaction was performed at 10-µL scale at 45 °C for 3 h (see Supplementary Methods for details). PCR was conducted using primer sets specific to the respective 16S rRNA phylotypes (Table S2), to check for the presence of their genomes. The WGA samples containing the target were re-amplified using EquiPhi29 DNA polymerase at 50-µL scale. The reaction was performed at 45 °C for 2 h.

Genome sequencing was conducted on the MiSeq platform (Illumina) and the MinION platform (Oxford Nanopore Technologies). The libraries for MiSeq were prepared using the TruSeq DNA Sample Preparation Kit for low DNA input (Illumina). Sequencing was performed with the MiSeq Reagent Kit V3 (Illumina). Sequencing libraries for MinION were prepared using the SQK-LSK308 1D2 Sequencing Kit (Oxford Nanopore Technologies) for the NkDv07 sample and the SQK-LSK109 Ligation Sequencing Kit for RsTa-C01 and CfP3-15. The prepared libraries were loaded onto an R9.5 Spot-On Flow cell (FLO-MIN107) for NkDv07 and onto R10.3 Spot-On Flow cells (FLO-MIN109) for RsTa-C01 and CfP3-15. Detailed procedures for the MinION library preparation were described in Supplementary Methods.

Adapter removal and quality filtering of MiSeq reads were performed as described previously [26]. For MinION reads, base calling, adapter removal, and quality trimming, were performed using guppy v4.0.11, porechop v0.2.4 [46], and Nanofilt v2.7.1 [47], respectively. Quality-trimmed reads were assembled to construct contigs using SPAdes v3.14.1 [48] and Flye v2.8 [49] (Supplementary Methods). Automatic gene prediction and annotation were performed using DFAST [50], and the gene loci and annotations were manually curated based on the results of BLASTp searches of the NCBI non-redundant protein database. Metabolic pathways were reconstructed using the KEGG automatic annotation server (KAAS) [51] and KEGG Mapper [52]. The genomes were taxonomically classified using GTDB-tk v1.7.0 [53].

To construct a phylogenomic tree, we retrieved genome sequences of “Acutalibacteraceae” as references from GTDB r202 [54]. Also added were genome sequences assigned using GTDB-tk v1.7.0 to “Acutalibacteraceae” in the Unified Human Gastrointestinal Genome collection [55] and the Cow-rumen catalog v1.0 (https://www.ebi.ac.uk/metagenomics/genome-catalogs/cow-rumen-v1-0) as well as MAGs detected in a recent large-scale analysis of the ruminant gut microbiome [56] and those from the termite gut microbiome [57]. These reference genome sequences were clustered based on the criterion of ≥95% average nucleotide identity (ANI), and representative genomes were selected using Galah (https://github.com/wwood/galah). ANI and average amino acid identity (AAI) were calculated using ANI/AAI-Matrix [58]. Gene prediction was conducted using Prodigal v2.6.3 [59], and 31 single-copy gene markers used by Graham et al. (2018) [60] were extracted using the IdentifyHMM script (https://github.com/edgraham/PhylogenomicsWorkflow). The genome sequences with at least 16 marker genes were used to construct a phylogenomic tree (Table S3). Amino acid sequences were aligned using MAFFT v7.471 with default settings, concatenated, and trimmed using TrimAL v1.4rev22 with the “-automated1” option. A maximum-likelihood tree was constructed using IQ-TREE as above with a substitution model selected by its model finder.

The predicted genes and pseudogenes were classified into clusters of orthologous genes (COG) using RPS-BLAST v2.13.0+ with a cut-off e-value <10–4 [61], and each COG was classified into a COG functional category. For each genome, the relative abundance of the COG functional categories was calculated, and a centered log-ratio transformation was performed. The transformed data were used for principal component analysis, which was performed using a Scikit-learn python package [62]. All genes of “Acutalibacteraceae” genomes without COG annotation were subjected to de novo clustering using OrthoFinder v2.5.5 with default settings [63], and orthogroups were defined. Gene flux analysis was performed using Count (http://www.iro.umontreal.ca/~csuros/gene_content/count.html) [64] with Wagner parsimony, based on the matrix of COG and orthogroup and the phylogenetic tree obtained from the above analysis. The gene gain/loss penalty was set to 2.

For the phylogenetic analysis of ATP/ADP translocase genes, up to 100 reference sequences were identified by BLASTp searches of the UniProtKB database and were retrieved. Those used by Schmitz-Esser et al. (2004) [65] were added to the reference sequences (Table S4). After removing redundant sequences, the amino acid sequences were aligned using MAFFT v7.471 with “linsi” and trimmed with TrimAL v1.4rev22 using “gappyout”.

For eukaryotic-like genes, reference sequences were similarly retrieved (Table S5). For those genes in the RsTa-C01 genome, a tBLASTn search of the T. agilis draft genome sequence from an ongoing genome project was performed, and the aligned region of the top-hit sequence (LC770199–206) was added. For those in the CfP3-15 genome, BLASTp searches of a P. grassii transcriptome dataset [66] was performed, and the top-hit sequence was added. Alignments were carried out using T-Coffee [67] in the “expresso” mode and trimmed using TrimAL v1.4rev22 with “gappyout”. For genes for which alignment of the entire region was difficult, the most conserved region identified by the NCBI Conserved Domain Search was used for alignment. The sequences containing gaps in more than 50% of the aligned region were removed. Maximum-likelihood trees were constructed using IQ-TREE with settings as in the phylogenomic analysis.

The gut contents of 30 R. speratus or C. formosanus workers were suspended in solution U. The suspensions were centrifuged, and the precipitates were subjected to total RNA extraction using the Quick-RNA Fungal/Bacterial Microprep Kit (Zymo Research). DNA removal, reverse transcription, and RT-PCR with specific primer sets for respective eukaryotic-like genes (Table S2), were conducted as described previously [68]. No specific amplifications were detected in RNA samples without reverse transcription. The RT-PCR products were cloned using the Invitrogen TOPO TA Cloning Kit and were sequenced as described above.

During our 16S rRNA gene cloning analyses of bacterial microbiota associated with protist cells in termite guts, we identified three phylotypes affiliated with Clostridia: RsTa-C01 (LC760312) from T. agilis in an R. speratus gut, CfP3-15 (LC760310) from P. grassii in a C. formosanus gut, and NkDv07 (LC760311) from Devescovina sp. in an N. sugioi gut (see Supplementary Results for details). We performed FISH with specific probes and detected cocci exclusively within the cytoplasm of each host protist species (Fig. 1A–H). The rates of infection to the host protist species greatly varied, depending on the termite colony: 0–71% (RsTa-C01, surveyed termite colonies, n = 7), 0–100% (CfP3-15, n = 10), and 0–18% (NkDv07, n = 14) (Table S6). A single host cell harbored ca. 40–100 cells of RsTa-C01, 102–103 cells of CfP3-15, and 10–100 cells of NkDv07. RsTa-C01 cells were mostly diplococci (Fig. 1E), while CfP3-15 were detected as single, diplo-, or triplococci (Fig. 1G), and NkDv07 as aggregates (Fig. 1H) or single cocci when its cell number was small (Fig. S1A, B). TEM images showed that both RsTa-C01 and NkDv07 frequently contained an endospore and that their cells or aggregates were surrounded by the host membrane (Fig. 1I–L). We also putatively identified CfP3-15 cells by TEM among the cells of “Ca. Azobacteroides pseudotrichonymphae” [21] within P. grassii cells (Fig. S1C–E).

Fluorescence in situ hybridization (FISH) analysis of RsTa-C01, CfP3-15, and NkDv07 (A–H) and transmission electron micrographs of endosymbionts of Trichonympha agilis and Devescovina sp. (I–K). T. agilis from the gut of Reticulitermes speratus (A, E, I, J), Pseudotrichonympha grassii from the gut of Coptotermes formosanus (B, C, F, G), and Devescovina sp. from the gut of Neotermes sugioi (D, H, K, L) were observed. Endosymbiotic clostridial cells were detected with oligonucleotide probes specific to each phylotype (6FAM-labeled, green), and “Candidatus Armantifilum” cells [28, 99] were detected with probe Bactd-937 (5′-CCACATGTTCCTCCGCTT-3′) (Texas red-labeled, red) [100]. C, G Magnified images of the area enclosed by a rectangle in (B). J, L Magnified images of the area indicated in (I) and (K), respectively. White arrowheads indicate an endospore, and black arrowhead indicates a membrane structure. Bars: (A–C, E−G), 20 μm; (D, H), 10 µm; (I), 5 µm; (J), 500 nm; (K), 2 µm; (L), 200 nm.

Phylogenetic analysis based on the 16S rRNA gene revealed that these endosymbionts belonged to Clostridium cluster IV (or the Clostridium leptum group) [3] within the family Oscillospiraceae (Fig. 2 and S2A). This cluster is one of the dominant groups in the termite gut [4,5,6,7,8] as well as in the human and cattle gastrointestinal tracts [69, 70]. The endosymbionts formed a monophyletic group with sequences derived from the gastrointestinal tracts of various animals (e.g., AB746684), wastewater reactors (GQ134270), denitrifying biofilm (KJ399537), and clones from Trichonympha cells in Reticulitermes santonensis guts [23] (Fig. 2 and S2B).

Described species of “Acutalibacteraceae” in the GTDB taxonomy and uncultured clones closely related to endosymbiotic Clostridia were included in the tree. Maximum-likelihood tree was constructed with the GTR+I+G4 nucleotide substitution model and rooted with Sporobacter termitidis (Z49863) and Oscillibacter ruminantium (JF750939) as the outgroup. Highly supported nodes (SH-aLRT value ≥ 80% and ultrafast bootstrap support value ≥ 95%) are indicated with a closed circle.

To predict the functions of these three endosymbiotic Clostridia, we obtained the complete genome sequences of RsTa-C01 and CfP3-15 and a draft genome sequence of NkDv07. To estimate the completeness of the NkDv07 genome consisting of four contigs, we modified the universal set of bacterial marker genes used by CheckM [71] according to the complete CfP3-15 genome, as shown in Table S7. The estimated completeness was 100% with no contaminating sequence detected, using the modified marker gene set. All MAGs simultaneously obtained from the WGA samples are listed in Table S8. The basic genomic features of the three endosymbiotic Clostridia are shown in Table 1 in comparison with Clostridium leptum DSM753 and Ruminococcus bromii ATCC27255, which are well-characterized gut bacteria among most closely related described species within Clostridium cluster IV (Fig. 2 and S2). The genome sizes of these three endosymbionts were small (1.0–1.3 Mbp), and CfP3-15 boasted the smallest genome (1,007,634 bp) among the known complete genome sequences of Clostridia (Fig. S3). Accordingly, the number of protein-coding sequences (CDSs) was much fewer (790–839) than in the free-living members of Oscillospiraceae (ca. 1500–4700). The GC content was also lower than those of the majority of Clostridia (Fig. S3). The GC skew profile of RsTa-C01 did not exhibit clear shifts that generally enable the identification of the replication origin and terminus site (Fig. S4).

The predicted CDSs were classified into COG functional categories and compared (Fig. S5). The distribution patterns of CDSs were very similar among the three endosymbionts, and the number of genes in most categories were reduced to approximately a half to one-tenth of their free-living relatives (Fig. S5A). In particular, the reduction in categories E (amino acid transport and metabolism), G (carbohydrate transport and metabolism), and K (transcription), was prominent; the proportion of genes was reduced to 0.4–0.6 of the free-living relatives (Fig. S5B). Important genes involved in DNA replication were missing; CfP3-15 and NkDv07 lacked the polA gene for DNA polymerase I, and RsTa-C01 lacked dnaA, rnhA, and recG, encoding chromosomal replication initiator protein DnaA, ribonuclease HI, and ATP-dependent DNA helicase RecG, respectively. The three genomes contained 94–248 pseudogenes (Table 1 and S9), and the pseudogenes were classified into mainly categories X (mobilome: prophages, transposons) and L (replication, recombination, and repair) (Fig. S6). No CRISPR-Cas system was found in these three genomes.

Phylogenomic analysis based on conserved single-copy genes inferred that the endosymbiotic Clostridia belonged to the family “Acutalibacteraceae” in the GTDB taxonomy, which corresponds to a portion of the validly approved family Oscillospiraceae. “Acutalibacteraceae” contain many cultured species (Fig. 2 and S2A) and MAGs from the gastrointestinal tracts of various animal species, and this is the first report of intracellular symbionts in this group.

The predicted metabolic capacities based on the genome sequences of NkDv07 and CfP3-15 were very similar, while that of RsTa-C01 exhibited substantial differences from the other two (Fig. 3). Although all three genomes encoded a phosphotransferase system (PTS) that functions to import and phosphorylate glucose to glucose-6-phosphate (G6P), the glycolytic pathway is fragmented and cannot be used to generate ATP from glucose (Fig. 3). All three genomes encoded isomerase, which converts G6P to fructose-6-phosphate, a precursor for peptidoglycan biosynthesis. However, the RsTa-C01 genome lacked phosphofructokinase and fructose-bisphosphate aldolase; only CfP3-15 and NkDv07 can produce dihydroxyacetone phosphate (DHAP) from glucose, which is a precursor for glycerophospholipid biosynthesis. All three genomes lacked the gene for enolase, and therefore phosphoenolpyruvate (PEP), which is required for the activity of PTS, must be produced from pyruvate by the action of pyruvate-phosphate dikinase with ATP hydrolysis or from oxaloacetate by the action of phosphoenolpyruvate carboxykinase with GTP hydrolysis. Nevertheless, the genes for these enzymes were found only in the genomes of CfP3-15 and NkDv07 and were not identified in the RsTa-C01 genome. RsTa-C01 retained the pathway from 2-phosphoglycerate to DHAP (Fig. 3); thus, RsTa-C01 apparently needs to import phosphorylated glycerate to synthesize glycerophospholipids and needs to import PEP to activate PTS, although the associated transporters were not identified in its genome.

The metabolic capacity of NkDv07 is not shown because of its substantial similarity to that of CfP3-15. Pathways found only in the RsTa-C01 genome are highlighted with blue, and those only in the CfP3-15 genome are highlighted with yellow. Incomplete pathways are highlighted with gray.

Certain species of Clostridia possess the Rnf complex and/or energy-converting hydrogenase for ATP production; [72] however, the related genes were not identified in any of the three genomes. Alternative ATP-generating pathways, such as the Entner-Doudoroff pathway, Stickland reaction, and arginine dihydrolase pathway, were also not identified in these genomes. Instead, the gene encoding ATP/ADP translocase, a transporter that imports ATP and exports ADP and phosphate, was found in all three genomes. Important amino acid residues for nucleotide transport [73] were conserved in these homologs (Fig. S7). The apparent absence of net ATP-generating system and the presence of ATP/ADP translocase strongly suggest that the endosymbiotic Clostridia rely on ATP imported from the host. This transporter may be used also for other nucleotides [74]. The transcription of the ATP/ADP translocase genes of CfP3-15 and RsTa-C01 was confirmed by RT-PCR (Fig. S8). We did not attempt RT-PCR for NkDv07 because we lost the termite colony that contained this endosymbiont before the experiment.

CfP3-15 and NkDv07, but not RsTa-C01, retained the non-oxidative pentose phosphate pathway used to synthesize phosphoribosyl pyrophosphate (PRPP) (Fig. 3). However, CfP3-15 and NkDv07 did not possess the genes required for the de novo biosynthesis of purines and can only produce UMP/UDP/uridine for pyrimidine biosynthesis. Almost no biosynthetic pathways for amino acids or cofactors were present in the three genomes (Fig. 3). Each genome encoded several transporters for amino acids and cofactors. For example, the RsTa-C01 genome encoded transporters for formate and folate, which are used as substrates to synthesize 10-formyl-tetrahydrofolate. Although CfP3-15 and NkDv07 did not possess these transporters, their genomes encoded the biosynthetic pathway used for obtaining tetrahydrofolate from GTP. No known functions beneficial for the protist and/or termite hosts were identified in all three genomes, including nitrogen fixation/recycling [21, 22, 24, 31, 32, 75], reductive acetogenesis [22, 24], hydrogen removal [21, 22, 24, 26, 76, 77], polysaccharide degradation [31, 77, 78], and nutrient production [20,21,22, 24, 31, 75, 79].

Several genes unique to eukaryotes in general were identified in all three genomes (Table S10). For example, the CfP3-15 genome contained 18 genes for small GTPase, mainly of the Rab family, which are involved in intracellular trafficking, cell motility, and signal transduction in eukaryotic cells [80]. These proteins, such as small GTPases, Ser/Thr protein kinases, and Myb-like transcription factors, have been found almost exclusively in eukaryotes and several pathogenic bacterial clades, including Legionella [81] and Chlamydiota [82]. Among them, genes for Rab-GTPase have been found almost exclusively in Legionella outside of eukaryotes and never in the Clostridia class. The repertoires of these genes for eukaryotic-like proteins differed among the three genomes; the genes were acquired independently. Phylogenetic analyses indicated that many of the genes were closely related to those of protists, and at least 12 genes were most closely related to those of parabasalids (Fig. 4 and S9, Table S10). A Ser/Thr kinase of RsTa-C01 was phylogenetically closest to that of Trichonympha agilis (LC770206) (Fig. 4A), implying that this gene was acquired horizontally from the host protist. It was predicted that at least 20 of the 42 eukaryotic-like protein-coding genes contained signal peptides (Table S10); these are most probably secreted outside the cell through the Sec system.

A Phylogenetic relationships of Ser/Thr kinase of RsTa-C01 and its homologs. B Phylogenetic relationships of Rab-GTPase of CfP3-15 and its homologs. Maximum likelihood trees were generated using the LG+I+G4 amino acid substitution model. Highly supported nodes (SH-aLRT value ≥ 80% and ultrafast bootstrap support value ≥ 95%) are indicated with a closed circle or actual values (SH-aLRT/ultrafast bootstrap).

Transcription of three out of ten eukaryotic-like genes tested was confirmed by RT-PCR and Sanger sequencing (Fig. S10, Table S10). The failure to detect the other transcripts may be due to intraspecific genetic variations because we had to use different termite colonies from different geographic locations for the RT-PCR experiments after genome sequencing analysis. Indeed, the sequence obtained for a small GTPase gene contained multiple single-nucleotide polymorphisms against the genome sequence (Fig. S11).

The phylogenetic positions of the three endosymbionts in the family “Acutalibacteraceae” were analyzed based on conserved single-copy genes (Fig. 5A, Table S3). The three genomes formed a monophyletic group with 23 MAGs from the gastrointestinal tracts of several animal species, including a termite, ruminants, and humans. This monophyletic group was divided into at least six genus-level clades based on GTDB r207 (Fig. 5A). Genomes of each genus-level clade shared at least 50% AAI, although ANI was occasionally below 70%, including the ANI among the three endosymbionts (Fig. S12). The predicted genome size and GC content of these MAGs were similar to those of the three endosymbionts: 0.86–1.4 Mbp (except for MGYG000004496: 1.9 Mb) and 30–39%, respectively (Fig. 5B, Table S11). This monophyletic group is designated hereafter the “small genome clade”, the genomic features and host information for which are summarized in Table S11.

A Phylogenomic tree of bacteria belonging to “Acutalibacteraceae” based on concatenated amino acid sequences of single-copy marker genes. Maximum likelihood tree was constructed using the LG+F+I+G4 amino acid substitution model and rooted with Syntrophomonas wolfei (GCF_000014725.1) and Thermosyntropha lipolytica (GCF_900129805.1) as the outgroup. Highly supported nodes (SH-aLRT value ≥ 80% and ultrafast bootstrap support value ≥ 95%) are indicated with a closed circle. The small genome clade is magnified, and the genus-level clades based on GTDB r207 and distribution of ATP/ADP translocase (TlcC) and eukaryotic-like domains (Tables S10 and S12) are also depicted. The domains of protein-coding genes with a star were detected only in the small genome clade among “Acutalibacteraceae” genomes. B Relationships between genome size (or total contig lengths of draft genomes) and GC content. C Principal component analysis of genomes based on the distribution patterns of the COG functional categories. Genomes of the small genome clade are highlighted in red in (B) and (C).

The COG distribution patterns were compared within “Acutalibacteraceae” (Fig. 5C). The genomes of the small genome clade clustered together, with a characteristic reduction in the proportion of genes related to metabolisms, especially G (carbohydrate transport and metabolism), and an increase in those related to J (translation, ribosomal structure, and biogenesis) (Fig. S13). Several genes containing eukaryotic-like domains were also identified from MAGs of the small genome clade. The identified domains included those of small GTPases, protein kinases, protein phosphatases, and ubiquitinyl hydrolases (Fig. 5A, Table S12).

To estimate the loss and gain events of gene families in the ancestors of the small genome clade, we conducted the gene flux analysis. The results suggested that the last common ancestor of the small genome clade gained the genes for ATP/ADP translocase (COG3202), 3-methyladenine DNA glycosylase Mpg (COG2094), serine transporter YbeC (COG0531), and others (Table S13). Among these, genes for ATP/ADP translocase and hypothetical proteins OG0003398, OG0004343, and OG0005234 were exclusively found in the small genome clade, whereas the others were also sparsely found in other clades in “Acutalibacteraceae” (Fig. S14). Of these, the ATP/ADP translocase gene was retained by all members of the small genome clade except for GCA_017634155.1 (Fig. 5A). The three hypothetical proteins were absent from 30 to 70% of genomes in the small genome clade, including the complete genomes of RsTa-C01 and CfP3-15 (Fig. S14).

The ATP/ADP translocase genes from the small genome clade formed two monophyletic clusters; one was a sister group of a clade mainly comprising those from Alphaproteobacteria and Chlamydiae (designated Clostridia ATP/ADP translocase I), and the other was in a large clade mainly consisting of those from “Candidatus Dependentiae” and Chlamydiae (Clostridia ATP/ADP translocase II) (Fig. 6). All genomes in the small genome clade, except GCA_017634155.1, encoded Clostridia ATP/ADP translocase I, and only a portion of them additionally possessed the other gene. Because the tree topology of Clostridia ATP/ADP translocase I was largely congruent with the species tree (Fig. S15A), the ATP/ADP translocase gene acquired by the last common ancestor of the small genome clade was most probably Clostridia ATP/ADP translocase I. Clostridia ATP/ADP translocase II was possibly acquired from a member of “Ca. Dependentiae” or Chlamydiae after the small genome clade had diversified (Figs. 6 and S15B).

Maximum likelihood tree was constructed using the LG+F+G4 amino acid substitution model. Only support values (SH-aLRT/ultrafast bootstrap support values) for major clades are shown for clarity.

In this study, we discovered and analyzed the genomes of endosymbiotic Clostridia found exclusively within the cells of cellulolytic protists in termite guts. The total dependence of nutrition on their protist hosts, i.e., the lack of most biosynthetic pathways of nucleotides, amino acids, cofactors, and of systems to generate ATP, made it obvious that these Clostridia are obligately-intracellular parasites unable to reproduce outside the host cell, although the possibility that they possess certain beneficial functions not evident from the annotated genes cannot be excluded. To our knowledge, they are the most host-dependent Clostridia with the smallest genome sizes. The obligately parasitic life style of the endosymbiotic Clostridia contrasts with diverse endo- and ectosymbiotic bacteria of termite-gut protists studied thus far based on genome sequences. All these previously analyzed symbionts were suggested to be mutualists that critically contribute to the nutrition and metabolism of their protist hosts [20,21,22, 26, 31, 32, 75,76,77,78,79]. It seems likely that the endosymbiotic Clostridia have exploited these protist-bacteria mutualistic complexes as resources.

The habitat of this clostridial parasitic clade is not confined within the gut of lower termites; we found that the small genome clade is widely distributed in the gastrointestinal tracts of various animals, including “higher” (or phylogenetically apical) termites with no cellulolytic gut protists, humans throughout the world, and ruminants such as cattle, sheep, and deer (Figs. 5A and S16, Table S11). Although their localizations remain unknown in the gastrointestinal tract of these animals, higher termites can house a small number of protists in their gut, including Nyctotherus ciliates and Trichomonas-like parabasalids [83] and human guts can harbor commensal or pathogenic genera such as Trichomonas, Entamoeba, and Giardia [84]. In addition, clostridia belonging to the families Oscillospiraceae and Lachnospiraceae were reportedly enriched in ciliate fractions compared to protist-free bacterial fractions in the 16S rRNA gene amplicon sequencing analysis of cow ruminal contents [85]. Indeed, we confirmed the presence of 16S rRNA gene sequences related to those of the endosymbiotic Clostridia from ciliate fractions (Fig. S16), whereas no evidence was provided by FISH or TEM in a previous study [85]. Thus, these protists might be the hosts for the clostridia of the small genome clade. Previous reports focused on prey-predator relationships between eukaryotes and prokaryotes in the gastrointestinal tract of mammals [86, 87]; intracellular parasitism should also be considered.

The evolutionary emergence of this parasitic clade in a dominant gut bacterial group is likely attributable to the horizontal acquisition of ATP/ADP translocase, possibly from alphaproteobacteria, by the last common ancestor of the small genome clade, as suggested by our gene flux analysis (Fig. 6). Microsporidia, eukaryotic parasites, were previously suggested to have acquired the ATP/ADP translocase gene at an early stage in the transition to an intracellular lifestyle [88]. Because the acquisition of ATP/ADP translocase is also inferred in an early evolutionary stage of obligate host association of Rickettsiales [89], it is assumed that the acquisition of this gene was significant in establishing parasitism in organisms, irrespective of Gram-negative bacteria, Gram-positive bacteria, or eukaryotes. After intracellular parasitism emerged by the acquisition of ATP/ADP translocase, members of the small genome clade appear to have taken a common strategy: they independently acquired genes for eukaryotic-like regulatory proteins possibly from their protist hosts. These proteins may be used to adapt to their respective host cell environments by manipulating the host cellular processes. For example, the eukaryotic-like Ser/Thr protein kinase of diverse pathogenic bacteria reportedly disrupts various host cell functions, such as the innate immune system and the formation of actin cytoskeleton [90]. Although experimental evidence is absent, it has been hypothesized that Rab-GTPase proteins mimic and disrupt host cell functions [81]. Thus, it seems likely that convergent evolution has occurred in the small genome clade as seen in previously known, other intracellular parasites.

Evolutionary convergence was also observed in the DNA replication system of the small genome clade as endosymbionts; CfP3-15 and NkDv07 lack polA, and RsTa-C01 lacks dnaA, rnhA, and recG. The absence of polA has been reported in several mutualistic endosymbionts of insects [91] and the endosymbiotic cyanobacterium (chromatophore) of the photosynthetic protist Paulinella [92]. Possibly, slow-growing bacteria, including endosymbionts, can withstand the absence of PolA, as shown in an Escherichia coli strain lacking polA [93, 94]. The dnaA gene is also absent or pseudogenized in certain endosymbiotic bacteria of insects [95], water ferns [96], and a protist in the termite gut [20, 68]. The lack of this gene might be a consequence of streamlining adaptations along with the use of other replication mechanisms independent of DnaA [94]. The absence of dnaA is consistent with the atypical GC skew pattern of RsTa-C01 (Fig. S4), as seen in the dnaA-pseudogenized genome of “Ca. Endomicrobium trichonymphae” [20]. On the other hand, because double mutants of rnhA and recG are lethal in E. coli [97, 98], it is unclear how RsTa-C01 endures under the absence of these genes. Further studies of members of the small genome clade in environments other than termite guts will deepen our understanding of the ecology and evolution of this novel parasitic clade.

Here, we propose the novel genera and species names “Candidatus Improbicoccus pseudotrichonymphae”, “Candidatus Improbicoccus devescovinae”, and “Candidatus Paraimprobicoccus trichonymphae”, for CfP3-15, NkDv07, and RsTa-C01, respectively, based on their phylogeny, morphology, and genome sequences. The details are described in Supplementary Results.

The genome sequences of CfP3-15, NkDv07, and RsTa-C01 are available under the accession numbers AP027924, BSWA01000001–4, and AP027925, respectively in BioProject PRJDB15329. The 16S rRNA sequences of endosymbiotic Clostridia and partial CDSs of T. agilis genes are available at DDBJ under accession numbers LC760310–12 and LC770199–206, respectively. Metagenomic reads were deposited in DDBJ Sequence Read Archive under accession numbers DRR451059–66. All MAGs produced in this study are available at BioSample under accession numbers SAMD00586941–3 and SAMD00634410–20.

Cryan JF, O’Riordan KJ, Cowan CSM, Sandhu KV, Bastiaanssen TFS, Boehme M, et al. The microbiota-gut-brain axis. Physiol Rev. 2019;99:1877–2013.

Article CAS PubMed Google Scholar

Lopetuso LR, Scaldaferri F, Petito V, Gasbarrini A. Commensal clostridia: leading players in the maintenance of gut homeostasis. Gut Pathog. 2013;5:e23.

Article Google Scholar

Collins MD, Lawson PA, Willems A, Cordoba JJ, Fernandez-Garayzabal J, Garcia P, et al. The phylogeny of the genus Clostridium: proposal of five new genera and eleven new species combinations. Int J Syst Bacteriol. 1994;44:812–26.

Article CAS PubMed Google Scholar

Hongoh Y, Ohkuma M, Kudo T. Molecular analysis of bacterial microbiota in the gut of the termite Reticulitermes speratus (Isoptera; Rhinotermitidae). FEMS Microbiol Ecol. 2003;44:231–42.

Article CAS PubMed Google Scholar

Abdul Rahman N, Parks DH, Willner DL, Engelbrektson AL, Goffredi SK, Warnecke F, et al. A molecular survey of Australian and North American termite genera indicates that vertical inheritance is the primary force shaping termite gut microbiomes. Microbiome. 2015;3:e5.

Article Google Scholar

Mikaelyan A, Dietrich C, Kohler T, Poulsen M, Sillam-Dusses D, Brune A. Diet is the primary determinant of bacterial community structure in the guts of higher termites. Mol Ecol. 2015;24:5284–95.

Article CAS PubMed Google Scholar

Bourguignon T, Lo N, Dietrich C, Sobotnik J, Sidek S, Roisin Y, et al. Rampant host switching shaped the termite gut microbiome. Curr Biol. 2018;28:649–54.e2.

Article CAS PubMed Google Scholar

Schmitt-Wagner D, Friedrich MW, Wagner B, Brune A. Phylogenetic diversity, abundance, and axial distribution of bacteria in the intestinal tract of two soil-feeding termites (Cubitermes spp.). Appl Environ Microbiol. 2003;69:6007–17.

Article CAS PubMed PubMed Central Google Scholar

Bignell DE, Eggleton P Termites in ecosystems. In: Abe Y, Bignell DE, Higashi T (eds). Termites: Evolution, Sociality, Symbioses, Ecology. Springer, New York, 2000, pp 363-87.

Brune A. Symbiotic digestion of lignocellulose in termite guts. Nat Rev Microbiol. 2014;12:168–80.

Article CAS PubMed Google Scholar

Grech-Mora I, Fardeau ML, Patel BKC, Ollivier B, Rimbault A, Prensier G, et al. Isolation and characterization of Sporobacter termitidis gen. nov., sp. nov., from the digestive tract of the wood-feeding termite Nasutitermes lujae. Int J Syst Bacteriol. 1996;46:512–8.

Article Google Scholar

He S, Ivanova N, Kirton E, Allgaier M, Bergin C, Scheffrahn RH, et al. Comparative metagenomic and metatranscriptomic analysis of hindgut paunch microbiota in wood- and dung-feeding higher termites. PLoS One. 2013;8:e61126.

Article CAS PubMed PubMed Central Google Scholar

Arora J, Kinjo Y, Sobotnik J, Bucek A, Clitheroe C, Stiblik P, et al. The functional evolution of termite gut microbiota. Microbiome. 2022;10:e78.

Article Google Scholar

Yamada A, Inoue T, Noda S, Hongoh Y, Ohkuma M. Evolutionary trend of phylogenetic diversity of nitrogen fixation genes in the gut community of wood-feeding termites. Mol Ecol. 2007;16:3768–77.

Article CAS PubMed Google Scholar

Thong-On A, Suzuki K, Noda S, Inoue J, Kajiwara S, Ohkuma M. Isolation and characterization of anaerobic bacteria for symbiotic recycling of uric acid nitrogen in the gut of various termites. Microbes Environ. 2012;27:186–92.

Article PubMed PubMed Central Google Scholar

Ouwerkerk JP, de Vos WM, Belzer C. Glycobiome: bacteria and mucus at the epithelial interface. Best Pr Res Clin Gastroenterol. 2013;27:25–38.

Article CAS Google Scholar

Morais S, Mizrahi I. Islands in the stream: from individual to communal fiber degradation in the rumen ecosystem. FEMS Microbiol Rev. 2019;43:362–79.

Article CAS PubMed PubMed Central Google Scholar

Yang H, Schmitt-Wagner D, Stingl U, Brune A. Niche heterogeneity determines bacterial community structure in the termite gut (Reticulitermes santonensis). Environ Microbiol. 2005;7:916–32.

Article CAS PubMed Google Scholar

Nakajima H, Hongoh Y, Usami R, Kudo T, Ohkuma M. Spatial distribution of bacterial phylotypes in the gut of the termite Reticulitermes speratus and the bacterial community colonizing the gut epithelium. FEMS Microbiol Ecol. 2005;54:247–55.

Article CAS PubMed Google Scholar

Hongoh Y, Sharma VK, Prakash T, Noda S, Taylor TD, Kudo T, et al. Complete genome of the uncultured Termite Group 1 bacteria in a single host protist cell. Proc Natl Acad Sci USA 2008;105:5555–60.

Article CAS PubMed PubMed Central Google Scholar

Hongoh Y, Sharma VK, Prakash T, Noda S, Toh H, Taylor TD, et al. Genome of an endosymbiont coupling N2 fixation to cellulolysis within protist cells in termite gut. Science. 2008;322:1108–9.

Article CAS PubMed Google Scholar

Ikeda-Ohtsubo W, Strassert JF, Kohler T, Mikaelyan A, Gregor I, McHardy AC, et al. ‘Candidatus Adiutrix intracellularis’, an endosymbiont of termite gut flagellates, is the first representative of a deep-branching clade of Deltaproteobacteria and a putative homoacetogen. Environ Microbiol. 2016;18:2548–64.

Article CAS PubMed Google Scholar

Strassert JF, Kohler T, Wienemann TH, Ikeda-Ohtsubo W, Faivre N, Franckenberg S, et al. ‘Candidatus Ancillula trichonymphae’, a novel lineage of endosymbiotic Actinobacteria in termite gut flagellates of the genus Trichonympha. Environ Microbiol. 2012;14:3259–70.

Article CAS PubMed Google Scholar

Ohkuma M, Noda S, Hattori S, Iida T, Yuki M, Starns D, et al. Acetogenesis from H2 plus CO2 and nitrogen fixation by an endosymbiotic spirochete of a termite-gut cellulolytic protist. Proc Natl Acad Sci USA. 2015;112:10224–30.

Article CAS PubMed PubMed Central Google Scholar

Sato T, Kuwahara H, Fujita K, Noda S, Kihara K, Yamada A, et al. Intranuclear verrucomicrobial symbionts and evidence of lateral gene transfer to the host protist in the termite gut. ISME J. 2014;8:1008–19.

Article CAS PubMed Google Scholar

Takeuchi M, Kuwahara H, Murakami T, Takahashi K, Kajitani R, Toyoda A, et al. Parallel reductive genome evolution in Desulfovibrio ectosymbionts independently acquired by Trichonympha protists in the termite gut. ISME J. 2020;14:2288–301.

Article CAS PubMed PubMed Central Google Scholar

Noda S, Ohkuma M, Yamada A, Hongoh Y, Kudo T. Phylogenetic position and in situ identification of ectosymbiotic spirochetes on protists in the termite gut. Appl Environ Microbiol. 2003;69:625–33.

Article CAS PubMed PubMed Central Google Scholar

Noda S, Inoue T, Hongoh Y, Kawai M, Nalepa CA, Vongkaluang C, et al. Identification and characterization of ectosymbionts of distinct lineages in Bacteroidales attached to flagellated protists in the gut of termites and a wood-feeding cockroach. Environ Microbiol. 2006;8:11–20.

Article CAS PubMed Google Scholar

Hongoh Y, Sato T, Dolan MF, Noda S, Ui S, Kudo T, et al. The motility symbiont of the termite gut flagellate Caduceia versatilis is a member of the “Synergistes” group. Appl Environ Microbiol. 2007;73:6270–6.

Article CAS PubMed PubMed Central Google Scholar

Hongoh Y, Sato T, Noda S, Ui S, Kudo T, Ohkuma M. Candidatus Symbiothrix dinenymphae: bristle-like Bacteroidales ectosymbionts of termite gut protists. Environ Microbiol. 2007;9:2631–5.

Article CAS PubMed Google Scholar

Treitli SC, Kolisko M, Husnik F, Keeling PJ, Hampl V. Revealing the metabolic capacity of Streblomastix strix and its bacterial symbionts using single-cell metagenomics. Proc Natl Acad Sci USA 2019;116:19675–84.

Article CAS PubMed PubMed Central Google Scholar

Desai MS, Brune A. Bacteroidales ectosymbionts of gut flagellates shape the nitrogen-fixing community in dry-wood termites. ISME J. 2012;6:1302–13.

Article CAS PubMed Google Scholar

Sato T, Hongoh Y, Noda S, Hattori S, Ui S, Ohkuma M. Candidatus Desulfovibrio trichonymphae, a novel intracellular symbiont of the flagellate Trichonympha agilis in termite gut. Environ Microbiol. 2009;11:1007–15.

Article PubMed Google Scholar

Hirakata Y, Oshiki M, Kuroda K, Hatamoto M, Kubota K, Yamaguchi T, et al. Identification and detection of prokaryotic symbionts in the ciliate Metopus from anaerobic granular sludge. Microbes Environ. 2015;30:335–8.

Article PubMed PubMed Central Google Scholar

Shinzato N, Watanabe I, Meng XY, Sekiguchi Y, Tamaki H, Matsui T, et al. Phylogenetic analysis and fluorescence in situ hybridization detection of archaeal and bacterial endosymbionts in the anaerobic ciliate Trimyema compressum. Micro Ecol. 2007;54:627–36.

Article Google Scholar

Trager W. The cultivation of a cellulose-digesting flagellate, Trichomonas termopsidis, and of certain other termite protozoa. Biol Bull. 1934;66:182–90.

Article Google Scholar

Igai K, Kitade O, Fu J, Omata K, Yonezawa T, Ohkuma M, et al. Fine-scale genetic diversity and putative ecotypes of oxymonad protists coinhabiting the hindgut of Reticulitermes speratus. Mol Ecol. 2022;31:1317–31.

Article PubMed Google Scholar

Pruesse E, Quast C, Knittel K, Fuchs BM, Ludwig W, Peplies J, et al. SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Res. 2007;35:7188–96.

Article CAS PubMed PubMed Central Google Scholar

Ludwig W, Viver T, Westram R, Francisco Gago J, Bustos-Caparros E, Knittel K, et al. Release LTP_12_2020, featuring a new ARB alignment and improved 16S rRNA tree for prokaryotic type strains. Syst Appl Microbiol. 2021;44:e126218.

Article Google Scholar

Shi W, Sun Q, Fan G, Hideaki S, Moriya O, Itoh T, et al. gcType: a high-quality type strain genome database for microbial phylogenetic and functional research. Nucleic Acids Res. 2021;49:D694–D705.

Article CAS PubMed Google Scholar

Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–80.

Article CAS PubMed PubMed Central Google Scholar

Capella-Gutierrez S, Silla-Martinez JM, Gabaldon T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–3.

Article CAS PubMed PubMed Central Google Scholar

Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32:268–74.

Article CAS PubMed Google Scholar

Utami YD, Kuwahara H, Murakami T, Morikawa T, Sugaya K, Kihara K, et al. Phylogenetic diversity and single-cell genome analysis of “Melainabacteria”, a non-photosynthetic cyanobacterial group, in the termite gut. Microbes Environ. 2018;33:50–7.

Article PubMed PubMed Central Google Scholar

Murakami T, Onouchi S, Igai K, Ohkuma M, Hongoh Y. Ectosymbiotic bacterial microbiota densely colonize the surface of thelastomatid nematodes in the gut of the wood-feeding cockroach Panesthia angustipennis. FEMS Microbiol Ecol. 2019;95:fiy238.

Article CAS Google Scholar

Wick RR, Judd LM, Gorrie CL, Holt KE. Completing bacterial genome assemblies with multiplex MinION sequencing. Micro Genom. 2017;3:e000132.

Google Scholar

De Coster W, D’Hert S, Schultz DT, Cruts M, Van Broeckhoven C. NanoPack: visualizing and processing long-read sequencing data. Bioinformatics. 2018;34:2666–9.

Article PubMed PubMed Central Google Scholar

Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19:455–77.

Article CAS PubMed PubMed Central Google Scholar

Kolmogorov M, Yuan J, Lin Y, Pevzner PA. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol. 2019;37:540–6.

Article CAS PubMed Google Scholar

Tanizawa Y, Fujisawa T, Nakamura Y. DFAST: a flexible prokaryotic genome annotation pipeline for faster genome publication. Bioinformatics. 2018;34:1037–9.

Article CAS PubMed Google Scholar

Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 2007;35:W182–5.

Article PubMed PubMed Central Google Scholar

Kanehisa M, Sato Y. KEGG Mapper for inferring cellular functions from protein sequences. Protein Sci. 2020;29:28–35.

Article CAS PubMed Google Scholar

Chaumeil PA, Mussig AJ, Hugenholtz P, Parks DH. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics. 2019;36:1925–7.

Article PubMed PubMed Central Google Scholar

Parks DH, Chuvochina M, Chaumeil PA, Rinke C, Mussig AJ, Hugenholtz P. A complete domain-to-species taxonomy for Bacteria and Archaea. Nat Biotechnol. 2020;38:1079–86.

Article CAS PubMed Google Scholar

Almeida A, Nayfach S, Boland M, Strozzi F, Beracochea M, Shi ZJ, et al. A unified catalog of 204,938 reference genomes from the human gut microbiome. Nat Biotechnol. 2021;39:105–14.

Article CAS PubMed Google Scholar

Xie F, Jin W, Si H, Yuan Y, Tao Y, Liu J, et al. An integrated gene catalog and over 10,000 metagenome-assembled genomes from the gastrointestinal microbiome of ruminants. Microbiome. 2021;9:e137.

Article Google Scholar

Herve V, Liu P, Dietrich C, Sillam-Dusses D, Stiblik P, Sobotnik J, et al. Phylogenomic analysis of 589 metagenome-assembled genomes encompassing all major prokaryotic lineages from the gut of higher termites. PeerJ. 2020;8:e8614.

Article PubMed PubMed Central Google Scholar

Rodriguez-R LM, Konstantinidis KT. The enveomics collection: a toolbox for specialized analyses of microbial genomes and metagenomes. PeerJ Prepr. 2016;4:e1900v1.

Google Scholar

Hyatt D, Chen GL, Locascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinform. 2010;11:e119.

Article Google Scholar

Graham ED, Heidelberg JF, Tully BJ. Potential for primary productivity in a globally-distributed bacterial phototroph. ISME J. 2018;12:1861–6.

Article CAS PubMed PubMed Central Google Scholar

Marchler-Bauer A, Panchenko AR, Shoemaker BA, Thiessen PA, Geer LY, Bryant SH. CDD: a database of conserved domain alignments with links to domain three-dimensional structure. Nucleic Acids Res. 2002;30:281–3.

Article CAS PubMed PubMed Central Google Scholar

Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in python. J Mach Learn Res. 2011;12:2825–30.

Google Scholar

Emms DM, Kelly S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 2019;20:e238.

Article Google Scholar

Csuros M. Count: evolutionary analysis of phylogenetic profiles with parsimony and likelihood. Bioinformatics. 2010;26:1910–2.

Article PubMed Google Scholar

Schmitz-Esser S, Linka N, Collingro A, Beier CL, Neuhaus HE, Wagner M, et al. ATP/ADP translocases: a common feature of obligate intracellular amoebal symbionts related to Chlamydiae and Rickettsiae. J Bacteriol. 2004;186:683–91.

Article CAS PubMed PubMed Central Google Scholar

Nishimura Y, Otagiri M, Yuki M, Shimizu M, Inoue J-I, Moriya S, et al. Division of functional roles for termite gut protists revealed by single-cell transcriptomes. ISME J. 2020;14:2449–60.

Article CAS PubMed PubMed Central Google Scholar

Di Tommaso P, Moretti S, Xenarios I, Orobitg M, Montanyola A, Chang JM, et al. T-Coffee: a web server for the multiple sequence alignment of protein and RNA sequences using structural information and homology extension. Nucleic Acids Res. 2011;39:W13–7.

Article PubMed PubMed Central Google Scholar

Izawa K, Kuwahara H, Kihara K, Yuki M, Lo N, Itoh T, et al. Comparison of intracellular “Ca. Endomicrobium trichonymphae” genomovars illuminates the requirement and decay of defense systems against foreign DNA. Genome Biol Evol. 2016;8:3099–107.

Article CAS PubMed PubMed Central Google Scholar

Almeida A, Mitchell AL, Boland M, Forster SC, Gloor GB, Tarkowska A, et al. A new genomic blueprint of the human gut microbiota. Nature. 2019;568:499–504.

Article CAS PubMed PubMed Central Google Scholar

Stewart RD, Auffret MD, Warr A, Walker AW, Roehe R, Watson M. Compendium of 4941 rumen metagenome-assembled genomes for rumen microbiome biology and enzyme discovery. Nat Biotechnol. 2019;37:953–61.

Article CAS PubMed PubMed Central Google Scholar

Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–55.

Article CAS PubMed PubMed Central Google Scholar

Schoelmerich MC, Katsyv A, Donig J, Hackmann TJ, Muller V. Energy conservation involving 2 respiratory circuits. Proc Natl Acad Sci USA. 2020;117:1167–73.

Article CAS PubMed Google Scholar

Trentmann O, Decker C, Winkler HH, Neuhaus HE. Charged amino-acid residues in transmembrane domains of the plastidic ATP/ADP transporter from Arabidopsis are important for transport efficiency, substrate specificity, and counter exchange properties. Eur J Biochem. 2000;267:4098–105.

Article CAS PubMed Google Scholar

Tjaden J, Winkler HH, Schwöppe C, van der Laan M, Möhlmann T, Neuhaus HE. Two nucleotide transport proteins in Chlamydia trachomatis, one for net nucleoside triphosphate uptake and the other for transport of energy. J Bacteriol. 1999;181:1196–202.

Article CAS PubMed PubMed Central Google Scholar

Tai V, Carpenter KJ, Weber PK, Nalepa CA, Perlman SJ, Keeling PJ. Genome evolution and nitrogen fixation in bacterial ectosymbionts of a protist inhabiting wood-feeding cockroaches. Appl Environ Microbiol. 2016;82:4682–95.

Article CAS PubMed PubMed Central Google Scholar

Kuwahara H, Yuki M, Izawa K, Ohkuma M, Hongoh Y. Genome of ‘Ca. Desulfovibrio trichonymphae’, an H2-oxidizing bacterium in a tripartite symbiotic system within a protist cell in the termite gut. ISME J. 2017;11:766–76.

Article CAS PubMed Google Scholar

Utami YD, Kuwahara H, Igai K, Murakami T, Sugaya K, Morikawa T, et al. Genome analyses of uncultured TG2/ZB3 bacteria in ‘Margulisbacteria’ specifically attached to ectosymbiotic spirochetes of protists in the termite gut. ISME J. 2019;13:455–67.

Article CAS PubMed Google Scholar

Yuki M, Kuwahara H, Shintani M, Izawa K, Sato T, Starns D, et al. Dominant ectosymbiotic bacteria of cellulolytic protists in the termite gut also have the potential to digest lignocellulose. Environ Microbiol. 2015;17:4942–53.

Article CAS PubMed Google Scholar

Strassert JFH, Mikaelyan A, Woyke T, Brune A. Genome analysis of ‘Candidatus Ancillula trichonymphae’, first representative of a deep-branching clade of Bifidobacteriales, strengthens evidence for convergent evolution in flagellate endosymbionts. Environ Microbiol Rep. 2016;8:865–73.

Article PubMed Google Scholar

Song S, Cong W, Zhou S, Shi Y, Dai W, Zhang H, et al. Small GTPases: structure, biological function and its interaction with nanoparticles. Asian J Pharm Sci. 2019;14:30–9.

Article PubMed Google Scholar

Gomez-Valero L, Rusniok C, Carson D, Mondino S, Perez-Cobas AE, Rolando M, et al. More than 18,000 effectors in the Legionella genus genome provide multiple, independent combinations for replication in human cells. Proc Natl Acad Sci USA. 2019;116:2265–73.

Article CAS PubMed PubMed Central Google Scholar

Martyn JE, Gomez-Valero L, Buchrieser C. The evolution and role of eukaryotic-like domains in environmental intracellular bacteria: the battle with a eukaryotic cell. FEMS Microbiol Rev. 2022;46:fuac012.

Article PubMed Google Scholar

Kirby H. Protozoa in termites of the genus Amitermes. Parasitology. 1932;24:289–304.

Article Google Scholar

Chabe M, Lokmer A, Segurel L. Gut protozoa: friends or foes of the human gut microbiota? Trends Parasitol. 2017;33:925–34.

Article PubMed Google Scholar

Levy B, Jami E. Exploring the prokaryotic community associated with the rumen ciliate protozoa population. Front Microbiol. 2018;9:e2526.

Article Google Scholar

Andersen TO, Altshuler I, Vera-Ponce de Leon A, Walter JM, McGovern E, Keogh K, et al. Metabolic influence of core ciliates within the rumen microbiome. ISME J. 2023;17:1128–40.

Article CAS PubMed PubMed Central Google Scholar

Solomon R, Wein T, Levy B, Eshed S, Dror R, Reiss V, et al. Protozoa populations are ecosystem engineers that shape prokaryotic community structure and function of the rumen microbial ecosystem. ISME J. 2022;16:1187–97.

Article PubMed Google Scholar

Heinz E, Hacker C, Dean P, Mifsud J, Goldberg AV, Williams TA, et al. Plasma membrane-located purine nucleotide transport proteins are key components for host exploitation by microsporidian intracellular parasites. PLoS Pathog. 2014;10:e1004547.

Article PubMed PubMed Central Google Scholar

Schon ME, Martijn J, Vosseberg J, Kostlbacher S, Ettema TJG. The evolutionary origin of host association in the Rickettsiales. Nat Microbiol. 2022;7:1189–99.

Article PubMed PubMed Central Google Scholar

Canova MJ, Molle V. Bacterial serine/threonine protein kinases in host-pathogen interactions. J Biol Chem. 2014;289:9473–9.

Article CAS PubMed PubMed Central Google Scholar

Moran NA, Bennett GM. The tiniest tiny genomes. Annu Rev Microbiol. 2014;68:195–215.

Article CAS PubMed Google Scholar

Nowack EC, Melkonian M, Glockner G. Chromatophore genome sequence of Paulinella sheds light on acquisition of photosynthesis by eukaryotes. Curr Biol. 2008;18:410–8.

Article CAS PubMed Google Scholar

Joyce CM, Grindley ND. Method for determining whether a gene of Escherichia coli is essential: application to the polA gene. J Bacteriol. 1984;158:636–43.

Article CAS PubMed PubMed Central Google Scholar

Kogoma T. Stable DNA replication: Interplay between DNA replication, homologous recombination, and transcription. Microbiol Mol Biol Rev. 1997;61:212–38.

CAS PubMed PubMed Central Google Scholar

Akman L, Yamashita A, Watanabe H, Oshima K, Shiba T, Hattori M, et al. Genome sequence of the endocellular obligate symbiont of tsetse flies, Wigglesworthia glossinidia. Nat Genet. 2002;32:402–7.

Article CAS PubMed Google Scholar

Ran L, Larsson J, Vigil-Stenman T, Nylander JA, Ininbergs K, Zheng WW, et al. Genome erosion in a nitrogen-fixing vertically transmitted endosymbiotic multicellular cyanobacterium. PLoS One. 2010;5:e11486.

Article PubMed PubMed Central Google Scholar

Sandler SJ. Requirements for replication restart proteins during constitutive stable DNA replication in Escherichia coli K-12. Genetics. 2005;169:1799–806.

Article CAS PubMed PubMed Central Google Scholar

Hong X, Cadwell GW, Kogoma T. Escherichia coli RecG and RecA proteins in R-loop formation. EMBO J. 1995;14:2385–92.

Article CAS PubMed PubMed Central Google Scholar

Desai MS, Strassert JF, Meuser K, Hertel H, Ikeda-Ohtsubo W, Radek R, et al. Strict cospeciation of devescovinid flagellates and Bacteroidales ectosymbionts in the gut of dry-wood termites (Kalotermitidae). Environ Microbiol. 2010;12:2120–32.

CAS PubMed Google Scholar

Hongoh Y, Deevong P, Hattori S, Inoue T, Noda S, Noparatnaraporn N, et al. Phylogenetic diversity, localization, and cell morphologies of members of the candidate phylum TG3 and a subphylum in the phylum Fibrobacteres, recently discovered bacterial groups dominant in termite guts. Appl Environ Microbiol. 2006;72:6780–8.

Article CAS PubMed PubMed Central Google Scholar

Download references

We are grateful to Gaku Tokuda, Kumiko Kihara, and Yukihiro Kinjo for cooperation in termite collection. We also thank Michiru Shimizu, Katsura Igai, and Tomoyuki Sato for assisting with experiments, and Osamu Kanazawa for checking the Latin grammar of scientific names. Sanger sequencing was done in the RIKEN BSI and in the Biomaterial Analysis Center in the Tokyo Institute of Technology. TEM was assisted by Keiko Ikeda in the latter facility. This study was financially supported by NEXT and KAKENHI grants from JSPS to YHongoh (GS009, 22241046, 16H04840, 20H02897, 20H05584, 22K19342) and to MO (17H01447 and 19H05689), by JST-CREST (14532219) to YHongoh, and also by JST SPRING (JPMJSP2106) to KT.

School of Life Science and Technology, Tokyo Institute of Technology, Tokyo, 152–8550, Japan

Kazuki Takahashi, Hirokazu Kuwahara, Yutaro Horikawa, Kazuki Izawa, Daiki Kato, Tatsuya Inagaki & Yuichi Hongoh

Japan Collection of Microorganisms, RIKEN BioResource Research Center, Tsukuba, 305–0074, Japan

Masahiro Yuki, Moriya Ohkuma & Yuichi Hongoh

You can also search for this author in PubMed Google Scholar

You can also search for this author in PubMed Google Scholar

You can also search for this author in PubMed Google Scholar

You can also search for this author in PubMed Google Scholar

You can also search for this author in PubMed Google Scholar

You can also search for this author in PubMed Google Scholar

You can also search for this author in PubMed Google Scholar

You can also search for this author in PubMed Google Scholar

You can also search for this author in PubMed Google Scholar

YHongoh and MO conceived research and provided equipment and reagents; YHongoh and KT designed research; YHongoh, TI, HK and KT collected termites; KT, HK and MY sequenced genomes; KT, YHorikawa and YHongoh performed FISH and Sanger sequencing; KI, HK and KT performed TEM; KT, YHongoh, DK and TI analyzed data; KT and YHongoh wrote the paper with input from all authors.

Correspondence to Kazuki Takahashi or Yuichi Hongoh.

The authors declare no competing interests.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and Permissions

Takahashi, K., Kuwahara, H., Horikawa, Y. et al. Emergence of putative energy parasites within Clostridia revealed by genome analysis of a novel endosymbiotic clade. ISME J (2023). https://doi.org/10.1038/s41396-023-01502-0

Download citation

Received: 26 March 2023

Revised: 21 August 2023

Accepted: 23 August 2023

Published: 31 August 2023

DOI: https://doi.org/10.1038/s41396-023-01502-0

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

SHARE