Research Article |
Corresponding author: Elif Yılmaz ( elfyilmaz38@gmail.com ) Academic editor: Bing Liu
© 2024 Elif Yılmaz, David G. Mann, Romain Gastineau, Rosa Trobajo, Cüneyt Nadir Solak, Ewa Górecka, Monique Turmel, Claude Lemieux, Nesil Ertorun, Andrzej Witkowski.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Yılmaz E, Mann DG, Gastineau R, Trobajo R, Solak CN, Górecka E, Turmel M, Lemieux C, Ertorun N, Witkowski A (2024) Description of Navicula vanseea sp. nov. (Naviculales, Naviculaceae), a new species of diatom from the highly alkaline Lake Van (Republic of Türkiye) with complete characterisation of its organellar genomes and multigene phylogeny. PhytoKeys 241: 27-48. https://doi.org/10.3897/phytokeys.241.118903
|
The current article describes Navicula vanseea sp. nov., a new species of diatom from Lake Van, a highly alkaline lake in Eastern Anatolia (Türkiye). The description is based on light and scanning electron microscopy performed on two monoclonal cultures. The complete nuclear rRNA clusters and plastid genomes have been sequenced for these two strains and the complete mitogenome for one of them. The plastome of both strains shows the probable loss of a functional ycf35 gene. They also exhibit two IB4 group I introns in their rrl, each encoding for a putative LAGLIDADG homing endonuclease, with the first L1917 IB4 intron reported amongst diatoms. The Maximum Likelihood phylogeny inferred from a concatenated alignment of 18S, rbcL and psbC distinguishes N. vanseea sp. nov. from the morphologically similar species Navicula cincta and Navicula microdigitoradiata.
Group I intron, LAGLIDADG, mitogenome, Naviculaceae, plastome, pseudogene, soda lake
Lake Van is located in Eastern Anatolia, Turkey (Republic of Türkiye). It is Turkey’s largest inland water body and also world’s largest soda lake. The lake is surrounded by dormant volcanoes and its formation was a consequence of the eruption of the Nemrut stratovolcano (not to be confused with the Nemrut Mountain, also in Turkey), which is 2247 m above sea level. As a result of the erosion of volcanic rocks in the catchment and evaporation, the lake water is salty (21.4‰) and alkaline (155 m mEq–1, pH 9.81) (
The genus Navicula is amongst the most species-rich genera of Bacillariophyceae, although this is partly because it was used for a long time as a ‘catch-all’ for simply structured, bilaterally symmetrical raphid diatoms. It was erected as early as 1822 by Bory in his ‘Dictionnaire Classique d’Histoire Naturelle’ (
The only account ever published on the diatoms from Lake Van was written by
Navicula species are rather well documented in inland waters where they are known for their bioindicator potential (
Preliminary results from a new sampling campaign conducted in 2021 in Lake Van strongly suggested that the biodiversity of diatoms had been underestimated in the previous work of
The aim of the following article is to formally describe Navicula vanseea sp. nov. from Lake Van. The description combines the use of light microscopy (live specimen and cleaned frustules) and scanning electron microscopy. The complete cluster of nuclear ribosomal RNA genes and the complete plastid genome were obtained for both strains by means of next generation sequencing and also the mitogenome of one of these strains. These results were included in a multigene Maximum Likelihood phylogeny which unambiguously separated Navicula vanseea sp. nov. from morphologically similar known species, whose differences with Navicula vanseea sp. nov. are discussed. As it was the first time that a L1917 group I intron with its putative LAGLIDADG homing endonuclease gene had been discovered in the plastid genome of a diatom, special attention has been paid to this feature, with a phylogeny of the putative LAGLIDADG protein being performed.
Epilithic samples were collected by brushing rocks in the littoral of Lake Van in July 2021, in the vicinity of Erciş Municipality (Fig.
Map of the sampling location A location of Lake Van in Turkey. The red frame indicates the position of Lake Van B general view of the lake. The pin indicates the position of the sampling area C photo of the epilithic sampling area on the rock (Esri. (2023). ArcGIS Pro 3.1.0. Environmental Systems Research Institute).
Pictures of living diatoms were taken using a Light Microscope (LM) Zeiss Axio Scope A1 (Carl Zeiss, Jena, Germany at a magnification of 1000× by transferring diatom cultures directly on to the glass slide.
To prepare cleaned frustules for microscopy, 5 ml of monoclonal cultures were transferred into 20 ml beakers with 10 ml of 10% hydrochloric acid (HCl). After 24h, samples were washed four times with distilled water then re-suspended in 30% hydrogen peroxide (H2O2) and boiled for about four hours. Finally, samples were washed again four times with distilled water. For LM, cleaned material was then air-dried on cover glasses and mounted on glass slide with Naphrax® (Brunel Microscopes Ltd., Chippenham, UK) solution and pictures were taken with the Zeiss Axio Scope A1. For SEM, a drop of cleaned sample was deposited on a Nuclepore Track-Etch membrane from Whatman (Maidstone, England). The membranes were air-dried overnight, mounted on aluminium stubs with carbon tape and coated with gold using a Q150T coater from Quorum Technologies (Laughton, U.K.). SEM observations were made at the Faculty of Chemical Technology and Engineering, Western Pomeranian University of Technology in Szczecin (Poland), using a Hitachi SU8020 (Tokyo, Japan) and Eskişehir Technical University (Türkiye) using a ZEISS Ultra microscope (Oberkochen, Germany).
DNA was extracted from clones SZCZEY2172 and SZCZEY2262 using the protocol of
The three gene datasets (18S, rbcL and psbC) already used in previous publications (
For the phylogeny of the putative LAGLIDADG endonuclease proteins, protein sequences found in IB4 - L1917 and - L1931 introns presented in
Holotype
: Slide number SZCZEY2172 in the collection of Andrzej Witkowski at the University of Szczecin, Poland. Valves representing the holotype population are illustrated in Fig.
Isotype : Slide number TR_Erciş_Van_2021 deposited in Kütahya Dumlupınar University (Turkey).
Erciş Van, Türkiye (38°59'47.3"N 43°24'15.3"E) collected by: Elif Yilmaz, 31 July 2021.
The name given to the species refers to the German name of Lake Van (Vansee, the sea of Van) as it was used in the work of Legler and Krasske and is meant as a tribute to these authors and their work.
The taxon was exclusively observed within benthic epilithic assemblages in Lake Van (salinity 21.4‰ and pH 9.5).
LM
(Fig.
SEM External valve surface
(Figs
SEM micrographs of Navicula vanseea sp. nov. SZCZEY2172 A external view of the entire valve B details of central area showing simple, slightly drop-shaped proximal raphe endings and shortened striae C, D details of the two apices of a single valve showing the terminal fissures E internal view of the entire valve F details of central area showing filiform proximal raphe endings in a fusiform expansion of the raphe-sternum G, H details of apices showing well-developed helictoglossae showing two isolated lineolae (white arrows). Scale bars: 10 μm (A, E); 3 μm (B–D, F–H).
SEM micrographs of Navicula vanseea sp. nov. SZCZEY2262 A external view of the entire valve B details of central area showing simple proximal raphe endings and shortened striae C details of apex showing the terminal fissure D, E internal view of two entire valves, showing the central area and filiform proximal raphe endings F details of apex showing well-developed helictoglossae G, H girdle view of valves showing continuous areolation on mantle and two isolated lineolae (white arrows). Scale bars: 5 μm (A, D, E, G, H); 3 μm (B, C, F).
SEM Internal valve surface
(Figs
The complete rRNA gene cluster was sequenced for both clones and deposited in NCBI GenBank with accession numbers OR797294 (SZCZEY2172) and OR797293 (SZCZEY2262). The cluster is 4902 bp long, distributed as follows: 18S – 1792 bp, ITS1 – 195 bp, 5.8S – 155 bp, ITS2 – 260 bp, 28S – 2500 bp. Comparing the two clones, there was one single nucleotide polymorphism (SNP) found in the 18S (in the V2 region), three in the ITS1, one in the 5.8S, three in the ITS2 and two in the 28S (both in the D1/D2 region).
A 43997 bp contig corresponding to the mitochondrial genome was retrieved for strain SZCZEY2262, but could not be circularised because of the presence of repeated sequences at its ends. However, for easier reading, it is displayed as circular on the map (Fig.
Despite several attempts, it was impossible to assemble the mitogenome of strain SZCZEY2172. Lowering the k-mer parameter to 75 only allowed the recovery of a short ca. 500 bp fragment with a low coverage. This fragment was used as a seed to try an assembly with NOVOPlasty 4.3.3 (
The plastome is 158,005 bp long for SZCZEY2262 (Fig.
For SZCZEY2172 (GenBank: OR795086), the LSC is 72,913 bp long and has an identical gene content compared to SZCZEY2262, the SSC is 49727 bp long and encodes 51 conserved protein-coding genes, eight tRNAs and six non-conserved ORFs of more than 100 AA. The IR is 17,675 bp long and has an identical gene and intron content to SZCZEY2262, with the same overlap of rbcR between the IRB and the SSC.
Both genomes contain a 43 AA ORF in their SSC that cannot be extended because of the presence of stop-codons. This ORF shows similarities to the hypothetical chloroplast RF35 encoded by ycf35, which is missing between both strains. The position of this ORF also corresponds to the position of ycf35 in Navicula veneta, between clpC and rps13 (
It is worth noting that, in addition to the differences in length and content in the non-conserved ORFs, there is a slight degree of extra polymorphism in the two strains, the extent depending on the part of the genome considered. There were only two SNPs in the inverted repeat (one in the spacer between ycf45 and tRNA-Pro and the other inside rbcR). On the other hand, a gene such as psbC displayed three SNPs, two of them silent, but one leading to a phenylalanine–leucine substitution. The two xerC genes, although present in both strains, differed in length.
The subtree containing Naviculaceae (Fig.
Once rooted with sequence ABR25263, the phylogenetic tree of LAGLIDADG proteins (Fig.
Maximum Likelihood phylogenetic tree inferred from the alignment of the putative LAGLIDADG endonuclease proteins found in the group I introns of Navicula vanseea sp. nov. and other taxa. The type of genome is indicated between brackets: cp – plastome, mt – mitogenome, bact – bacteria, cyan – cyanobacteria.
Navicula vanseea sp. nov. has elliptic valves that taper towards cuneately rounded apices in smaller specimens and linear-elliptic-lanceolate with narrowly rounded, protracted endings in larger specimens (Table
Navicula vanseea sp. nov. | Navicula meulemansii | Navicula microdigitoradiata | Navicula cincta | Navicula cariocincta | Navicula veronensis | Navicula dealpina | |
---|---|---|---|---|---|---|---|
Valve length (µm) | 11.5–28.5 | 12–30 | 15–40 | 14–45 | 30–50 | 19–40 | 26–86 |
Valve width (µm) | 5–6 | 3.5–5.5 | 5–7 | 5.5–8 | 5.5–7 | 4–7 | 8–12 |
Stria density (in 10 µm) | 12–13 | 15–21 | 10–11 | 8–10 | 10–12 | 11–13 | 8–10 |
Striae around central area | irregularly shortened | irregularly shortened | single shortened on either side | irregularly shortened | irregularly shortened | 2–4 striae distinctly shortened on either side | irregularly shortened |
Lineolae density (in 10 µm) | ca. 50 | ca. 44–51 | -- | ca. 40 | ca. 30 | ca. 50 | ca. 26 (LM) |
Valve shape | smaller elliptic, larger linear-elliptic-lanceolate | elliptic-lanceolate | elliptic-lanceolate to linear-lanceolate | elliptic to lanceolate to linear-elliptic-lanceolate | linear- elliptic -lanceolate | linear-lanceolate | linear-lanceolate to almost elliptical |
Central area | elliptic and small | very small and asymmetric | elliptic and very small | small | large, transversely rectangular to elliptical | broadly rectangular or transapically elliptic | almost symmetrical, transversally rectangular |
Valve apices | narrowly rounded | cuneately rounded | obtusely rounded | obtusely rounded | narrowed to a wedge, obtusely- rounded, never protracted | gradually wedge-shaped, finally obtusely rounded | obtusely wedge-shaped |
Raphe | filiform | weakly lateral; central pores very close together | weakly lateral, central pores very close | filiform | strongly radiate | filiform | weakly to strongly lateral, lying outside the median area towards the central pores |
References | this study | Mertens et al. (2014) |
|
|
|
|
|
In relation to the three taxa mentioned in the introduction as having been found in Van Lake by
The organellar genomes, especially the plastomes, show some interesting features. For example, introns are not considered to be conserved genetic elements and are known to vary amongst isolates of a single species (e.g.
In the plastome of N. vanseea, the ycf35 gene has seemingly been turned to a pseudogene, which would be the first time to our knowledge that this has been observed in diatoms, although ycf35 pseudogenes have been observed in Rhodophyta (
Our study also illustrates the added value that next generation sequencing provides when describing new species, in three ways. First, it is a convenient way to gather data for multigene phylogenies, whatever the species considered. Second, in the current case with N. vanseea, it made it possible to find SNPs in supposedly conserved genes of two sympatric strains of the same species. This needs to be taken into account in interpreting phylogenetic and metabarcoding analyses. Third, serendipitous discoveries can occur that increase our knowledge of the organellar genomes of diatoms and other stramenopiles, such as the loss of a functional ycf35 gene here or the first documented L1917 intron found in a stramenopile.
This work started under the supervision of our beloved colleague and mentor, the late Prof. Dr. Hab. Andrzej Witkowski, who passed away on 17 September 2023. His guidance is already deeply missed.
The authors have declared that no competing interests exist.
No ethical statement was reported.
This work was co-financed by the Minister of Science under the “Regional Excellence Initiative” Program (2024–2027). Claude Lemieux and Monique Turmel were supported by grant RGPIN-2017-04506 from the Natural Sciences and Engineering Research Council of Canada (NSERC). We also acknowledge support from the CERCA Programme/Generalitat de Catalunya. The Royal Botanic Garden Edinburgh is supported by the Scottish Government’s Rural and Environment Science and Analytical Services Division.
Conceptualisation: EY. Data curation: EY. Funding acquisition: AW. Investigation: EY, RG, CNS, EG, CL, MT, NE. Methodology: AW, RT, DGM. Project administration: CNS, AW. Supervision: AW, DGM, RT, CNS, RG. Visualisation: EY, NE, CNS, EG. Writing - original draft: EY. Writing - review and editing: DGM, RT, CL, MT, CNS, RG.
Elif Yılmaz https://orcid.org/0000-0001-7939-1814
David G. Mann https://orcid.org/0000-0003-0522-6802
Romain Gastineau https://orcid.org/0000-0001-8661-5118
Rosa Trobajo https://orcid.org/0000-0001-9498-3797
Cüneyt Nadir Solak https://orcid.org/0000-0003-2334-4271
Ewa Górecka https://orcid.org/0000-0003-0590-7480
Monique Turmel https://orcid.org/0000-0001-7060-035X
Claude Lemieux https://orcid.org/0000-0001-9580-8042
Nesil Ertorun https://orcid.org/0000-0001-6224-7314
Andrzej Witkowski https://orcid.org/0000-0003-1714-218X