Introduction
Chlorella vulgaris is a green microalga in division Chlorophyta characterized by fast growth [8] and adaptation to various culture conditions [15]; it contains numerous useful com- pounds, such as lipids and proteins. Recently, Chlorella has received considerable attention regarding its potential application in aquaculture and the production of biofuels and nu- trients, and possible utility for treating wastewater. Recently, the genome of C. vulgaris UTEX 395 was sequenced and deposited in the NCBI database [5], which enabled annotation of different C. vulgaris genes.
Higher plants, algae, bacteria, and yeast assimilate nitrate as a source of nitrogen. First, the nitrate is reduced to nitrite, which is then converted into ammonia and used as a constituent of amino acids, which are involved in protein and nucleic acid synthesis. The initial step in nitrate assimilation is catalyzed by the nitrate reductase (NR) gene, which is an NAD(P)H-dependent enzyme that plays a major role in nitrogen metabolism [4]. The NR gene has been characterized in plants, fungi, yeast, and algae. It is a complex protein containing three prosthetic groups: FAD, molybdopterin, and NADH. Its expression is controlled at different levels, including transcription and enzyme activity. The degradation and activity of NR protein are influenced by environmental factors, such as repression of nitrate reductase in the presence of ammonia; NR protein is induced by nitrate in higher plants, fungi, and green algae [9].
Recently, our laboratory acquired a new C. vulgaris strain PKVL7422 (KCTC1.331BP), which has higher homology (99.88%) to C. vulgaris based on their 18S RNA sequences. The new strain of C. vulgaris is characterized by fast growth, ease of culture, and cultivability in the dark. However, there is no information on the various genes, or their sequences, for this new strain. Therefore, this study characterized this new alga and its NR gene by constructing its full-length cDNA and analyzing its structure.
Materials and Methods
Algae culture
C. vulgaris PKVL7422 was cultured in a 250 ml flask containing 100 ml of BG-11 medium at 20℃, with a light source of 50 µmol photons m-2 s-1 in an orbital shaker (HB-203L, Korea) at 80 rpm. The cells were collected at the mid-log phase and flash-frozen in liquid nitrogen.
RNA extraction and cDNA construction
RNA was extracted using TRIzol reagent (Invitrogen, USA), following the manufacturer’s instructions. After ex- traction, genomic DNA was removed using the TURBO™ DNase kit (Invitrogen). The RNA concentration, and integrity thereof, were confirmed using a NanoDrop fluorometer and gel electrophoresis. The first cDNA strands were constructed using the SuperScript™ VILO™ cDNA Synthesis Kit (Invi- trogen). Primers spanning the full cDNA were designed based on the sequence predicted from our previous transcriptome study [1], and designated NR-F (ATGACAGTGCTCCTGGC AGG) and NR-R (TCAAAACTGAATGCACTGCTC TGG). The constructed cDNA was used as a template for PCR and the PCR product was purified using a gel extraction kit (GeneAll, South Korea). Then, the DNA fragments were ligated into pMD19 vector (Takara, Japan). The ligate was transformed into Escherichia coli DH5α and plated on LB- agar plates containing 100 μg/ml of ampicillin. The plasmid DNA extracted from transformed E. coli DH5α were sent for sequencing at Bionics (South Korea). Then, the BLASTp was used to analyze the deduced amino-sequence of NR gene of C. vulgaris PKVL7422 with an E-value cutoff of 10-4 against the non-redundant protein sequences of NCBI database.
The 5' and 3' ends of the NR cDNA were cloned by rapid amplification of cDNA ends (RACE) PCR, following the manufacturer’s instructions for the SMARTer RACE 5′/3′ Kit (Clontech, USA). From the cDNA sequence amplified using NR-F and NR-R, two specific primers spanning the 5' and 3' ends (NR-3GSP-F, CATTGGAGAGCTGGC AGAAGAG GGGC and NR-5GSP-R, CTCACGCGCACGCCTGTCCAG GTGGA) were designed. The PCR product was ligated into T-vector and submitted for sequencing.
Genomic DNA extraction and sequencing of the NR gene
Genomic DNA was extracted from C. vulgaris using the cetyltrimethylammonium bromide (CTAB) method [6], with some modifications. Briefly, cells were collected by centrifugation at 3,500× g for 10 min, snap frozen using liquid nitrogen, pulverized with mortar, and then transferred to warm CTAB buffer (1.5% w/v of CTAB, 1.4 M NaCl, 2 mM EDTA, 100 mM Tris HCl pH 8.0, 2% v/v β-mercaptoetha- nol). The extract was vortexed and incubated at 65℃ for 1 hr with shaking every 10 min. The genomic DNA was ex-tracted using phenol/chloroform/isoamyl alcohol (25:24:1) and precipitated using ethanol after removing contaminating RNA with RNase A. The NR gene was amplified using the primers NR-F and NR-R and subcloned into pMD19, which was sent for sequencing. The cDNA and genomic DNA sequences of the NR gene were analyzed using ClustalX [13] and the introns were identified. The amino acid sequences of the NR genes of reference organisms were retrieved from GenBank and used to construct a phylogenetic tree with the neighbor-joining method using Mega X software [7]. The domain analysis of NR protein was performed by using the SMART program (http://smart.embl-heidelberg.de/) to identify the active and other functional domains in the protein database [10].
Results
Characterization of the NR gene
The full cDNA of NR was amplified and cloned from C. vulgaris PKVL7422; it was 2,946 bp in length with an 80-bp 5' untranslated region, a 232-bp 3' untranslated region, a 2,634-bp open reading frame (ORF), and a polyadenylation signal (TGTAA) (Fig. 1). The NR ORF encodes an 877-ami- no-acid protein with a putative molecular weight of 96.0 kDa. SMART program analysis [10] showed that the NR protein contains five conserved domains, including the molybdopterin binding, dimer interface, FAD binding, NADH binding, and heme binding domains (Fig. 1). The NR gene structure was determined by cloning the NR gene from genomic DNA that was submitted for sequencing. The alignment of the cDNA sequence with the genomic sequence showed that NR possesses 19 exons with 18 introns and consists of 7,364 nucleotides (Fig. 2). The gene sequence was deposited at Gene Bank with accession number MN627215)
Fig. 1. Nucleotide and deduced amino acid sequences of NR for C. vulgaris PKVL7422. The molybdopterin binding domain is highlighted in light grey, the dimer binding domain is highlighted in dark grey, FAD binding domain is underlined, and the NADH binding domain is highlighted in green. The polyadenylation signal (TGTAA) is in bold.
Fig. 2. Genomic structure of NR gene in the genome of C. vulgaris PKVL7422. Nineteen exons and eighteen introns are shown. Each exon is represented as a black box and each intron is represented as a line.
Phylogenetic analysis and multiple sequence alignment of NR protein
The homology alignment of the deduced NR protein showed high similarity with several species of green algae, including C. vulgaris (97%), C. variabilis (80.02%), Chlamydomonas reinhardtii (55.59%), and Dunaliella salina (51.92 %). Furthermore, the phylogenetic tree analysis showed that the NR gene of C. vulgaris PKVL7422 was more closely related to the green algae NR genes than to those of higher plants (Fig. 3). The NR gene clustered with Chlamydomonas reinhardtii and was very similar to the NR gene of C. vulgaris. The alignment of the NR protein with sequences from other species showed high conservation of domains with the plant domains, including the molybdopterin binding, dimer interface, heme binding, FAD binding, and NADH binding domains (Fig. 4).
Fig. 3. Phylogenetic tree analysis based on amino acid of NR of C. vulgaris PKVL7422. The tree shows that C. vulgaris PKVL7422 NR gene belongs to the cluster of algal NRs. The Genbank accession number of all NR sequences are indicated.
Fig. 4. Alignment of amino acid of algal NRs and Arabidopsis thaliana. The arrows indicate the beginning and end of conserved domain: molybdopterin binding domain (Mo-protein), dimer binding domain (Dimer interface), Heme for heme binding domain, FAD for FAD binding domain, and NADH for NADH binding domain. The five sequences aligned were Chlamydomonas reinhardtii (XP_001696697), Volvox carteri (XP_002955156), Dunaliella salina (AAP75705), Chlorella vulgaris PKVL7422, and Arabidopsis thaliana (NP_177899).
Discussion
In this study, the full-length NR gene was cloned and sequenced using PCR amplification and primers designed from the C. vulgaris PKVL7422 transcriptome. The sequenced gene consists of 19 exons and 18 introns, with a 2,634-bp ORF that encodes an 877-amino-acid protein and contains five conserved domains: the oxidoreductase molybdopterin binding, NAD(P)H-flavin reductase, FAD binding, dimer binding, and heme binding domains. These five domains are present in the NRs of plants such as Arabidopsis thaliana [12] and algae [2]. The high number of introns (18 introns) is characteristic of algae compared with other eukaryotes [14], and might be explained by post-transcriptional processing [3]. Furthermore, a BLAST search of the NR protein showed that the oxidoreductase molybdopterin binding do-main belongs to the sulfite oxidase family; moreover, the NR protein has high homology with that of the C. vulgaris NR protein (97%) and other Chlorella (C. variabilis and C. sor- okiniana), which suggests that the deduced amino acid sequence from C. vulgaris PKVL7422 belongs to the NR family.
In the phylogenetic tree, the NR protein clustered with the green algae NR and was separated from plant NRs. Stolz and Basu [11] analyzed different eukaryotic NR genes based on their phylogenetic relationships and suggested that the NR protein originated early during evolution and divided into two main groups: fungal NR and plant and algae NR [11]. The alignment of NR protein with algal NR and plant NR (Fig. 4) shows the conservation of functional domains in algal and plant NRs.
In conclusion, the NR gene and cDNA of C. vulgaris PKVL7422 were sequenced, and its genomic structure and functional domains were characterized. This work characterized the NR gene in a new strain of C. vulgaris and should increase our understating of the nitrate assimilation mecha- nism.
Acknowledgement
This work was supported by a Research Grant from Pukyong National University (2021).
The Conflict of Interest Statement
The authors declare that they have no conflicts of interest with the contents of this article.
References
- Abdellaoui, N., Kim, M. J. and Choi, T. J. 2019. Transcriptome analysis of gene expression in Chlorella vulgaris under salt stress. World J. Microbiol. Biotechnol. 35, 141.
- Astier, J., Rossi, J., Chatelain, P., Klinguer, A., Besson-Bard, A., Rosnoblet, C. and Wendehenne, D. 2021. Nitric oxide production and signalling in algae. J. Exp. Bot. 72, 781-792. https://doi.org/10.1093/jxb/eraa421
- Baier, T., Jacobebbinghaus, N., Einhaus, A., Lauersen, K. J. and Kruse, O. 2020. Introns mediate post-transcriptional enhancement of nuclear gene expression in the green microalga Chlamydomonas reinhardtii. PLoS Genet. 16, e1008944.
- Chamizo-Ampudia, A., Sanz-Luque, E., Llamas, A., Galvan, A. and Fernandez, E. 2017. Nitrate reductase regulates plant nitric oxide homeostasis. Trends Plant Sci. 22, 163-174. https://doi.org/10.1016/j.tplants.2016.12.001
- Guarnieri, M. T., Levering, J., Henard, C. A., Boore, J. L., Betenbaugh, M. J., Zengler, K. and Knoshaug, E. P. 2018. Genome sequence of the oleaginous green alga, Chlorella vulgaris UTEX 395. Front. Bioeng. Biotechnol. 6, 37.
- Jagielski, T., Gawor, J., Bakula, Z., Zuchniewicz, K., Zak, I. and Gromadka, R. 2017. An optimized method for high quality DNA extraction from microalga Prototheca wickerhamii for genome sequencing. Plant Methods 13, 1-8. https://doi.org/10.1186/s13007-016-0152-4
- Kumar, S., Stecher, G,. Li, M., Knyaz, C. and Tamura, K. 2018. MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 35, 1547-1549. https://doi.org/10.1093/molbev/msy096
- Metsoviti, M. N., Papapolymerou, G., Karapanagiotidis, I. T. and Katsoulas, N. 2019. Comparison of growth rate and nutrient content of five microalgae species cultivated in greenhouses. Plants 8, 279.
- Sanz-Luque, E., Chamizo-Ampudia, A., Llamas, A., Galvan, A. and Fernandez, E. 2015. Understanding nitrate assimilation and its regulation in microalgae. Front. Plant Sci. 6, 899.
- Schultz, J. 2000. SMART: a web-based tool for the study of genetically mobile domains. Nucleic Acids Res. 28, 231-234. https://doi.org/10.1093/nar/28.1.231
- Stolz, J. F. and Basu, P. 2002. Evolution of nitrate reductase: molecular and structural variations on a common function. ChemBioChem. 3, 198-206. https://doi.org/10.1002/1439-7633(20020301)3:2/3<198::AID-CBIC198>3.0.CO;2-C
- Theologis, A., Ecker, J. R., Palm, C. J., Federspiel, N. A., Kaul, S., White, O. and Brooks, S. Y. 2000. Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana. Nature 408, 816-820. https://doi.org/10.1038/35048500
- Thompson, J. D., Gibson, T. J., Plewniak, F., Jeanmougin, F. and Higgins, D. G. 1997. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 25, 4876-4882. https://doi.org/10.1093/nar/25.24.4876
- Zhou, J. and Kleinhofs, A. 1996 Molecular evolution of nitrate reductase genes. J. Mol. Evol. 42, 432-442. https://doi.org/10.1007/BF02498637
- Zuniga, C., Li, C. T., Huelsman, T., Levering, J., Zielinski, D. C., McConnell, B. O. and Betenbaugh, M. J. 2016. Genome-scale metabolic model for the green alga Chlorella vulgaris UTEX 395 accurately predicts phenotypes under autotrophic, heterotrophic, and mixotrophic growth conditions. Plant Physiol. 172, 589-602. https://doi.org/10.1104/pp.16.00593