Browse > Article
http://dx.doi.org/10.5010/JPB.2016.43.3.293

De novo gene set assembly of the transcriptome of diploid, oilseed-crop species Perilla citriodora  

Kim, Ji-Eun (SEEDERS Inc.)
Choe, Junkyoung (SEEDERS Inc.)
Lee, Woo Kyung (SEEDERS Inc.)
Kim, Sangmi (SEEDERS Inc.)
Lee, Myoung Hee (National Institute of Crop Science, RDA)
Kim, Tae-Ho (National Academy of Agricultural Science, RDA)
Jo, Sung-Hwan (SEEDERS Inc.)
Lee, Jeong Hee (SEEDERS Inc.)
Publication Information
Journal of Plant Biotechnology / v.43, no.3, 2016 , pp. 293-301 More about this Journal
Abstract
High-quality gene sets are necessary for functional research of genes. Although Perilla is a commonly cultivated oil crop and vegetable crop in Southeast Asia, the quality of its available gene set is insufficient. To construct a high-quality Perilla gene set, we sequenced mRNAs extracted from different tissues of Perilla citriodora, the wild species (2n = 20) of Perilla. To make a high-quality gene set for P. citriodora, we compared the quality of assemblies produced by Velvet and Trinity, the two well-known de novo assemblers, and improved the de novo assembly pipeline by optimizing k-mers and removing redundant sequences. We then selected representative transcripts for loci according to several criteria. The improved assembly yielded a total of 86,396 transcripts and 38,413 representative transcripts. We evaluated the assembled transcripts by comparing them to 638 homologous Arabidopsis genes involved in fatty acid and TAG biosynthesis pathways. High proportions of full-length genes and transcripts in the assembled transcripts matched known genes in other species, indicating that the P. citriodora gene set can be applied in future functional studies. Our study provides a reference P. citriodora gene set for further studies. It will serve as valuable genetic resource to elucidate the molecular basis of various metabolisms.
Keywords
Perilla citriodora; Transcriptome; de novo assembly; gene set; fatty acid biosynthesis; TAG (triacylglyceride) biosynthesis metabolic pathway; oilseed crop;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Chen G, Yin K, Wang C, Shi T. (2011a) De novo transcriptome assembly of RNA-Seq reads with different strategies. Sci China Life Sci 54(12), 1129-1133   DOI
2 Chen G, Li R, Shi L, Qi J, Hu P, Luo J, et al. (2011b) Revealing the missing expressed genes beyond the human reference genome by RNA-Seq. BMC Genomics 12, 590   DOI
3 Cox MP, Peterson DA, Biggs PJ. (2010) SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data. BMC Bioinformatics 11, 485   DOI
4 Fukushima A, Nakamura M, Suzuki H, Saito K, Yamazaki M. (2015) High-Throughput Sequencing and De Novo Assembly of Red and Green Forms of the Perilla frutescens var. crispa Transcriptome. PLoS One 10(6), e0129154   DOI
5 Garber M, Grabherr MG, Guttman M, Trapnell C. (2011) Computational methods for transcriptome annotation and quantification using RNA-seq. Nat Methods 8(6), 469-477   DOI
6 Gongora-Castillo E, Buell CR. (2013) Bioinformatics challenges in de novo transcriptome assembly using short read sequences in the absence of a reference genome sequence. Nat Prod Rep 30(4), 490-500   DOI
7 Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29(7), 644-652   DOI
8 Harris MA, Clark J, Ireland A, Lomax J, Ashburner M, Foulger R, et al. (2004) The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res 32(Database issue), D258-261   DOI
9 Iorizzo M, Senalik DA, Grzebelus D, Bowman M, Cavagnaro PF, Matvienko M, et al. (2011) De novo assembly and characterization of the carrot transcriptome reveals novel genes, new markers, and genetic diversity. BMC Genomics 12, 389   DOI
10 Ito M, Kiuchi F, Yang LL, Honda G. (2000) Perilla citriodora from Taiwan and its phytochemical characteristics. Biol Pharm Bull 23(3), 359-362   DOI
11 Kim HA, Lim CJ, Kim S, Choe JK, Jo SH, Baek N, et al. (2014) High-throughput sequencing and de novo assembly of Brassica oleracea var. Capitata L. for transcriptome analysis. PLoS One 9(3), e92087   DOI
12 Illumina (http://www.illumina.com/products/truseq_rna_library_prep_kit_v2.html)
13 Jung CS, Lee MH,Oh KW, HK Kim, Park CB, Sung JD, Suh DY. (2005) Discovery of New Diploid Perilla Species in Korea. Korean J. Breed 37(3), 152-154
14 Kanehisa M, Goto S. (2000) KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res 28(1), 27-30   DOI
15 Kim HU, Chen GQ. (2015) Identification of hydroxy fatty acid and triacylglycerol metabolism-related genes in lesquerella through seed transcriptome analysis. BMC Genomics 16, 230   DOI
16 KAPA biosystems (http://www.kapabiosystems.com/productapplications/products/next-generation-sequencing-2/libraryquantification/)
17 Li B, Fillmore N, Bai Y, Collins M, Thomson JA, Stewart R, et al. (2014) Evaluation of de novo transcriptome assemblies from RNA-Seq data. Genome Biol 15(12), 553   DOI
18 Langmead B, Trapnell C, Pop M, Salzberg SL. (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10(3), R25   DOI
19 Lee SC, Lee JK, Kim NH, Park JY, Kim HU, Lee HO, et al. (2014) Analysis of expressed sequence tags from a normalized cDNA library of perilla (Perilla frutescens). Journal of Plant Biology 57(5), 312-320   DOI
20 Marguerat S, Bahler J. (2010) RNA-seq: from technology to biology. Cell Mol Life Sci 67(4), 569-579   DOI
21 Martin JA, Wang Z. (2011) Next-generation transcriptome assembly. Nat Rev Genet 12(10), 671-682   DOI
22 Zerbino DR, Birney E. (2008) Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18(5), 821-829   DOI
23 Ness RW, Siol M, Barrett SC. (2011) De novo sequence assembly and characterization of the floral transcriptome in cross- and self-fertilizing plants. BMC Genomics 12, 298   DOI
24 Schulz MH, Zerbino DR, Vingron M, Birney E. (2012) Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels. Bioinformatics 28(8), 1086-1092   DOI
25 Wang Z, Gerstein M, Snyder M. (2009) RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet 10(1), 57-63   DOI
26 Zhang J, Liang S, Duan J, Wang J, Chen S, Cheng Z, et al. (2012) De novo assembly and characterisation of the transcriptome during seed development, and generation of genic-SSR markers in peanut (Arachis hypogaea L.). BMC Genomics 13, 90   DOI
27 Bumblauskiene L, Jakstas V, Janulis V, Mazdzieriene R, Ragazinskiene O. (2009) Preliminary analysis on essential oil composition of Perilla L. cultivated in Lithuania. Acta Pol Pharm 66(4), 409-413
28 Bates PD, Johnson SR, Cao X, Li J, Nam JW, Jaworski JG, et al. (2014) Fatty acid synthesis is inhibited by inefficient utilization of unusual fatty acids for glycerolipid assembly. Proc Natl Acad Sci USA 111(3), 1204-1209   DOI